so, I have a weird problem with a Dell Latitude 5285, that's a 2-in-1 with a detachable keyboard akin to the MS Surface Pro 5. it has an i5-7300u, 16 GB LPDDR3 (on-board), 500 GB NVMe, 12.3" 1920x1280 3:2 touch screen.

I got it second-hand, unknown history, without a battery. they're stuck at 400 MHz without one, but Thottlestop in Windows and msr-tools in Linux fix the BD_PROCHOT throttling and the machine performed adequately for months.

I've sourced a replacement battery, removed the patch and my problems started. there's weird screen flickering, looks like bad video ram or a flaky connection. it's intermittent, sometimes it runs without issues for hours, sometimes minutes and sometimes it flickers from the start, so troubleshooting and checking if this or that fixed things takes days.

the artefacts are inconsistent with anything that is or isn't happening (load, temps, etc) or power source. the problem is mostly exacerbated when the battery is full and/or when waking from sleep, it's almost always super glitchy then.

here's a demonstration:

would be great if I could try a different battery or try this one in another device, but don't have that option.

at no point are there ANY glitches on the external display (tried DP-Alt over USB Type-C and HDMI over Dell WD19 Dock), regardless if the internal screen is enabled or not.

so, bad luck - faulty screen or backlight or RAM or something, right?

except, when I unplug the battery (but leave it in place) and connect it to power and reenable the BD_PROCHOT patch - zero glitches! it runs for hours - videos, GPU and CPU stress test, not one hiccup, tear, nothing!

if it were a normal laptop, I'd just leave it be and use it as a desktop. it feels like such a waste with the functional touchscreen though.

what I've tried:

  • different USB Type-C chargers
  • fresh paste on CPU, clean vent
  • latest firmware, tried downgrading, no change
  • memtest passed twice on thorough, all clear
  • internal diagnostics also
  • it never froze or crashed
  • screenshot during glitches doesn't contain them
  • disabling turbo, upping/lowering the max/boost GPU clock, forcing cores offline, limiting max frequencies with TLP
  • the battery isn't deformed and doesn't exert pressure on the screen or any cables; also tried running it with the screen slightly lifted from the case, no change
  • pressing, jerking, wiggling of the internal display cable/connector, no change
  • same issues in Windows 11, Ubuntu 23.04 and Fedora WS 38; rarely but sometimes in BIOS/during boot
  • sadly, can't undervolt the CPU/GPU (Throttlestop FIVER says it's locked) but some MSR writes are apparently OK (like disabling BD_PROCHOT works).

at some point, it had both charger and dock with PD attached at the same time to both USB Type-C ports; it's possible this fried something, although I have no evidence of that.

so, I'm sure this is NOT a linux hardware problem, but I would like to use linux to fix the problem. at this point, I am sure it's defective, whether it's age or physical or manufacturing defect or whatever; but since it definitely works perfectly without the battery, I'm looking for some tweaks that makes it perform with the battery the same as without it.

seriously doubt anyone's seen anything similar but are there any ideas what to look at? what to try?

edit: I'm not asking for free hardware troubleshooting, maybe I haven't expressed myself succintly. what I'd like is some sort of snapshot of all relevant registers with battery working. and then one without. and then have somehow the difference between those two computed, so I can see which setting I need to tweak. would this be doable?

  • dutchkimble@lemy.lol
    ·
    6 months ago

    Firstly, hats off to you for trying to properly diagnose the problem and trying everything that you did. Hope you find the solution soon. Some random suggestions if you haven't already tried - clean the battery contacts (I'm not sure of the best method to do so but I'm sure you can find something online), check to see if the problem exists in different screen refresh rates, turn off auto brightness if its on.

    This is something to try but not sure how to do this in Linux - https://www.dell.com/support/kbdoc/en-us/000152765/why-does-the-screen-flicker-while-running-on-battery

    All the best

    • dingdongitsabear@lemmy.ml
      hexagon
      ·
      6 months ago

      nah, tried that when I had windows on it. that and a bunch of other stuff from the unhelpfulest site on the webz - dell.com. screen rates and resolutions and auto brightness as well. the battery contacts are way too tiny for me to do anything meaningful there. besides, I'm thinking that if the battery is the problem, then there shouldn't be any issues when running the thing on external power; it's not like the battery is powering the laptop when connected to external power, it's running on external power and using the surplus to charge the battery.

    • dingdongitsabear@lemmy.ml
      hexagon
      ·
      edit-2
      6 months ago

      tried a bunch of those, did modinfo i915 and then tried all that looked power management related, and in both directions. here's one of those attempts:

      Show

      just wish I could somehow do a snapshot of all kernel and other settings when with and without battery and deduce the difference, but I'm coming up short.

      I'll try to rephrase this and post again, as people assume I'm looking for hardware troubleshooting help.

      • jwt@programming.dev
        ·
        6 months ago

        I deleted my suggestions as I saw I overlooked the part where you state the problem occurs in bios and Windows too. Good luck, this sounds like a tough nut to crack. I get the feeling it's more like some component is/has gone faulty or something.

        • dingdongitsabear@lemmy.ml
          hexagon
          ·
          6 months ago

          it is, that's why I'm looking to linux to overcome the fault; like GRUB can cut out a piece of faulty RAM and work without issues.

  • SteveTech@programming.dev
    ·
    6 months ago

    I do kinda agree with the others that this is a power issue, but I was thinking it wouldn't harm to run a memtest, maybe whatever part of RAM the iGPU is mapped to is dying or something like that.

  • SterbenDeathGun@lemmy.ml
    ·
    6 months ago

    I think everything goes against the battery? Did you try to recalibrate it? Discharge the battery completely, and then go into the BIOS and wait until it turns off. Now charge it for a couple of hours while it stills off.

    I don't think it is gonna fix anything, because it seems like a battery problem. Maybe try to get one from iFixIt, I had bad experiences with batteries from Amazon (if you got it from them).

  • Romkslrqusz@lemm.ee
    ·
    edit-2
    6 months ago

    Awesome breakdown and troubleshooting so far!

    I wonder if the previous owner removed the battery because of this issue in the first place.

    The fact that the flickering is full-width bands that don’t appear in screenshots indicates to me that this is a signal issue to or through the display.

    An important variable to pay attention to and experiment with is the display’s refresh rate. It’s possible that is what is changing with and without the battery, though you most likely would have noticed if that were the case.

    Since the problem varies based on battery presence, it would be appropriate to source a replacement battery - especially if you purchased a cheap aftermarket battery. The real deal for your system is available for $80USD from Parts People compared to $20-$40USD for low quality Amazon junk.

    After the battery, my main suspicion is a fault on the mainboard leaking voltage from the battery circuit and affecting the display signals. Even without the infrequency of the problem that would be tricky to isolate and remedy.

    Overall, this screams hardware issue and I don’t believe you will find a software trace of it. The problem is not visible in screenshots, so the software environment does not know that it exists.