Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0617 |
Symbol | nuoH |
ID | 3928020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 622562 |
End bp | 623665 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637901739 |
Product | NADH dehydrogenase subunit H |
Protein accession | YP_507427 |
Protein GI | 88658474 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00942155 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTATT ATAATATTTT TTTGGATTTT TTAAGTTACA TAGGTAGTGG AGCTGCATTA TTTATAAAAA TAATTGCAGT GATAATATCG GTTATGATAT CTGTTGCTTA TTTAGTGTAC ATGGAGCGTA AGGTTATTGC GGCAATACAG TTGAGGCAAG GTCCTAATGT TGTTGGACCA TTTGGTTTAT TGCAACCTTT TGCTGACGCT GTAAAGCTTA TTATTAAGGA ACATATTATT CCTTTTAAAT CAAATAAAAT ATGTTTTTTG ATTGCTCCGA TTATTACTTT TACTTTAGCT TTGTTAGGGT GGGCTGTCAT TCCATTTGGA GCTGATGTAA TTGTTAATGA TGGATATGAA GTGATAATAC CTAATGCTAT TGCAAATATA AATATTGGTG TACTATATAT TTTAGCTATT TCTTCTCTTG GAGTTTATGG AATTATAATA GCTGGATGGT CAAGTAACTC AAATTATGCT TTTTTAGGTG CAATTAGGTC TGCATCACAA ATGATATCAT ATGAAGTATC TATAGGTTTA ACTATAGTTA CAGTGTTATT AGCAACTGGA TCATTAAAAT TAGGTGAGAT AGTTGTAGCA CGGCATAATA TGCCTTACTG GATAGATTTA TTATTATTGC CAATGGCATG TATTTTTTTT ATTTCAGCTT TAGCTGAGAC TAATAGACAT CCTTTCGATC TACCAGAAGC AGAATCAGAG TTAGTATCTG GATATAATGT TGAATACTCA TCTATGCCAT TTGCATTGTT TTTTTTAGGA GAATATGCAA ATATGATATT AATTAATGCA ATGGCAGTAA TATTTTTCTT TGGTGGGTGG TATCCACCTT TAAATATTGG CTTCTTATAT ATAATTCCTG GGATTGTATG GTTTGTATTG AAAGTGGTAG CATTGTTATT TTGCTTTATA TGGATTCGTG CTACTATCCC TAGGTACCGT TATGATCAGC TAATGGGGTT AGGATGGAAA GTATTTTTAC CTATATCTTT GCTATGGGTA GTATTAGTTT CGAGTATTCT AGTATATACA GATTCATTGC CTAGTAATAA TAAGCAATAT GTATCACATG CTATGCATAA ATAG
|
Protein sequence | MSYYNIFLDF LSYIGSGAAL FIKIIAVIIS VMISVAYLVY MERKVIAAIQ LRQGPNVVGP FGLLQPFADA VKLIIKEHII PFKSNKICFL IAPIITFTLA LLGWAVIPFG ADVIVNDGYE VIIPNAIANI NIGVLYILAI SSLGVYGIII AGWSSNSNYA FLGAIRSASQ MISYEVSIGL TIVTVLLATG SLKLGEIVVA RHNMPYWIDL LLLPMACIFF ISALAETNRH PFDLPEAESE LVSGYNVEYS SMPFALFFLG EYANMILINA MAVIFFFGGW YPPLNIGFLY IIPGIVWFVL KVVALLFCFI WIRATIPRYR YDQLMGLGWK VFLPISLLWV VLVSSILVYT DSLPSNNKQY VSHAMHK
|
| |