Gene ECH_0617 details


Gene Information

Locus tag: ECH_0617
Symbol: nuoH
ID: 3928020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 622562
End bp: 623665
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 31%
IMG OID: 637901739
Product: NADH dehydrogenase subunit H
Protein accession: YP_507427
Protein GI: 88658474
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
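
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the constant and function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (consistent with "Is gene spliced: No").

```python
# Values copied from the record; the names are illustrative, not a database API.
START_BP = 622562
END_BP = 623665
GENE_LENGTH_BP = 1104
PROTEIN_LENGTH_AA = 367


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))
    print(expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions, the span computed from the coordinates equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.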


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00942155
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTATT ATAATATTTT TTTGGATTTT TTAAGTTACA TAGGTAGTGG AGCTGCATTA 
TTTATAAAAA TAATTGCAGT GATAATATCG GTTATGATAT CTGTTGCTTA TTTAGTGTAC
ATGGAGCGTA AGGTTATTGC GGCAATACAG TTGAGGCAAG GTCCTAATGT TGTTGGACCA
TTTGGTTTAT TGCAACCTTT TGCTGACGCT GTAAAGCTTA TTATTAAGGA ACATATTATT
CCTTTTAAAT CAAATAAAAT ATGTTTTTTG ATTGCTCCGA TTATTACTTT TACTTTAGCT
TTGTTAGGGT GGGCTGTCAT TCCATTTGGA GCTGATGTAA TTGTTAATGA TGGATATGAA
GTGATAATAC CTAATGCTAT TGCAAATATA AATATTGGTG TACTATATAT TTTAGCTATT
TCTTCTCTTG GAGTTTATGG AATTATAATA GCTGGATGGT CAAGTAACTC AAATTATGCT
TTTTTAGGTG CAATTAGGTC TGCATCACAA ATGATATCAT ATGAAGTATC TATAGGTTTA
ACTATAGTTA CAGTGTTATT AGCAACTGGA TCATTAAAAT TAGGTGAGAT AGTTGTAGCA
CGGCATAATA TGCCTTACTG GATAGATTTA TTATTATTGC CAATGGCATG TATTTTTTTT
ATTTCAGCTT TAGCTGAGAC TAATAGACAT CCTTTCGATC TACCAGAAGC AGAATCAGAG
TTAGTATCTG GATATAATGT TGAATACTCA TCTATGCCAT TTGCATTGTT TTTTTTAGGA
GAATATGCAA ATATGATATT AATTAATGCA ATGGCAGTAA TATTTTTCTT TGGTGGGTGG
TATCCACCTT TAAATATTGG CTTCTTATAT ATAATTCCTG GGATTGTATG GTTTGTATTG
AAAGTGGTAG CATTGTTATT TTGCTTTATA TGGATTCGTG CTACTATCCC TAGGTACCGT
TATGATCAGC TAATGGGGTT AGGATGGAAA GTATTTTTAC CTATATCTTT GCTATGGGTA
GTATTAGTTT CGAGTATTCT AGTATATACA GATTCATTGC CTAGTAATAA TAAGCAATAT
GTATCACATG CTATGCATAA ATAG
 
Protein sequence
MSYYNIFLDF LSYIGSGAAL FIKIIAVIIS VMISVAYLVY MERKVIAAIQ LRQGPNVVGP 
FGLLQPFADA VKLIIKEHII PFKSNKICFL IAPIITFTLA LLGWAVIPFG ADVIVNDGYE
VIIPNAIANI NIGVLYILAI SSLGVYGIII AGWSSNSNYA FLGAIRSASQ MISYEVSIGL
TIVTVLLATG SLKLGEIVVA RHNMPYWIDL LLLPMACIFF ISALAETNRH PFDLPEAESE
LVSGYNVEYS SMPFALFFLG EYANMILINA MAVIFFFGGW YPPLNIGFLY IIPGIVWFVL
KVVALLFCFI WIRATIPRYR YDQLMGLGWK VFLPISLLWV VLVSSILVYT DSLPSNNKQY
VSHAMHK
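
The gene and protein sequences above are related through translation table 11, which uses the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons. The sketch below is a minimal, hypothetical helper, not the database's own tooling. It strips the spacing used in the 10-base display blocks, translates a coding sequence until the first stop codon, and computes a GC fraction. The function names are assumptions made for illustration.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines of the display format and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three display blocks of the gene sequence in this record.
    print(translate("ATGAGTTATT ATAATATTTT TTTGGATTTT"))
```

Applied to the first three blocks of the gene sequence, `translate` yields the first ten residues of the listed protein sequence. The same function applied to the final blocks ends at the TAG stop codon.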