Gene ECH74115_1489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1489 
Symbolndh 
ID6967778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1468783 
End bp1470087 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content52% 
IMG OID643385460 
ProductNADH dehydrogenase 
Protein accessionYP_002269954 
Protein GI209398186 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.00503267 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGACTACGC CATTGAAAAA AATTGTGATT GTCGGCGGCG GTGCTGGTGG GCTGGAAATG 
GCAACACAGC TGGGGCATAA GCTGGGACGC AAGAAAAAAG CCAAAATTAC GCTGGTCGAT
CGTAACCACA GCCACCTGTG GAAACCGCTG CTGCACGAAG TAGCGACTGG CTCGCTTGAT
GAAGGCGTCG ATGCGTTGAG CTATCTGGCC CATGCGCGCA ATCATGGTTT CCAGTTCCAG
CTGGGTTCCG TCATTGATAT TGATCGTGAC GCGAAAACAA TCACTATTGC AGAACTGCGC
GATGAGAAAG GTGAACTGCT GGTTCCGGAA CGTAAAATCG CCTATGACAC CCTGGTAATG
GCGCTGGGTA GCACCTCTAA CGATTTCAAT ACACCAGGTG TCAAAGAGAA CTGCATTTTC
CTCGATAACC CGCACCAGGC GCGTCGCTTT CACCAGGAGA TGCTGAATCT CTTCCTGAAA
TACTCCGCCA ACCTGGGCGC AAATGGCAAA GTGAACATTG CGATTGTCGG CGGCGGCGCG
ACGGGTGTAG AACTCTCCGC TGAATTGCAC AACGCGGTCA AGCAACTGCA CAGCTACGGT
TACAAAGGCC TGACCAACGA AGCCCTGAAC GTAACGCTGG TAGAAGCGGG AGAACGTATT
TTGCCTGCAT TACCGCCACG TATCTCTGCT GCGGCCCACA ACGAGCTAAC GAAACTTGGC
GTTCGCGTTC TGACGCAAAC CATGGTCACC AGTGCTGATG AAGGCGGCCT GCACACTAAA
GATGGCGAAT ATATTGAGGC TGATCTGATG GTATGGGCAG CCGGGATCAA AGCGCCAGAC
TTCCTGAAAG ATATCGGTGG TCTTGAAACT AACCGTATCA ACCAGCTGGT GGTGGAACTG
ACGCTGCAAA CCACCCGCGA TCCAGACATT TACGCTATTG GCGACTGCGC GTCATGCCCG
CGTCCGGAAG GGGGCTTTGT TCCGCCGCGT GCTCAGGCTG CACACCAGAT GGCGACTTGC
GCAATGAACA ACATTCTGGC GCAGATGAAC GGTAAGCTGC TGAAAAATTA TCAGTATAAA
GATCATGGTT CGCTGGTATC GCTGTCGAAC TTCTCCACCG TTGGTAGCCT GATGGGTAAC
CTGACGCGCG GCTCAATGAT GATTGAAGGA CGAATTGCGC GCTTTGTATA TATCTCGCTA
TACCGAATGC ATCAGATTGC GCTGCATGGT TACTTTAAAA CCGGATTAAT GATGCTGGTG
GGGAGTATTA ACCGCGTTAT CCGTCCGCGT TTGAAGTTGC ATTAA
 
Protein sequence
MTTPLKKIVI VGGGAGGLEM ATQLGHKLGR KKKAKITLVD RNHSHLWKPL LHEVATGSLD 
EGVDALSYLA HARNHGFQFQ LGSVIDIDRD AKTITIAELR DEKGELLVPE RKIAYDTLVM
ALGSTSNDFN TPGVKENCIF LDNPHQARRF HQEMLNLFLK YSANLGANGK VNIAIVGGGA
TGVELSAELH NAVKQLHSYG YKGLTNEALN VTLVEAGERI LPALPPRISA AAHNELTKLG
VRVLTQTMVT SADEGGLHTK DGEYIEADLM VWAAGIKAPD FLKDIGGLET NRINQLVVEL
TLQTTRDPDI YAIGDCASCP RPEGGFVPPR AQAAHQMATC AMNNILAQMN GKLLKNYQYK
DHGSLVSLSN FSTVGSLMGN LTRGSMMIEG RIARFVYISL YRMHQIALHG YFKTGLMMLV
GSINRVIRPR LKLH