Gene EcHS_A1232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1232 
Symbolndh 
ID5595181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1229773 
End bp1231077 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content53% 
IMG OID640920392 
ProductNADH dehydrogenase 
Protein accessionYP_001457954 
Protein GI157160636 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACTACGC CATTGAAAAA AATTGTGATT GTCGGCGGCG GTGCTGGTGG GCTGGAAATG 
GCAACACAGC TGGGGCATAA GCTGGGACGC AAGAAAAAAG CCAAAATTAC GCTGGTCGAT
CGTAACCACA GCCACCTGTG GAAACCGCTG CTGCACGAAG TGGCGACTGG CTCGCTTGAT
GAAGGCGTCG ATGCGTTGAG CTATCTGGCC CATGCGCGCA ATCATGGTTT CCAGTTCCAG
CTGGGTTCCG TCATTGATAT TGATCGTGAA GCGAAAACAA TCACTATTGC AGAACTGCGC
GATGAGAAAG GTGAACTGCT GGTTCCGGAA CGTAAAATCG CCTATGACAC CCTGGTAATG
GCGCTGGGTA GCACCTCTAA CGATTTCAAT ACGCCGGGTG TCAAAGAGAA CTGCATTTTC
CTCGATAACC CGCACCAGGC GCGTCGCTTT CACCAGGAGA TGCTGAATCT CTTCCTGAAA
TACTCCGCCA ACCTGGGCGC GAATGGCAAA GTGAACATTG CGATTGTCGG CGGCGGCGCG
ACGGGTGTAG AACTCTCCGC TGAATTGCAC AACGCGGTCA AGCAACTGCA CAGCTACGGT
TACAAAGGCC TGACCAACGA AGCCCTGAAC GTAACGCTGG TAGAAGCGGG AGAACGTATT
TTGCCTGCGT TACCGCCACG TATCTCTGCT GCGGCCCACA ACGAGCTAAC GAAACTTGGC
GTTCGCGTGC TGACGCAAAC CATGGTCACC AGTGCTGATG AAGGCGGCCT GCACACTAAA
GATGGCGAAT ATATTGAGGC GGATCTGATG GTATGGGCAG CCGGGATCAA AGCGCCAGAC
TTCCTGAAAG ATATCGGTGG TCTTGAAACT AACCGTATCA ACCAGCTGGT GGTGGAACCG
ACGCTGCAAA CCACTCGCGA TCCAGACATT TACGCTATTG GCGACTGCGC GTCATGCCCG
CGTCCGGAAG GGGGCTTTGT TCCGCCGCGT GCTCAGGCTG CACACCAGAT GGCGACTTGC
GCAATGAACA ACATTCTGGC GCAGATGAAC GGTAAACCGC TGAAAAATTA TCAGTATAAA
GATCATGGTT CGCTGGTATC GCTGTCGAAC TTCTCCACCG TTGGTAGCCT GATGGGTAAC
CTGACGCGCG GCTCAATGAT GATTGAAGGA CGAATTGCGC GCTTTGTATA TATCTCGCTA
TACCGAATGC ATCAGATTGC GCTGCATGGT TACTTTAAAA CCGGATTAAT GATGCTGGTG
GGGAGTATTA ACCGCGTTAT CCGTCCGCGT TTGAAGTTGC ATTAA
 
Protein sequence
MTTPLKKIVI VGGGAGGLEM ATQLGHKLGR KKKAKITLVD RNHSHLWKPL LHEVATGSLD 
EGVDALSYLA HARNHGFQFQ LGSVIDIDRE AKTITIAELR DEKGELLVPE RKIAYDTLVM
ALGSTSNDFN TPGVKENCIF LDNPHQARRF HQEMLNLFLK YSANLGANGK VNIAIVGGGA
TGVELSAELH NAVKQLHSYG YKGLTNEALN VTLVEAGERI LPALPPRISA AAHNELTKLG
VRVLTQTMVT SADEGGLHTK DGEYIEADLM VWAAGIKAPD FLKDIGGLET NRINQLVVEP
TLQTTRDPDI YAIGDCASCP RPEGGFVPPR AQAAHQMATC AMNNILAQMN GKPLKNYQYK
DHGSLVSLSN FSTVGSLMGN LTRGSMMIEG RIARFVYISL YRMHQIALHG YFKTGLMMLV
GSINRVIRPR LKLH