Gene EcSMS35_2017 details


Gene Information

Locus tag: EcSMS35_2017
Symbol: ndh
ID: 6143761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2039138
End bp: 2040442
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 52%
IMG OID: 641616893
Product: NADH dehydrogenase
Protein accession: YP_001744069
Protein GI: 170683647
COG category: [C] Energy production and conversion
COG ID: [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID:
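
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record states neither convention, so both are assumptions.

```python
# Values copied from the gene record.
start_bp = 2039138
end_bp = 2040442

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1305 434
```

Under these assumptions the computed values match the listed Gene Length (1305 bp) and Protein Length (434 aa).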


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0000943189
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGACTACGC CATTGAAAAA AATTGTGATT GTCGGCGGCG GTGCTGGTGG GCTGGAAATG 
GCGACACAGC TGGGACATAA GCAGGGACGC AAGAAAAAAG CCAAAATTAC GCTGGTCGAT
CGTAACCACA GCCACTTGTG GAAACCGCTG CTGCACGAAG TGGCGACTGG CTCGCTTGAT
GAAGGCGTCG ATGCGTTGAG CTATCTGGCC CATGCGCGCA ATCATGGTTT CCAGTTCCAG
CTGGGTTCCG TCATTGATAT CGATCGTGAA GCGAAAACAA TCACTATTGC AGAACTGCGC
GACGAAAAAG GTGAATTGCT GGTTCCGGAA CGTAAAATCG CCTATGACAC CCTGGTAATG
GCGCTGGGTA GCACCTCTAA CGATTTCAAT ACGCCAGGTG TCAAAGAGAA CTGCATTTTC
CTCGATAACC CGCACCAGGC GCGTCGCTTC CACCAGGAGA TGCTGAATCT GTTTCTGAAA
TACTCCGCCA ACCTGGGCGC GAATGGCAAA GTGAACATTG CGATTGTCGG CGGCGGCGCG
ACGGGTGTAG AACTCTCCGC TGAATTGCAC AACGCGGTCA AGCAACTGCA CAGCTACGGT
TATAAAGGTC TGACCAACGA AGCCCTGAAC GTAACGCTGG TAGAAGCGGG CGAACGTATT
TTGCCTGCAT TGCCGCCACG TATCTCTGCT GCGGCCCATA GTGAGTTAAC GAAACTTGGC
GTTCGCGTGC TGACGCAAAC CATGGTCACC AGTGCTGATG AAGGCGGCCT GCATACTAAA
GATGGCGAAT ATATTGAGGC TGATCTGATG GTGTGGGCAG CCGGGATCAA AGCGCCAGAC
TTCCTGAAAG ATATCGGTGG TCTTGAAACC AACCGTATCA ACCAGCTGGT GGTGGAACCG
ACGCTGCAAA CCACCCGCGA TCCAGACATT TACGCTATTG GCGATTGCGC GTCATGCCCG
CGTCCGGAAG GGGGCTTTGT TCCGCCGCGC GCTCAGGCTG CACACCAGAT GGCAACTTGC
GCAATGAACA ACATTCTGGC GCAGATGAAC GGTAAGCCGC TGAAAAGTTA TCAGTATAAA
GATCACGGTT CTCTGGTATC GCTGTCGAAC TTCTCCACCG TCGGTAGCCT GATGGGTAAC
CTGACGCGCG GCTCAATGAT GATTGAAGGA CGAATTGCGC GCTTTGTATA CATCTCGCTA
TACAGAATGC ATCAGATTGC GCTGCATGGT TACTTTAAAA CCGGATTAAT GATGCTGGTG
GGGAGTATTA ACCGCGTTAT CCGTCCACGT TTGAAGTTGC ATTAA
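
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before computing its length or GC content. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as example input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used as a short example input.
first_line = "TTGACTACGC CATTGAAAAA AATTGTGATT GTCGGCGGCG GTGCTGGTGG GCTGGAAATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions should give values comparable to the Gene Length and GC content fields of the record.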
 
Protein sequence
MTTPLKKIVI VGGGAGGLEM ATQLGHKQGR KKKAKITLVD RNHSHLWKPL LHEVATGSLD 
EGVDALSYLA HARNHGFQFQ LGSVIDIDRE AKTITIAELR DEKGELLVPE RKIAYDTLVM
ALGSTSNDFN TPGVKENCIF LDNPHQARRF HQEMLNLFLK YSANLGANGK VNIAIVGGGA
TGVELSAELH NAVKQLHSYG YKGLTNEALN VTLVEAGERI LPALPPRISA AAHSELTKLG
VRVLTQTMVT SADEGGLHTK DGEYIEADLM VWAAGIKAPD FLKDIGGLET NRINQLVVEP
TLQTTRDPDI YAIGDCASCP RPEGGFVPPR AQAAHQMATC AMNNILAQMN GKPLKSYQYK
DHGSLVSLSN FSTVGSLMGN LTRGSMMIEG RIARFVYISL YRMHQIALHG YFKTGLMMLV
GSINRVIRPR LKLH
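
The protein sequence begins with M even though the gene sequence begins with TTG, which normally encodes leucine. This is consistent with translation table 11, where TTG can act as a start codon and is then read as methionine. The sketch below illustrates the idea with a hand-built codon table; `translate_cds` is a hypothetical helper, and handling only ATG, GTG, and TTG as start codons is a simplifying assumption rather than the full table 11 definition.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these three start codons are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a start codon at position 0 as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate_cds("TTGACTACGC CATTGAAAAA A"))  # MTTPLKK
```

The output matches the first seven residues of the protein sequence, and the internal TTG (the fifth codon) is still read as leucine.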