Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2017 |
Symbol | ndh |
ID | 6143761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2039138 |
End bp | 2040442 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616893 |
Product | NADH dehydrogenase |
Protein accession | YP_001744069 |
Protein GI | 170683647 |
COG category | [C] Energy production and conversion |
COG ID | [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0000943189 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGACTACGC CATTGAAAAA AATTGTGATT GTCGGCGGCG GTGCTGGTGG GCTGGAAATG GCGACACAGC TGGGACATAA GCAGGGACGC AAGAAAAAAG CCAAAATTAC GCTGGTCGAT CGTAACCACA GCCACTTGTG GAAACCGCTG CTGCACGAAG TGGCGACTGG CTCGCTTGAT GAAGGCGTCG ATGCGTTGAG CTATCTGGCC CATGCGCGCA ATCATGGTTT CCAGTTCCAG CTGGGTTCCG TCATTGATAT CGATCGTGAA GCGAAAACAA TCACTATTGC AGAACTGCGC GACGAAAAAG GTGAATTGCT GGTTCCGGAA CGTAAAATCG CCTATGACAC CCTGGTAATG GCGCTGGGTA GCACCTCTAA CGATTTCAAT ACGCCAGGTG TCAAAGAGAA CTGCATTTTC CTCGATAACC CGCACCAGGC GCGTCGCTTC CACCAGGAGA TGCTGAATCT GTTTCTGAAA TACTCCGCCA ACCTGGGCGC GAATGGCAAA GTGAACATTG CGATTGTCGG CGGCGGCGCG ACGGGTGTAG AACTCTCCGC TGAATTGCAC AACGCGGTCA AGCAACTGCA CAGCTACGGT TATAAAGGTC TGACCAACGA AGCCCTGAAC GTAACGCTGG TAGAAGCGGG CGAACGTATT TTGCCTGCAT TGCCGCCACG TATCTCTGCT GCGGCCCATA GTGAGTTAAC GAAACTTGGC GTTCGCGTGC TGACGCAAAC CATGGTCACC AGTGCTGATG AAGGCGGCCT GCATACTAAA GATGGCGAAT ATATTGAGGC TGATCTGATG GTGTGGGCAG CCGGGATCAA AGCGCCAGAC TTCCTGAAAG ATATCGGTGG TCTTGAAACC AACCGTATCA ACCAGCTGGT GGTGGAACCG ACGCTGCAAA CCACCCGCGA TCCAGACATT TACGCTATTG GCGATTGCGC GTCATGCCCG CGTCCGGAAG GGGGCTTTGT TCCGCCGCGC GCTCAGGCTG CACACCAGAT GGCAACTTGC GCAATGAACA ACATTCTGGC GCAGATGAAC GGTAAGCCGC TGAAAAGTTA TCAGTATAAA GATCACGGTT CTCTGGTATC GCTGTCGAAC TTCTCCACCG TCGGTAGCCT GATGGGTAAC CTGACGCGCG GCTCAATGAT GATTGAAGGA CGAATTGCGC GCTTTGTATA CATCTCGCTA TACAGAATGC ATCAGATTGC GCTGCATGGT TACTTTAAAA CCGGATTAAT GATGCTGGTG GGGAGTATTA ACCGCGTTAT CCGTCCACGT TTGAAGTTGC ATTAA
|
Protein sequence | MTTPLKKIVI VGGGAGGLEM ATQLGHKQGR KKKAKITLVD RNHSHLWKPL LHEVATGSLD EGVDALSYLA HARNHGFQFQ LGSVIDIDRE AKTITIAELR DEKGELLVPE RKIAYDTLVM ALGSTSNDFN TPGVKENCIF LDNPHQARRF HQEMLNLFLK YSANLGANGK VNIAIVGGGA TGVELSAELH NAVKQLHSYG YKGLTNEALN VTLVEAGERI LPALPPRISA AAHSELTKLG VRVLTQTMVT SADEGGLHTK DGEYIEADLM VWAAGIKAPD FLKDIGGLET NRINQLVVEP TLQTTRDPDI YAIGDCASCP RPEGGFVPPR AQAAHQMATC AMNNILAQMN GKPLKSYQYK DHGSLVSLSN FSTVGSLMGN LTRGSMMIEG RIARFVYISL YRMHQIALHG YFKTGLMMLV GSINRVIRPR LKLH
|
| |