Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2781 |
Symbol | gabD |
ID | 6146446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2864889 |
End bp | 2866337 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641617650 |
Product | succinate-semialdehyde dehydrogenase I |
Protein accession | YP_001744810 |
Protein GI | 170679622 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01780] succinate-semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.610238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTA ACGACAGTAA CTTATTCCGC CAGCAGGCGT TGATTAACGG GGAATGGCTG GACGCCAACA ATGGCGAGGT CATCGACGTC ACCAATCCGG CGAACGGCGA CAAGCTGGGT AGCGTACCCA AAATGGGCGC TGATGAAACC CGCGCCGCTA TCGACGCCGC CAACCGCGCT CTGCCCGCCT GGCGTGCGCT CACCGCCAAA GAACGCGCCA ACATTCTGCG CAACTGGTTC AATTTGATGA TGGAGCATCA GGACGATTTA GCGCGCCTGA TGACCCTCGA ACAGGGCAAA CCACTGGCCG AAGCGAAAGG CGAAATCAGC TACGCCGCCT CATTTATTGA GTGGTTTGCT GAAGAAGGCA AACGTATTTA TGGCGACACC ATTCCCGGTC ATCAGGCCGA TAAACGCCTG ATTGTTATCA AGCAGCCGAT TGGCGTCACC GCGGCTATCA CGCCGTGGAA CTTCCCGGCG GCGATGATTA CCCGTAAAGC AGGTCCGGCG CTGGCGGCAG GCTGCACGAT GGTGCTGAAG CCCGCCAGTC AGACGCCGTT CTCTGCGCTG GCGCTGGCAG AACTGGCGAT TCGCGCGGGC GTTCCGGCTG GCGTGTTTAA CGTGGTCACC GGTTCGGCGG GCGCGGTCGG TAACGAACTG ACCAGCAACC CGCTGGTGCG CAAGCTGTCG TTTACCGGCT CGACCGAAAT TGGCCGCCAG TTAATGGAAC AGTGCGCGAA AGACATCAAG AAAGTGTCGC TGGAGCTGGG CGGCAACGCG CCGTTTATCG TCTTTGACGA TGCCGACCTC GACAAAGCCG TGGAAGGCGC ACTGGCCTCG AAATTCCGCA ACGCCGGGCA AACCTGCGTC TGCGCCAACC GTTTATACGT GCAGAACGGC GTGTATGACC GCTTTGCCGA AAAATTGCAG CAGGCGGTGA GCAAACTGCA CATCGGCAAC GGGCTGGAGA AAGGCGTCAC CATCGGGCCG CTGATCGATG AAAAAGCGGT AGCAAAAGTG GAAGAGCATA TTGCCGATGC GCTGGAGAAA GGCGCGCGCG TGGTTTGCGG CGGTAAAGCA CATGAACGCG GCGGTAACTT CTTCCAGCCG ACCATTCTGG TGGACGTTCC GGCTAACGCC AAAGTGTCGA AAGAAGAGAC GTTCGGCCCC CTTGCCCCGC TATTCCGCTT TAAAGATGAA GCCGATGTGA TTGCGCAGGC CAATGACACC GAGTTTGGCC TTGCCGCCTA TTTCTACGCC CGTGATTTAA GCCGCGTCTT CCGCGTGGGC GAAGCGCTGG AGTACGGCAT CGTCGGCATC AATACCGGCA TTATTTCCAA TGAAGTGGCC CCGTTCGGCG GCATTAAAGC CTCGGGTCTG GGTCGTGAAG GTTCGAAGTA TGGCATCGAA GATTACTTAG AAATCAAATA TATGTGCATC GGTCTTTAA
|
Protein sequence | MKLNDSNLFR QQALINGEWL DANNGEVIDV TNPANGDKLG SVPKMGADET RAAIDAANRA LPAWRALTAK ERANILRNWF NLMMEHQDDL ARLMTLEQGK PLAEAKGEIS YAASFIEWFA EEGKRIYGDT IPGHQADKRL IVIKQPIGVT AAITPWNFPA AMITRKAGPA LAAGCTMVLK PASQTPFSAL ALAELAIRAG VPAGVFNVVT GSAGAVGNEL TSNPLVRKLS FTGSTEIGRQ LMEQCAKDIK KVSLELGGNA PFIVFDDADL DKAVEGALAS KFRNAGQTCV CANRLYVQNG VYDRFAEKLQ QAVSKLHIGN GLEKGVTIGP LIDEKAVAKV EEHIADALEK GARVVCGGKA HERGGNFFQP TILVDVPANA KVSKEETFGP LAPLFRFKDE ADVIAQANDT EFGLAAYFYA RDLSRVFRVG EALEYGIVGI NTGIISNEVA PFGGIKASGL GREGSKYGIE DYLEIKYMCI GL
|
| |