Gene EcSMS35_2781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2781 
SymbolgabD 
ID6146446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2864889 
End bp2866337 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content57% 
IMG OID641617650 
Productsuccinate-semialdehyde dehydrogenase I 
Protein accessionYP_001744810 
Protein GI170679622 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.610238 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTA ACGACAGTAA CTTATTCCGC CAGCAGGCGT TGATTAACGG GGAATGGCTG 
GACGCCAACA ATGGCGAGGT CATCGACGTC ACCAATCCGG CGAACGGCGA CAAGCTGGGT
AGCGTACCCA AAATGGGCGC TGATGAAACC CGCGCCGCTA TCGACGCCGC CAACCGCGCT
CTGCCCGCCT GGCGTGCGCT CACCGCCAAA GAACGCGCCA ACATTCTGCG CAACTGGTTC
AATTTGATGA TGGAGCATCA GGACGATTTA GCGCGCCTGA TGACCCTCGA ACAGGGCAAA
CCACTGGCCG AAGCGAAAGG CGAAATCAGC TACGCCGCCT CATTTATTGA GTGGTTTGCT
GAAGAAGGCA AACGTATTTA TGGCGACACC ATTCCCGGTC ATCAGGCCGA TAAACGCCTG
ATTGTTATCA AGCAGCCGAT TGGCGTCACC GCGGCTATCA CGCCGTGGAA CTTCCCGGCG
GCGATGATTA CCCGTAAAGC AGGTCCGGCG CTGGCGGCAG GCTGCACGAT GGTGCTGAAG
CCCGCCAGTC AGACGCCGTT CTCTGCGCTG GCGCTGGCAG AACTGGCGAT TCGCGCGGGC
GTTCCGGCTG GCGTGTTTAA CGTGGTCACC GGTTCGGCGG GCGCGGTCGG TAACGAACTG
ACCAGCAACC CGCTGGTGCG CAAGCTGTCG TTTACCGGCT CGACCGAAAT TGGCCGCCAG
TTAATGGAAC AGTGCGCGAA AGACATCAAG AAAGTGTCGC TGGAGCTGGG CGGCAACGCG
CCGTTTATCG TCTTTGACGA TGCCGACCTC GACAAAGCCG TGGAAGGCGC ACTGGCCTCG
AAATTCCGCA ACGCCGGGCA AACCTGCGTC TGCGCCAACC GTTTATACGT GCAGAACGGC
GTGTATGACC GCTTTGCCGA AAAATTGCAG CAGGCGGTGA GCAAACTGCA CATCGGCAAC
GGGCTGGAGA AAGGCGTCAC CATCGGGCCG CTGATCGATG AAAAAGCGGT AGCAAAAGTG
GAAGAGCATA TTGCCGATGC GCTGGAGAAA GGCGCGCGCG TGGTTTGCGG CGGTAAAGCA
CATGAACGCG GCGGTAACTT CTTCCAGCCG ACCATTCTGG TGGACGTTCC GGCTAACGCC
AAAGTGTCGA AAGAAGAGAC GTTCGGCCCC CTTGCCCCGC TATTCCGCTT TAAAGATGAA
GCCGATGTGA TTGCGCAGGC CAATGACACC GAGTTTGGCC TTGCCGCCTA TTTCTACGCC
CGTGATTTAA GCCGCGTCTT CCGCGTGGGC GAAGCGCTGG AGTACGGCAT CGTCGGCATC
AATACCGGCA TTATTTCCAA TGAAGTGGCC CCGTTCGGCG GCATTAAAGC CTCGGGTCTG
GGTCGTGAAG GTTCGAAGTA TGGCATCGAA GATTACTTAG AAATCAAATA TATGTGCATC
GGTCTTTAA
 
Protein sequence
MKLNDSNLFR QQALINGEWL DANNGEVIDV TNPANGDKLG SVPKMGADET RAAIDAANRA 
LPAWRALTAK ERANILRNWF NLMMEHQDDL ARLMTLEQGK PLAEAKGEIS YAASFIEWFA
EEGKRIYGDT IPGHQADKRL IVIKQPIGVT AAITPWNFPA AMITRKAGPA LAAGCTMVLK
PASQTPFSAL ALAELAIRAG VPAGVFNVVT GSAGAVGNEL TSNPLVRKLS FTGSTEIGRQ
LMEQCAKDIK KVSLELGGNA PFIVFDDADL DKAVEGALAS KFRNAGQTCV CANRLYVQNG
VYDRFAEKLQ QAVSKLHIGN GLEKGVTIGP LIDEKAVAKV EEHIADALEK GARVVCGGKA
HERGGNFFQP TILVDVPANA KVSKEETFGP LAPLFRFKDE ADVIAQANDT EFGLAAYFYA
RDLSRVFRVG EALEYGIVGI NTGIISNEVA PFGGIKASGL GREGSKYGIE DYLEIKYMCI
GL