Gene ECH74115_3903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3903 
SymbolgabD 
ID6971660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3617696 
End bp3619144 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content57% 
IMG OID643387677 
Productsuccinate-semialdehyde dehydrogenase I 
Protein accessionYP_002272125 
Protein GI209399689 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.85959 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGA ACGACAGTAA CTTATTCCGC CAGCAGGCGT TGATTAACGG GGAATGGCTG 
GACGCCAACA ATGGTGAAGT CATCGACGTC ACCAATCCGG CGAACGGCGA CAAGCTGGGT
AGCGTGCCGA AAATGGGCGC TGATGAAACT CGCGCCGCTA TCGACGCCGC CAACCGCGCC
CTGCCCGTCT GGCGTGCGCT CACCGCCAAA GAACGCGCCA ACATTCTGCG CAACTGGTTC
AATTTGATGA TGGAGCATCA GGACGATTTA GCGCGCCTGA TGACCCTCGA ACAGGGTAAA
CCGCTGGCTG AAGCGAAAGG CGAAATCAGC TACGCCGCCT CTTTTATTGA GTGGTTTGCC
GAAGAAGGCA AACGCATTTA TGGCGACACC ATTCCCGGTC ATCAGGCCGA TAAGCGCCTG
ATTGTGATCA AGCAGCCGAT TGGCGTTACA GCGGCCATCA CGCCGTGGAA CTTTCCGGCG
GCAATGATTA CCCGCAAAGC CGGTCCGGCG CTGGCGGCAG GCTGCACCAT GGTGCTGAAG
CCCGCCAGTC AGACGCCGTT CTCTGCGCTG GCGCTGGCGG AGCTGGCGAT CCGCGCGGGC
ATTCCGGCTG GCGTGTTTAA CGTGGTCACC GGTTCGGCGG GCGCGGTGGG TAATGAACTG
ACCAGCAACC CGCTGGTACG CAAACTGTCG TTTACCGGCT CGACCGAAAT TGGCCGCCAG
TTAATGGAAC AGTGCGCGAA AGACATCAAA AAAGTGTCGC TGGAGCTGGG CGGCAACGCG
CCGTTTATCG TCTTTGACGA TGCCGATCTC GATAAAGCCG TGGAAGGCGC ACTGGCCTCG
AAATTCCGCA ACGCCGGGCA AACCTGCGTC TGCGCCAACC GTTTATACGT GCAGGACGGC
GTGTATGACC GCTTTGCCGA AAAATTGCAG CAGGCGGTGA GCAAACTGCA TATCGGCGAC
GGGCTGGATA AAGGCGTCAC CATCGGGCCG CTGATCGATG AAAAAGCGGT AGCAAAAGTG
GAAGAGCATA TTGCCGATGC GCTGGAAAAA GGCGCGCGCG TAGTTTGCGG CGGTAAAGCA
CATGAACGCG GCGGCAACTT CTTCCAGCCC ACCATTCTGG TGGACGTGCC GGCCAACGCC
AAAGTGTCGA AAGAAGAGAC CTTCGGCCCC CTCGCCCCGC TGTTCCGTTT TAAAGATGAA
GCCGATGTGA TCGCGCAAGC CAATGACACC GAGTTTGGCC TTGCCGCCTA TTTCTATGCC
CGTGATTTAA GTCGCGTCTT CCGCGTGGGC GAGGCGCTGG AGTACGGCAT CGTCGGCATC
AATACCGGGA TTATTTCCAA TGAAGTGGCC CCGTTCGGCG GCATTAAAGC CTCGGGTCTG
GGTCGTGAAG GTTCGAAGTA TGGCATCGAA GATTACTTAG AAATCAAATA TATGTGCATC
GGTCTTTAA
 
Protein sequence
MKLNDSNLFR QQALINGEWL DANNGEVIDV TNPANGDKLG SVPKMGADET RAAIDAANRA 
LPVWRALTAK ERANILRNWF NLMMEHQDDL ARLMTLEQGK PLAEAKGEIS YAASFIEWFA
EEGKRIYGDT IPGHQADKRL IVIKQPIGVT AAITPWNFPA AMITRKAGPA LAAGCTMVLK
PASQTPFSAL ALAELAIRAG IPAGVFNVVT GSAGAVGNEL TSNPLVRKLS FTGSTEIGRQ
LMEQCAKDIK KVSLELGGNA PFIVFDDADL DKAVEGALAS KFRNAGQTCV CANRLYVQDG
VYDRFAEKLQ QAVSKLHIGD GLDKGVTIGP LIDEKAVAKV EEHIADALEK GARVVCGGKA
HERGGNFFQP TILVDVPANA KVSKEETFGP LAPLFRFKDE ADVIAQANDT EFGLAAYFYA
RDLSRVFRVG EALEYGIVGI NTGIISNEVA PFGGIKASGL GREGSKYGIE DYLEIKYMCI
GL