Gene EcolC_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1045 
SymbolgabD 
ID6066422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1131449 
End bp1132897 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content57% 
IMG OID641600458 
Productsuccinate-semialdehyde dehydrogenase I 
Protein accessionYP_001724041 
Protein GI170019087 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTA ACGACAGTAA ATTATTCCGC CAGCAGGCGT TGATTAACGG GGAATGGCTG 
GACGCCAACA ATGGCGAGGT CATCGATGTC ACCAATCCGG CGAACGGCGA CAAGCTGGGT
AGCGTACCCA AAATGGGCAC TGATGAAACC CGCGCCGCTA TCGACGCCGC CAACCGCGCT
CTGCCCGCCT GGCGTGCGCT CACCGCCAAA GAACGCGCCA ACATTCTGCG CAACTGGTTC
AATTTGTTGA TGGAGCATCA GGACGATTTA GCGCGCCTGA TGACCCTAGA ACAGGGCAAA
CCGCTGGCTG AAGCGAAAGG CGAAATTAGC TACGCCGCCT CCTTTATTGA GTGGTTTGCT
GAAGAAGGCA AACGCATTTA TGGCGACACC ATTCCTGGTC ATCAGGCCGA TAAGCGCCTG
ATTGTTATCA AGCAGCCGAT TGGCGTCACC GCAGCTATCA CGCCGTGGAA CTTCCCGGCG
GCGATGATTA CCCGCAAAGC CGGTCCGGCG CTGGCGGCAG GCTGCACCAT GGTGCTGAAG
CCTGCCAGTC AGACGCCGTT CTCTGCGCTG GCGCTGGCGG AGCTGGCGAT TCGCGCGGGC
ATTCCGGCTG GGGTATTTAA CGTGGTCACC GGTTCGGCGG GCGCGGTGGG TAACGAACTG
ACCAGCAACC CGCTGGTGCG CAAGCTGTCG TTTACCGGTT CGACCGAAAT TGGCCGCCAG
TTAATGGAAC AGTGCGCGAA AGACATCAAA AAAGTGTCGC TGGAGCTGGG CGGCAACGCG
CCGTTTATCG TCTTTGACGA TGCCGACCTC GACAAAGCCG TGGAAGGTGC GCTGTCCTCG
AAATTCCGCA ACGCCGGGCA AACCTGCGTC TGCGCCAACC GTTTATACGT GCAGGACGGC
GTGTATGACC GCTTTGCCGA AAAATTGCAG CAGGCGGTGA GCAAATTGCA CATCGGCGAC
GGGCTGGAGA AAGGCGTCAC CATCGGGCCG CTGATCGATG AAAAAGCGGT AGCAAAAGTG
GAAGAGCATA TTGCCGATGC GCTGGAGAAA GGTGCGCGTG TGGTTTGCGG CGGTAAAGCG
GATGAACGCG GCGGCAACTT CTTCCAGCCG ACCATTCTGG TGGACGTTCC AGCCAACGCC
AAAGTGTCGA AAGAAGAGAC GTTCGGCCCC CTCGCCCCAC TGTTCCGTTT TAAAGATGAA
GCCGATGTGA TTGCGCAAGC CAATGACACC GAGTTTGGCC TTGCCGCCTA TTTCTACGCC
CGTGATTTAA GCCGCGTCTT CCGCGTGGGC GAAGCGCTGG AGTACGGCAT CGTCGGCATC
AATACCGGCA TCATTTCCAA TGAAGTGGCC CCGTTCGGCG GCATTAAAGC CTCGGGTCTG
GGTCGTGAAG GTTCGAAGTA TGGCATCGAA GATTACTTAG AAATCAAATA TATGTGCATC
GGTCTTTAA
 
Protein sequence
MKLNDSKLFR QQALINGEWL DANNGEVIDV TNPANGDKLG SVPKMGTDET RAAIDAANRA 
LPAWRALTAK ERANILRNWF NLLMEHQDDL ARLMTLEQGK PLAEAKGEIS YAASFIEWFA
EEGKRIYGDT IPGHQADKRL IVIKQPIGVT AAITPWNFPA AMITRKAGPA LAAGCTMVLK
PASQTPFSAL ALAELAIRAG IPAGVFNVVT GSAGAVGNEL TSNPLVRKLS FTGSTEIGRQ
LMEQCAKDIK KVSLELGGNA PFIVFDDADL DKAVEGALSS KFRNAGQTCV CANRLYVQDG
VYDRFAEKLQ QAVSKLHIGD GLEKGVTIGP LIDEKAVAKV EEHIADALEK GARVVCGGKA
DERGGNFFQP TILVDVPANA KVSKEETFGP LAPLFRFKDE ADVIAQANDT EFGLAAYFYA
RDLSRVFRVG EALEYGIVGI NTGIISNEVA PFGGIKASGL GREGSKYGIE DYLEIKYMCI
GL