Gene Information |
Locus tag | ECH74115_3903 |
Symbol | gabD |
ID | 6971660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3617696 |
End bp | 3619144 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643387677 |
Product | succinate-semialdehyde dehydrogenase I |
Protein accession | YP_002272125 |
Protein GI | 209399689 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01780] succinate-semialdehyde dehydrogenase |
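
The reported gene and protein lengths can be reproduced from the start and end coordinates as a consistency check. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3617696
end_bp = 3619144
reported_gene_length = 1449      # bp
reported_protein_length = 482    # aa

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both the start and the end position.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

If either comparison printed False, the likely causes would be a different coordinate convention (for example 0-based, half-open) or a partial CDS.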
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.85959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGA ACGACAGTAA CTTATTCCGC CAGCAGGCGT TGATTAACGG GGAATGGCTG GACGCCAACA ATGGTGAAGT CATCGACGTC ACCAATCCGG CGAACGGCGA CAAGCTGGGT AGCGTGCCGA AAATGGGCGC TGATGAAACT CGCGCCGCTA TCGACGCCGC CAACCGCGCC CTGCCCGTCT GGCGTGCGCT CACCGCCAAA GAACGCGCCA ACATTCTGCG CAACTGGTTC AATTTGATGA TGGAGCATCA GGACGATTTA GCGCGCCTGA TGACCCTCGA ACAGGGTAAA CCGCTGGCTG AAGCGAAAGG CGAAATCAGC TACGCCGCCT CTTTTATTGA GTGGTTTGCC GAAGAAGGCA AACGCATTTA TGGCGACACC ATTCCCGGTC ATCAGGCCGA TAAGCGCCTG ATTGTGATCA AGCAGCCGAT TGGCGTTACA GCGGCCATCA CGCCGTGGAA CTTTCCGGCG GCAATGATTA CCCGCAAAGC CGGTCCGGCG CTGGCGGCAG GCTGCACCAT GGTGCTGAAG CCCGCCAGTC AGACGCCGTT CTCTGCGCTG GCGCTGGCGG AGCTGGCGAT CCGCGCGGGC ATTCCGGCTG GCGTGTTTAA CGTGGTCACC GGTTCGGCGG GCGCGGTGGG TAATGAACTG ACCAGCAACC CGCTGGTACG CAAACTGTCG TTTACCGGCT CGACCGAAAT TGGCCGCCAG TTAATGGAAC AGTGCGCGAA AGACATCAAA AAAGTGTCGC TGGAGCTGGG CGGCAACGCG CCGTTTATCG TCTTTGACGA TGCCGATCTC GATAAAGCCG TGGAAGGCGC ACTGGCCTCG AAATTCCGCA ACGCCGGGCA AACCTGCGTC TGCGCCAACC GTTTATACGT GCAGGACGGC GTGTATGACC GCTTTGCCGA AAAATTGCAG CAGGCGGTGA GCAAACTGCA TATCGGCGAC GGGCTGGATA AAGGCGTCAC CATCGGGCCG CTGATCGATG AAAAAGCGGT AGCAAAAGTG GAAGAGCATA TTGCCGATGC GCTGGAAAAA GGCGCGCGCG TAGTTTGCGG CGGTAAAGCA CATGAACGCG GCGGCAACTT CTTCCAGCCC ACCATTCTGG TGGACGTGCC GGCCAACGCC AAAGTGTCGA AAGAAGAGAC CTTCGGCCCC CTCGCCCCGC TGTTCCGTTT TAAAGATGAA GCCGATGTGA TCGCGCAAGC CAATGACACC GAGTTTGGCC TTGCCGCCTA TTTCTATGCC CGTGATTTAA GTCGCGTCTT CCGCGTGGGC GAGGCGCTGG AGTACGGCAT CGTCGGCATC AATACCGGGA TTATTTCCAA TGAAGTGGCC CCGTTCGGCG GCATTAAAGC CTCGGGTCTG GGTCGTGAAG GTTCGAAGTA TGGCATCGAA GATTACTTAG AAATCAAATA TATGTGCATC GGTCTTTAA
Protein sequence | MKLNDSNLFR QQALINGEWL DANNGEVIDV TNPANGDKLG SVPKMGADET RAAIDAANRA LPVWRALTAK ERANILRNWF NLMMEHQDDL ARLMTLEQGK PLAEAKGEIS YAASFIEWFA EEGKRIYGDT IPGHQADKRL IVIKQPIGVT AAITPWNFPA AMITRKAGPA LAAGCTMVLK PASQTPFSAL ALAELAIRAG IPAGVFNVVT GSAGAVGNEL TSNPLVRKLS FTGSTEIGRQ LMEQCAKDIK KVSLELGGNA PFIVFDDADL DKAVEGALAS KFRNAGQTCV CANRLYVQDG VYDRFAEKLQ QAVSKLHIGD GLDKGVTIGP LIDEKAVAKV EEHIADALEK GARVVCGGKA HERGGNFFQP TILVDVPANA KVSKEETFGP LAPLFRFKDE ADVIAQANDT EFGLAAYFYA RDLSRVFRVG EALEYGIVGI NTGIISNEVA PFGGIKASGL GREGSKYGIE DYLEIKYMCI GL
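
The relationship between the two sequence fields can be sketched in code: the gene sequence is stored in space-separated blocks of 10 bases, and translating it with translation table 11 should give the listed protein. The example below is a hedged sketch, not the database's own pipeline. It uses only the first three blocks of the gene sequence to stay short, and the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are allowed).

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, with codons
# enumerated in TCAG order (identical to the standard code's assignments).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above (illustrative excerpt only).
excerpt = clean_sequence("ATGAAACTGA ACGACAGTAA CTTATTCCGC")
print(translate(excerpt))
print(gc_fraction(excerpt))
```

Running the same functions on the full gene sequence would allow the listed protein sequence and the reported GC content to be checked against the record.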
| |