Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2464 |
Symbol | astD |
ID | 6968544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2334062 |
End bp | 2335540 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643386333 |
Product | succinylglutamic semialdehyde dehydrogenase |
Protein accession | YP_002270815 |
Protein GI | 209399129 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03240] succinylglutamic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.844706 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTAT GGATTAACGG TGACTGGATA ACGGGCCAGG GTGCATCGCG TGTGAAGCGT AATCCGGTAT CGGGCGAGGT GTTATGGCAA GGCAATGATG CCGATGCCGC TCAGGTCGGG CAGGCTTGTC GGGCAGCCCG TGCGGCGTTT CCGCGCTGGG CGCGGCTCTC ATTTGCTGAA CGTCAGGCCG TTGTCGAACG CTTTGCCGGA TTGCTGGAAA GGAATAAAGG CGAATTAACC GCAATCATTG CCCGAGAAAC CGGCAAGCCG CGCTGGGAGG CGGCAACCGA ATTGACGGCG ATGATCAATA AAATCGCGAT ATCAATTAAG GCGTATCACG TTCGTACCGG CGAGCAGCGT AGTGAAATGC CGGACGGTGC GGCGAGCCTG CGACATCGCC CGCACGGCGT GCTGGCGGTG TTTGGGCCGT ATAATTTCCC TGGTCATTTG CCGAACGGAC ATATCGTTCC CGCATTGCTG GCAGGTAACA CCATTATCTT TAAACCCAGC GAACTGACAC CGTGGAGTGG CGAAGCGGTA ATGCGTTTAT GGCAGCAGGC TGGCTTGCCG CCAGGCGTGC TGAACCTGGT GCAGGGCGGG TGTGAAACGG GTCAGGCGCT GAGTGCGCTG GAGGATCTCG ACGGCTTGCT GTTTACCGGC AGCGCCAATA CCGGCTACCA GCTGCATCGC CAGCTCTCCG GTCAGCCGGA GAAAATTCTC GCCCTTGAGA TGGGCGGCAA TAACCCGCTG ATTATCGATG AGGTGGCGGA TATCGACGCG GCTGTCCATC TGACCATTCA GTCGGCGTTT GTTACAGCCG GGCAACGCTG CACCTGCGCC CGCCGTGTAT TGCTGAAAAG CGGGGCGCAG GGCGATGCAT TTCTTGCTCG TCTGGTTGCC GTCAGCCAGC GATTAACGCC GGGCAACTGG GATGACGAAC CGCAGCCGTT TATTGGCGGG CTGATTTCTG AACAGGCCGC ACAGCAGGTG TTTACTGCCT GGCAGCAACT GGAAGCGATG GGTGGACGAA CCCTGCTTGC GCCGCGCTTA TTACAAGCAG GGACATCGTT GCTGACGCCG GGGATCATTG AAATGACAGG CGTTGCTGGC GTACCAGATG AAGAGGTGTT CGGACCGTTA TTGCGCGTCT GGCGTTATGA TTCTTTCGAG GAAGCGATTC GAATGGCGAA TAACACTCGC TTCGGACTCT CTTGCGGTCT GGTTTCCCCC GAGCGGGAAA AATTCGATCA ACTGTTGCTG GAGGCGCGGG CGGGGATTGT TAACTGGAAC AAACCGCTTA CTGGTGCTGC CAGTACCGCG CCATTCGGCG GCATTGGTGC ATCCGGTAAC CATCGCCCCA GCGCCTGGTA TGCCGCAGAT TACTGCGCAT GGCCTATGGC GAGCCTGGAG TCGGACTCGT TAACTTTGCC CGCAACGCTT AACCCCGGGC TGGATTTTTC CGATGAGGTG GTGCGATGA
|
Protein sequence | MTLWINGDWI TGQGASRVKR NPVSGEVLWQ GNDADAAQVG QACRAARAAF PRWARLSFAE RQAVVERFAG LLERNKGELT AIIARETGKP RWEAATELTA MINKIAISIK AYHVRTGEQR SEMPDGAASL RHRPHGVLAV FGPYNFPGHL PNGHIVPALL AGNTIIFKPS ELTPWSGEAV MRLWQQAGLP PGVLNLVQGG CETGQALSAL EDLDGLLFTG SANTGYQLHR QLSGQPEKIL ALEMGGNNPL IIDEVADIDA AVHLTIQSAF VTAGQRCTCA RRVLLKSGAQ GDAFLARLVA VSQRLTPGNW DDEPQPFIGG LISEQAAQQV FTAWQQLEAM GGRTLLAPRL LQAGTSLLTP GIIEMTGVAG VPDEEVFGPL LRVWRYDSFE EAIRMANNTR FGLSCGLVSP EREKFDQLLL EARAGIVNWN KPLTGAASTA PFGGIGASGN HRPSAWYAAD YCAWPMASLE SDSLTLPATL NPGLDFSDEV VR
|
| |