Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1445 |
Symbol | astD |
ID | 6145919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1429365 |
End bp | 1430843 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641616323 |
Product | succinylglutamic semialdehyde dehydrogenase |
Protein accession | YP_001743503 |
Protein GI | 170679694 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03240] succinylglutamic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.545801 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTAT GGATTAACGG TGACTGGATA ACGGGCCAGG GCGCATCGCG TGTGAAGCGT AATCCGGTAT CGGGCGAGGT GTTATGGCAG GGCAATGATG CCGATGCCGC TCAGGTCGGG CTGGCTTGTC GGGCTGCCCG TGCGGCGTTT CCGCGTTGGG CGCGGCTCTC CTTTGGCGAT CGTCAGGTAA GAGTTGAACG CTTTGCCGCA CTGCTGGAAA GCAATAAAGC CGAATTAACC GCGATTATTG CCAGAGAAAC GGGTAAGCCG CGCTGGGAAG CGGCAACTGA AGTGACGGCG ATGATCAATA AAATCGCGAT ATCAATTAAG GCGTATCACG TTCGTACCGG CGAGCAGCGT AGCGAAATGC CGGACGGCGC GGCAAGCCTG CGCCATCGTC CACACGGCGT GCTGGCGGTG TTTGGGCCGT ATAATTTCCC CGGTCATTTA CCGAACGGAC ATATCGTTCC GGCATTGCTG GCAGGTAACA CCATTATCTT TAAACCCAGC GAACTGACAC CGTGGAGTGG CGAAGCGGTA ATGCGTTTAT GGCAGCAGGC TGGCTTGCCG CCGGGCGTGC TGAACCTGGT GCAGGGCGGG CGCGAAACCG GTCAGGCGCT GAGTGCACTT GAGGATCTCG ACGGCTTGCT GTTTACCGGT AGCGCCAATA CCGGCTACCA GTTGCATCGC CAGCTCTCCG GTCAGCCGGA GAAAATTCTC GCCCTTGAGA TGGGCGGTAA TAATCCGCTA ATTATCGATG AGGCGGCGGA TATCGATGCC GCTGTCCATC TGACCATTCA GTCGGCGTTT GTCACAGCCG GGCAACGCTG CACCTGCGCT CGCCGTTTAT TGCTGAAAAG CGGGGCGCAG GGCGATGCAT TTCTTGCTCG CCTGGTTGCC GTCAGCCAGC GGTTAACGCC GGGCACTTGG GATGACGAAC CGCAGCCATT TATTGGTGGG CTGATTTCTG AGCAGGCCGC ACAGCAGGTG GTTACTGCCT GGCATGAACT GGAGGCGATG GGCGGACGGA CCCTGCTTGC GCCGCGCTTA TTACAAGCAG GGACATCGTT GCTGACGCCG GGGATCATTG AAATGACAGG CGTTACTGGC GTGCCGGATG AAGAAGTGTT TGGGCCGCTA TTGCGCGTCT GGCGTTATGA CAATTTCGAT GAAGCGATTC GAATGGCGAA TAACACTCGC TTCGGCCTCT CTTGCGGTCT GGTTTCCCCC GAGCGGGAAA AATTCGATCA ACTGTTGCTG GAGGCACGGG CGGGGATTGT TAACTGGAAC AAACCGCTTA CCGGTGCTGC CAGTACCGCG CCATTCGGCG GCATTGGCGC TTCCGGCAAT CATCGCCCCA GCGCCTGGTA TGCCGCAGAT TATTGTGCCT GGCCGATGGC AAGCCTGGAG TCGGACTCGT TAACGTTGCC AGCAACGCTT AACCCCGGGC TGGATTTTTC CGATGAGGTG GTGCGATGA
|
Protein sequence | MTLWINGDWI TGQGASRVKR NPVSGEVLWQ GNDADAAQVG LACRAARAAF PRWARLSFGD RQVRVERFAA LLESNKAELT AIIARETGKP RWEAATEVTA MINKIAISIK AYHVRTGEQR SEMPDGAASL RHRPHGVLAV FGPYNFPGHL PNGHIVPALL AGNTIIFKPS ELTPWSGEAV MRLWQQAGLP PGVLNLVQGG RETGQALSAL EDLDGLLFTG SANTGYQLHR QLSGQPEKIL ALEMGGNNPL IIDEAADIDA AVHLTIQSAF VTAGQRCTCA RRLLLKSGAQ GDAFLARLVA VSQRLTPGTW DDEPQPFIGG LISEQAAQQV VTAWHELEAM GGRTLLAPRL LQAGTSLLTP GIIEMTGVTG VPDEEVFGPL LRVWRYDNFD EAIRMANNTR FGLSCGLVSP EREKFDQLLL EARAGIVNWN KPLTGAASTA PFGGIGASGN HRPSAWYAAD YCAWPMASLE SDSLTLPATL NPGLDFSDEV VR
|
| |