Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_1968 |
Symbol | astD |
ID | 5586005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1953878 |
End bp | 1955356 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640925640 |
Product | succinylglutamic semialdehyde dehydrogenase |
Protein accession | YP_001463043 |
Protein GI | 157156088 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03240] succinylglutamic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTAT GGATTAACGG TGACTGGATA ACGGGCCAGG GCGCATCGCG TGTGAAGCGT AATCCGGTAT CGGGCGAGGT GTTATGGCAA GGCAATGATG CCGATGCCGC TCAGGTCGGG CAGGCTTGTC GCGCAGCCCG TGCGGCGTTT CCGCGCTGGG CGCGGCTCTC ATTGGCTGAA CGTCAGGTCG TTGTCGAACG CTTTGCCGGA TTGCTGGAAA GGAATAAAGG CGAATTAACC GCGATTATTG CCAGAGAAAC GGGTAAGCCG CGCTGGGAAG CGGCAACCGA AGTGACGGCG ATGATCAATA AAATCGCGAT ATCAATTAAG GCGTATCACG TTCGTACCGG CGAGCAACGT AGTGAAATGC CGGACGGTGC GGCGAGCCTG CGACATCGCC CGCACGGCGT GCTGGCGGTG TTTGGGCCGT ATAATTTCCC TGGTCATTTG CCGAACGGAC ATATCGTTCC GGCATTGTTG GCAGGTAACA CCATTATCTT TAAACCCAGC GAACTGACAC CGTGGAGTGG CGAAGCGGTA ATGCGTTTAT GGCAGCAGGC TGGCTTGCCG CCGGGCGTAC TGAACCTGGT GCAGGGCGGG CGTGAAACGG GTCAGGCGCT GAGTGCGCTG GAGGATCTCG ACGGTTTGCT GTTTACCGGT AGCGCCAATA CCGGCTACCA GCTGCATCGC CAGCTCTCCG GTCAGCCGGA GAAAATTCTC GCCCTTGAGA TGGGCGGTAA TAATCCGCTA ATTATCGATG AGGTGGCGGA TATCGACGCG GCTGTCCATC TGACCATTCA GTCGGCGTTT GTCACAGCCG GGCAACGCTG CACCTGCGCC CGCCGTTTAT TGCTCAAAAG CGGAGCGCAG GGCGATGCGT TTCTTGCTCG TCTGGTTGCC GTCAGCCAGC GATTAACGCC GGGCAACTGG GATGACGAAC CGCAGCCGTT TATTGGCGGG CTGATTTCTG AACAGGCCGC ACAGCAGGTG GTTACTGCCT GGCAGCAACT GGAAGCGATG GGCGGACGAA CCCTGCTTGC GCCGCGCTTA TTACAATCAG AGACATCGTT GCTGACGCCG GGGATCATTG AAATGACAGG CGTTGCTGGC GTACCAGATG AAGAGGTGTT CGGACCGTTA TTGCGCGTCT GGCGTTATGA TTCTTTCGAG GAAGCGATTC TAATGGCGAA TAACACTCGC TTCGGACTCT CTTGCGGTCT GGTTTCCCCC GAGCGGGAAA AATTCGATCA ACTGTTGCTG GAGGCGCGGG CGGGGATTGT TAACTGGAAC AAACCGCTTA CTGGTGCTGC CAGTACCGCG CCATTCGGCG GCATTGGTGC ATCCGGTAAC CATCGCCCCA GCGCCTGGTA TGCCGCAGAT TACTGCGCAT GGCCTATGGC GAGCCTGGAG TCGGACTCGT TAACTTTGCC CGCAACGCTT AACCCCGGGC TGGATTTTTC CGATGAGGTG GTGCGATGA
|
Protein sequence | MTLWINGDWI TGQGASRVKR NPVSGEVLWQ GNDADAAQVG QACRAARAAF PRWARLSLAE RQVVVERFAG LLERNKGELT AIIARETGKP RWEAATEVTA MINKIAISIK AYHVRTGEQR SEMPDGAASL RHRPHGVLAV FGPYNFPGHL PNGHIVPALL AGNTIIFKPS ELTPWSGEAV MRLWQQAGLP PGVLNLVQGG RETGQALSAL EDLDGLLFTG SANTGYQLHR QLSGQPEKIL ALEMGGNNPL IIDEVADIDA AVHLTIQSAF VTAGQRCTCA RRLLLKSGAQ GDAFLARLVA VSQRLTPGNW DDEPQPFIGG LISEQAAQQV VTAWQQLEAM GGRTLLAPRL LQSETSLLTP GIIEMTGVAG VPDEEVFGPL LRVWRYDSFE EAILMANNTR FGLSCGLVSP EREKFDQLLL EARAGIVNWN KPLTGAASTA PFGGIGASGN HRPSAWYAAD YCAWPMASLE SDSLTLPATL NPGLDFSDEV VR
|
| |