Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1886 |
Symbol | astD |
ID | 6065099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2086956 |
End bp | 2088434 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641601299 |
Product | succinylglutamic semialdehyde dehydrogenase |
Protein accession | YP_001724861 |
Protein GI | 170019907 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03240] succinylglutamic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0118018 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTAT GGATTAACGG TGACTGGATA ACGGGCCAGG GCGCATTGCG TGTGAGGCGT AATCCGGTAT CGGGCGAGGT GTTATGGCAG GGCAATGATG CCGATGCCGC TCAGGTCGAG CAGGCTTGTC GGGCAGCCCG TGCGGCGTTT CCTCGCTGGG CGCGGCTCTC ATTTGCTGAA CGTCAGGCCG TTGTCGAACG CTTTGCCGGA TTGCTGGAAA GGAATAAAGG CGAATTAACC GCAATCATTG CCCGAGAAAC CGGCAAGCCG CGCTGGGAAG CGGCAACCGA AGTGACGGCG ATGATCAATA AAATCGCGAT ATCAATTAAG GCGTATCACG TTCGTACCGG CGAGCAGCGT AGCGAAATGC CGGACGGCGC GGCGAGCCTG CGACATCGCC CGCACGGCGT GCTGGCGGTG TTTGGGCCGT ATAATTTCCC TGGTCATTTG CCGAACGGAC ATATCGTTCC GGCATTGCTG GCAGGTAACA CCATTATCTT TAAACCCAGC GAACTGACAC CGTGGAGTGG CGAAGCGGTA ATGCGTTTAT GGCAGCAGGC TGGCTTGCCG CCGGGCGTAC TGAACCTGGT GCAGGGCGGG CGAGAAACGG GTCAGGCGCT GAGTGCGCTG GAGGATCTCG ACGGTTTGCT GTTTACCGGT AGCGCCAATA CCGGCTACCA GCTGCGTCGC CAGCTCTCCG GTCAGCCGGA GAAAATTCTC GCCCTTGAGA TGGGCGGTAA TAATCCGCTA ATTATCGATG AGGTGGCGGA TATCGACGCG GCTGTCCATC TGACCATTCA GTCGGCGTTT GTCACAGCCG GGCAACGCTG CACCTGCGCC CGCCGTTTAT TGCTCAAAAG CGGAGCGCAG GGCGATGCGT TTCTTGCTCG TCTGGTTGCC GTCAGCCAGC GATTAACGCC GGGCAACTGG GATGACGAAC CGCAGCCGTT TATTGGCGGG CTGATTTCTG AACAGGCCGC ACAGCAGGTG GTTACTGCCT GGCAGCAACT GGAAGCGATG GGCGGACGAA CCCTGCTTGC GCCGCGCTTA TTACAATCAG AGACATCGTT GCTGACGCCG GGGATCATTG AAATGACAGG CGTTGCTGGC GTACCAGATG AAGAGGTGTT CGGGCCGCTT TTGCGCGTCT GGCGTTATGA TTCTTTCGAG GAAGCGATTC TAATGGCGAA TAACACTCGC TTCGGACTCT CTTGCGGTCT GGTTTCCCCC GAGCGGGAAA AATTCGATCA ACTGTTGCTG GAGGCGCGGG CGGGGATTGT TAACTGGAAC AAACCGCTTA CTGGTGCTGC CAGTACCGCG CCATTCGGCG GCATTGGTGC ATCCGGTAAC CATCGCCCCA GCGCCTGGTA TGCCGCAGAT TACTGCGCAT GGCCTATGGC GAGCCTGGAG TCGGACTCGT TAACTTTGCC CGCAACGCTT AACCCCGGGC TGGATTTTTC CGATGAGGTG GTGCGATGA
|
Protein sequence | MTLWINGDWI TGQGALRVRR NPVSGEVLWQ GNDADAAQVE QACRAARAAF PRWARLSFAE RQAVVERFAG LLERNKGELT AIIARETGKP RWEAATEVTA MINKIAISIK AYHVRTGEQR SEMPDGAASL RHRPHGVLAV FGPYNFPGHL PNGHIVPALL AGNTIIFKPS ELTPWSGEAV MRLWQQAGLP PGVLNLVQGG RETGQALSAL EDLDGLLFTG SANTGYQLRR QLSGQPEKIL ALEMGGNNPL IIDEVADIDA AVHLTIQSAF VTAGQRCTCA RRLLLKSGAQ GDAFLARLVA VSQRLTPGNW DDEPQPFIGG LISEQAAQQV VTAWQQLEAM GGRTLLAPRL LQSETSLLTP GIIEMTGVAG VPDEEVFGPL LRVWRYDSFE EAILMANNTR FGLSCGLVSP EREKFDQLLL EARAGIVNWN KPLTGAASTA PFGGIGASGN HRPSAWYAAD YCAWPMASLE SDSLTLPATL NPGLDFSDEV VR
|
| |