Gene Information |
Locus tag | EcDH1_1896 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 2048744 |
End bp | 2050222 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | |
Product | succinylglutamic semialdehyde dehydrogenase |
Protein accession | ACX39554 |
Protein GI | 260449132 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
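The coordinate and length fields in the Gene Information table agree with each other, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2048744, 2050222)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values match the Gene Length (1479 bp) and Protein Length (492 aa) fields.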
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.2733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTAT GGATTAACGG TGACTGGATA ACGGGCCAGG GCGCATCGCG TGTGAAGCGT AATCCGGTAT CGGGCGAGGT GTTATGGCAA GGCAATGATG CCGATGCCGC TCAGGTCGAG CAGGCTTGTC GGGCAGCCCG TGCGGCGTTT CCGCGCTGGG CGCGGCTTTC ATTTGCTGAA CGTCATGCCG TTGTCGAACG CTTTGCCGCA CTGCTGGAAA GCAATAAAGC CGAATTAACC GCGATTATTG CCAGAGAAAC GGGTAAGCCG CGCTGGGAAG CGGCAACCGA AGTGACGGCG ATGATCAATA AAATCGCGAT ATCAATTAAG GCGTATCACG TTCGTACCGG CGAGCAGCGT AGTGAAATGC CGGACGGCGC GGCGAGCCTG CGACATCGCC CGCACGGCGT GCTGGCGGTG TTTGGGCCGT ATAATTTCCC TGGTCATTTG CCGAACGGAC ATATCGTTCC GGCATTGCTG GCAGGTAACA CCATTATCTT TAAACCCAGC GAACTGACAC CGTGGAGTGG CGAAGCGGTA ATGCGTTTAT GGCAGCAGGC TGGCTTGCCG CCAGGCGTGC TGAACCTGGT GCAGGGCGGG CGTGAAACGG GTCAGGCGCT GAGTGCGCTG GAGGATCTCG ACGGTTTGCT GTTTACCGGT AGCGCCAATA CAGGCTACCA GTTGCATCGC CAGCTCTCCG GTCAGCCGGA GAAAATTCTC GCCCTTGAGA TGGGCGGTAA TAATCCGCTA ATTATCGATG AGGTGGCGGA TATCGACGCG GCTGTCCATC TGACCATTCA GTCGGCGTTT GTCACAGCCG GTCAACGCTG CACCTGCGCC CGCCGTTTAT TGCTGAAAAG CGGGGCGCAG GGCGATGCGT TTCTTGCTCG TCTGGTTGCC GTCAGCCAGC GATTAACGCC GGGCAACTGG GATGACGAAC CGCAGCCGTT TATTGGCGGG CTGATTTCTG AACAGGCCGC ACAGCAGGTG GTTACTGCAT GGCAGCAACT GGAAGCGATG GGCGGACGAC CCCTGCTTGC GCCGCGCTTA TTACAAGCAG GGACATCGTT GCTGACGCCG GGGATCATTG AAATGACAGG CGTTGCTGGC GTACCAGATG AAGAGGTGTT CGGACCGTTA TTGCGCGTCT GGCGTTATGA TACTTTCGAT GAAGCGATTC GAATGGCGAA TAACACTCGC TTCGGACTCT CTTGCGGTCT GGTTTCCCCC GAGCGGGAAA AGTTCGATCA ACTGTTGCTG GAGGCGCGGG CGGGGATTGT TAACTGGAAC AAACCGCTTA CCGGTGCTGC CAGTACCGCG CCATTCGGCG GCATTGGTGC TTCCGGTAAC CATCGCCCCA GCGCCTGGTA TGCCGCAGAT TACTGCGCAT GGCCGATGGC GAGCCTGGAG TCGGACTCGT TAACATTGCC CGCCACGCTT AACCCCGGGC TGGATTTTTC CGATGAGGTG GTGCGATGA
|
Protein sequence | MTLWINGDWI TGQGASRVKR NPVSGEVLWQ GNDADAAQVE QACRAARAAF PRWARLSFAE RHAVVERFAA LLESNKAELT AIIARETGKP RWEAATEVTA MINKIAISIK AYHVRTGEQR SEMPDGAASL RHRPHGVLAV FGPYNFPGHL PNGHIVPALL AGNTIIFKPS ELTPWSGEAV MRLWQQAGLP PGVLNLVQGG RETGQALSAL EDLDGLLFTG SANTGYQLHR QLSGQPEKIL ALEMGGNNPL IIDEVADIDA AVHLTIQSAF VTAGQRCTCA RRLLLKSGAQ GDAFLARLVA VSQRLTPGNW DDEPQPFIGG LISEQAAQQV VTAWQQLEAM GGRPLLAPRL LQAGTSLLTP GIIEMTGVAG VPDEEVFGPL LRVWRYDTFD EAIRMANNTR FGLSCGLVSP EREKFDQLLL EARAGIVNWN KPLTGAASTA PFGGIGASGN HRPSAWYAAD YCAWPMASLE SDSLTLPATL NPGLDFSDEV VR
|
| |
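The sequences above are displayed in space-separated blocks of 10 characters, so they usually need to be joined before any downstream use. The following is a minimal sketch of that step plus a GC-fraction calculation; the helper names are hypothetical, and the example input is only the first three blocks of the gene sequence, so its GC fraction is not representative of the whole gene.

```python
def normalize(seq: str) -> str:
    """Remove the display spacing and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


# First three 10-base blocks of the Gene sequence field, as displayed.
prefix = "ATGACTTTAT GGATTAACGG TGACTGGATA"
joined = normalize(prefix)
print(joined, len(joined), round(gc_fraction(prefix), 3))
```

Applying `gc_fraction` to the full Gene sequence field is the way to compare against the GC content field reported in the record.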