Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0550 |
Symbol | gcl |
ID | 6145023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 557104 |
End bp | 558885 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615444 |
Product | glyoxylate carboligase |
Protein accession | YP_001742651 |
Protein GI | 170682793 |
COG category | [R] General function prediction only |
COG ID | [COG3960] Glyoxylate carboligase |
TIGRFAM ID | [TIGR01504] glyoxylate carboligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAAA TGAGAGCCGT TGACGCGGCA ATGTATGTGC TGGAGAAAGA AGGTATCATC ACCGCCTTCG GCGTTCCGGG AGCTGCAATC AATCCGTTCT ACTCAGCGAT GCGTAAGCAC GGCGGTATTC GTCACATTCT GGCGCGTCAT GTGGAAGGTG CTTCGCACAT GGCGGAAGGT TATACCCGCG CAACGGCAGG GAATATCGGC GTATGTCTGG GGACTTCCGG TCCTGCGGGC ACGGACATGA TCACCGCGCT CTATTCTGCT TCTGCTGATT CCATTCCTAT TCTGTGTATT ACCGGCCAGG TTCCGCGCGC CCGTCTGCAT AAAGAAGATT TCCAGGCCGT TGATATTGAA GCAATTGCTA AACCGGTCAG CAAAATGGCG GTTACAGTTC GTGAAGCGGC GCTGGTGCCT CGTGTGCTGC AACAGGCGTT TCACCTGATG CGTTCTGGTC GCCCGGGTCC GGTACTGGTG GATTTACCGT TCGACGTTCA GGTTGCGGAA ATCGAGTTTG ATCCTGACAT GTACGAACCG CTGCCGGTCT ACAAACCTGC TGCCAGCCGT ATGCAGATCG AAAAAGCTGT AGAAATGTTA ATTCAGGCCG AACGTCCGGT GATTGTTGCC GGGGGCGGGG TAATCAATGC TGACGCTGCT GCACTGTTAC AACAGTTTGC TGAACTGACC AGCGTTCCGG TGATCCCAAC GCTGATGGGC TGGGGCTGTA TTCCCGACGA TCATGAACTG ATGGCCGGGA TGGTGGGTCT GCAAACCGCG CATCGTTACG GTAACGCAAC GCTGCTGGCG TCCGACATGG TGTTTGGTAT CGGTAACCGT TTTGCTAACC GTCATACCGG TTCGGTAGAG AAATACACCG AAGGGCGCAA AATCGTTCAT ATCGATATTG AGCCGACGCA AATTGGCCGC GTGCTGTGTC CGGATCTCGG CATTGTCTCT GATGCTAAAG CAGCGCTGAC GCTGCTGGTT GAAGTGGCGC AGGAGATGCA AAAAGCGGGT CGTCTGCCGT GTCGTAAAGA ATGGGTCGCC GACTGCCAGC AGCGCAAACG CACTTTGCTG CGCAAAACCC ACTTCGACAA TGTGCCGGTG AAACCGCAGC GCGTGTATGA AGAGATGAAC AAAGCCTTTG GTCGCGATGT TTGTTATGTC ACCACCATTG GTCTGTCACA AATCGCTGCG GCACAAATGC TGCATGTCTT TAAAGACCGC CACTGGATCA ACTGTGGGCA GGCAGGCCCG TTGGGCTGGA CCATTCCGGC GGCGCTGGGC GTGTGTGCCG CTGATCCAGA GCGCAATGTG GTGGCGATTT CCGGTGACTT CGACTTCCAG TTCCTGATTG AAGAATTAGC CGTTGGCGCA CAGTTCAATA TTCCGTACAT CCATGTACTG GTGAACAACG CTTATCTGGG GCTGATTCGC CAGTCGCAGC GCGCGTTTGA TATGGATTAC TGCGTACAGC TCGCTTTCGA AAATATTAAC TCCAGCGAAG TGAACGGTTA TGGCGTCGAC CACGTAAAAG TAGCGGAAGG TTTAGGTTGT AAAGCAATTC GCGTTTTCAA ACCGGAAGAT ATTGCACCAG CCTTTGAACA GGCGAAAGTC TTAATAGCGC AATATCGGGT ACCTGTAGTC GTGGAAGTTA TTCTCGAGCG TGTGACCAAT ATTTCGATGG GCAGTGAACT GGATAACGTC ATGGAATTTG AAGATATCGC CGATAACGCA GCGGACGCAC CGACTGAAAC CTGTTTCATG CACTATGAAT AA
|
Protein sequence | MAKMRAVDAA MYVLEKEGII TAFGVPGAAI NPFYSAMRKH GGIRHILARH VEGASHMAEG YTRATAGNIG VCLGTSGPAG TDMITALYSA SADSIPILCI TGQVPRARLH KEDFQAVDIE AIAKPVSKMA VTVREAALVP RVLQQAFHLM RSGRPGPVLV DLPFDVQVAE IEFDPDMYEP LPVYKPAASR MQIEKAVEML IQAERPVIVA GGGVINADAA ALLQQFAELT SVPVIPTLMG WGCIPDDHEL MAGMVGLQTA HRYGNATLLA SDMVFGIGNR FANRHTGSVE KYTEGRKIVH IDIEPTQIGR VLCPDLGIVS DAKAALTLLV EVAQEMQKAG RLPCRKEWVA DCQQRKRTLL RKTHFDNVPV KPQRVYEEMN KAFGRDVCYV TTIGLSQIAA AQMLHVFKDR HWINCGQAGP LGWTIPAALG VCAADPERNV VAISGDFDFQ FLIEELAVGA QFNIPYIHVL VNNAYLGLIR QSQRAFDMDY CVQLAFENIN SSEVNGYGVD HVKVAEGLGC KAIRVFKPED IAPAFEQAKV LIAQYRVPVV VEVILERVTN ISMGSELDNV MEFEDIADNA ADAPTETCFM HYE
|
| |