Gene EcSMS35_0550 details


Gene Information

Locus tag: EcSMS35_0550
Symbol: gcl
ID: 6145023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 557104
End bp: 558885
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 53%
IMG OID: 641615444
Product: glyoxylate carboligase
Protein accession: YP_001742651
Protein GI: 170682793
COG category: [R] General function prediction only
COG ID: [COG3960] Glyoxylate carboligase
TIGRFAM ID: [TIGR01504] glyoxylate carboligase
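
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 557104
end_bp = 558885
gene_length_bp = 1782
protein_length_aa = 593

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. The last codon is assumed
# to be the stop codon, which does not contribute a residue.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1782 bp is 594 codons, which gives 593 residues plus one stop codon.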


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAA TGAGAGCCGT TGACGCGGCA ATGTATGTGC TGGAGAAAGA AGGTATCATC 
ACCGCCTTCG GCGTTCCGGG AGCTGCAATC AATCCGTTCT ACTCAGCGAT GCGTAAGCAC
GGCGGTATTC GTCACATTCT GGCGCGTCAT GTGGAAGGTG CTTCGCACAT GGCGGAAGGT
TATACCCGCG CAACGGCAGG GAATATCGGC GTATGTCTGG GGACTTCCGG TCCTGCGGGC
ACGGACATGA TCACCGCGCT CTATTCTGCT TCTGCTGATT CCATTCCTAT TCTGTGTATT
ACCGGCCAGG TTCCGCGCGC CCGTCTGCAT AAAGAAGATT TCCAGGCCGT TGATATTGAA
GCAATTGCTA AACCGGTCAG CAAAATGGCG GTTACAGTTC GTGAAGCGGC GCTGGTGCCT
CGTGTGCTGC AACAGGCGTT TCACCTGATG CGTTCTGGTC GCCCGGGTCC GGTACTGGTG
GATTTACCGT TCGACGTTCA GGTTGCGGAA ATCGAGTTTG ATCCTGACAT GTACGAACCG
CTGCCGGTCT ACAAACCTGC TGCCAGCCGT ATGCAGATCG AAAAAGCTGT AGAAATGTTA
ATTCAGGCCG AACGTCCGGT GATTGTTGCC GGGGGCGGGG TAATCAATGC TGACGCTGCT
GCACTGTTAC AACAGTTTGC TGAACTGACC AGCGTTCCGG TGATCCCAAC GCTGATGGGC
TGGGGCTGTA TTCCCGACGA TCATGAACTG ATGGCCGGGA TGGTGGGTCT GCAAACCGCG
CATCGTTACG GTAACGCAAC GCTGCTGGCG TCCGACATGG TGTTTGGTAT CGGTAACCGT
TTTGCTAACC GTCATACCGG TTCGGTAGAG AAATACACCG AAGGGCGCAA AATCGTTCAT
ATCGATATTG AGCCGACGCA AATTGGCCGC GTGCTGTGTC CGGATCTCGG CATTGTCTCT
GATGCTAAAG CAGCGCTGAC GCTGCTGGTT GAAGTGGCGC AGGAGATGCA AAAAGCGGGT
CGTCTGCCGT GTCGTAAAGA ATGGGTCGCC GACTGCCAGC AGCGCAAACG CACTTTGCTG
CGCAAAACCC ACTTCGACAA TGTGCCGGTG AAACCGCAGC GCGTGTATGA AGAGATGAAC
AAAGCCTTTG GTCGCGATGT TTGTTATGTC ACCACCATTG GTCTGTCACA AATCGCTGCG
GCACAAATGC TGCATGTCTT TAAAGACCGC CACTGGATCA ACTGTGGGCA GGCAGGCCCG
TTGGGCTGGA CCATTCCGGC GGCGCTGGGC GTGTGTGCCG CTGATCCAGA GCGCAATGTG
GTGGCGATTT CCGGTGACTT CGACTTCCAG TTCCTGATTG AAGAATTAGC CGTTGGCGCA
CAGTTCAATA TTCCGTACAT CCATGTACTG GTGAACAACG CTTATCTGGG GCTGATTCGC
CAGTCGCAGC GCGCGTTTGA TATGGATTAC TGCGTACAGC TCGCTTTCGA AAATATTAAC
TCCAGCGAAG TGAACGGTTA TGGCGTCGAC CACGTAAAAG TAGCGGAAGG TTTAGGTTGT
AAAGCAATTC GCGTTTTCAA ACCGGAAGAT ATTGCACCAG CCTTTGAACA GGCGAAAGTC
TTAATAGCGC AATATCGGGT ACCTGTAGTC GTGGAAGTTA TTCTCGAGCG TGTGACCAAT
ATTTCGATGG GCAGTGAACT GGATAACGTC ATGGAATTTG AAGATATCGC CGATAACGCA
GCGGACGCAC CGACTGAAAC CTGTTTCATG CACTATGAAT AA
 
Protein sequence
MAKMRAVDAA MYVLEKEGII TAFGVPGAAI NPFYSAMRKH GGIRHILARH VEGASHMAEG 
YTRATAGNIG VCLGTSGPAG TDMITALYSA SADSIPILCI TGQVPRARLH KEDFQAVDIE
AIAKPVSKMA VTVREAALVP RVLQQAFHLM RSGRPGPVLV DLPFDVQVAE IEFDPDMYEP
LPVYKPAASR MQIEKAVEML IQAERPVIVA GGGVINADAA ALLQQFAELT SVPVIPTLMG
WGCIPDDHEL MAGMVGLQTA HRYGNATLLA SDMVFGIGNR FANRHTGSVE KYTEGRKIVH
IDIEPTQIGR VLCPDLGIVS DAKAALTLLV EVAQEMQKAG RLPCRKEWVA DCQQRKRTLL
RKTHFDNVPV KPQRVYEEMN KAFGRDVCYV TTIGLSQIAA AQMLHVFKDR HWINCGQAGP
LGWTIPAALG VCAADPERNV VAISGDFDFQ FLIEELAVGA QFNIPYIHVL VNNAYLGLIR
QSQRAFDMDY CVQLAFENIN SSEVNGYGVD HVKVAEGLGC KAIRVFKPED IAPAFEQAKV
LIAQYRVPVV VEVILERVTN ISMGSELDNV MEFEDIADNA ADAPTETCFM HYE
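
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It uses only the Python standard library and builds the codon table from the compact amino-acid string of the standard genetic code; translation table 11 assigns the same amino acids to codons and differs only in which codons may act as start codons. The function names `clean` and `translate` are hypothetical helpers written for this example, and only the first 60 bases and the last 12 bases of the gene sequence are used.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention with the first
# base varying slowest, matching the compact amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence (60 bases) and its final 12 bases.
first_60 = "ATGGCAAAAA TGAGAGCCGT TGACGCGGCA ATGTATGTGC TGGAGAAAGA AGGTATCATC"
last_12 = "CACTATGAAT AA"

print(translate(first_60))  # compare with the start of the protein sequence
print(translate(last_12))   # compare with the end of the protein sequence
```

The first call reproduces the first 20 residues of the protein sequence (MAKMRAVDAA MYVLEKEGII without the display spacing), and the second reproduces its final three residues, HYE, followed by the TAA stop codon.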