Gene EcolC_3115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3115 
Symbol 
ID6066333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3411846 
End bp3413627 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content53% 
IMG OID641602531 
Productglyoxylate carboligase 
Protein accessionYP_001726065 
Protein GI170021111 
COG category[R] General function prediction only 
COG ID[COG3960] Glyoxylate carboligase 
TIGRFAM ID[TIGR01504] glyoxylate carboligase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.252845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAA TGAGAGCCGT TGACGCGGCA ATGTATGTGC TGGAGAAAGA AGGTATCACC 
ACCGCCTTCG GTGTTCCGGG AGCTGCAATC AATCCGTTCT ACTCAGCGAT GCGTAAGCAC
GGCGGTATTC GTCACATTCT GGCGCGTCAT GTGGAAGGTG CTTCGCACAT GGCGGAAGGT
TATACCCGCG CAACGGCAGG AAATATCGGC GTATGTCTGG GGACTTCCGG TCCTGCGGGC
ACGGACATGA TCACCGCGCT CTATTCCGCT TCTGCTGATT CCATTCCTAT TCTGTGCATT
ACCGGCCAGG CACCGCGCGC CCGTCTGCAT AAAGAAGATT TTCAGGCCGT AGATATTGAA
GCAATTGCTA AACCGGTCAG CAAAATGGCG GTTACAGTTC GTGAAGCGGC GCTGGTGCCT
CGCGTGCTGC AACAGGCATT TCACCTGATG CGTTCTGGTC GTCCGGGTCC GGTACTGGTG
GATTTACCGT TCGACGTTCA GGTTGCGGAA ATCGAGTTTG ATCCTGACAT GTACGAACCG
CTGCCGGTCT ACAAACCTGC TGCCAGCCGT ATGCAGATCG AAAAAGCTGT AGAAATGTTA
ATCCAGGCCG AACGTCCGGT GATTGTTGCC GGGGGCGGGG TAATTAATGC TGACGCAGCT
GCACTGTTAC AACAGTTTGC TGAACTGACC AGCGTTCCGG TGATCCCAAC GCTGATGGGC
TGGGGCTGTA TCCCGGACGA TCATGAACTG ATGGCCGGGA TGGTGGGTCT GCAAACCGCG
CATCGTTACG GTAACGCAAC GTTGCTGGCG TCCGACATGG TGTTTGGTAT CGGTAACCGT
TTTGCTAACC GTCATACCGG TTCGGTAGAG AAATACACCG AAGGGCGCAA AATCGTTCAT
ATCGATATTG AGCCGACGCA AATTGGTCGC GTGCTGTGTC CGGATCTCGG CATTGTCTCT
GATGCTAAAG CGGCGCTGAC ACTGCTGGTT GAAGTGGCGC AGGAGATGCA AAAAGCGGGT
CGTCTGCCGT GTCGTAAAGA ATGGGTCGCC GACTGCCAGC AGCGCAAACG CACTTTGCTG
CGCAAAACCC ATTTCGACAA CGTGCCGGTG AAACCGCAGC GCGTGTATGA AGAGATGAAC
AAAGCCTTTG GTCGCGATGT TTGCTATGTC ACCACCATTG GTCTGTCACA AATCGCTGCG
GCACAAATGC TGCATGTCTT TAAAGACCGC CACTGGATCA ACTGTGGTCA GGCTGGTCCG
TTAGGCTGGA CGATTCCGGC GGCGCTGGGG GTTTGTGCCG CTGATCCGAA ACGCAATGTG
GTGGCGATTT CTGGCGACTT TGACTTCCAG TTCCTGATTG AAGAGTTAGC CGTTGGCGCG
CAGTTCAACA TTCCGTACAT CCATGTGCTG GTGAATAACG CCTATCTCGG CCTGATTCGT
CAGTCACAAC GCGCTTTTGA CATGGACTAC TGCGTGCAAC TCGCTTTCGA GAATATCAAC
TCCAGTGAAG TGAATGGCTA CGGTGTTGAC CACGTAAAAG TAGCGGAAGG TTTAGGTTGT
AAAGCTATTC GGGTCTTCAA ACCGGAAGAT ATTGCGCCAG CCTTTGAACA GGCGAAAGCC
TTAATGGCGC AATATCGGGT ACCGGTAGTC GTGGAAGTTA TTCTCGAGCG TGTGACCAAT
ATTTCGATGG GCAGCGAACT GGATAACGTC ATGGAATTTG AAGATATCGC CGATAACGCA
GCGGACGCAC CGACTGAAAC CTGCTTCATG CACTATGAAT AA
 
Protein sequence
MAKMRAVDAA MYVLEKEGIT TAFGVPGAAI NPFYSAMRKH GGIRHILARH VEGASHMAEG 
YTRATAGNIG VCLGTSGPAG TDMITALYSA SADSIPILCI TGQAPRARLH KEDFQAVDIE
AIAKPVSKMA VTVREAALVP RVLQQAFHLM RSGRPGPVLV DLPFDVQVAE IEFDPDMYEP
LPVYKPAASR MQIEKAVEML IQAERPVIVA GGGVINADAA ALLQQFAELT SVPVIPTLMG
WGCIPDDHEL MAGMVGLQTA HRYGNATLLA SDMVFGIGNR FANRHTGSVE KYTEGRKIVH
IDIEPTQIGR VLCPDLGIVS DAKAALTLLV EVAQEMQKAG RLPCRKEWVA DCQQRKRTLL
RKTHFDNVPV KPQRVYEEMN KAFGRDVCYV TTIGLSQIAA AQMLHVFKDR HWINCGQAGP
LGWTIPAALG VCAADPKRNV VAISGDFDFQ FLIEELAVGA QFNIPYIHVL VNNAYLGLIR
QSQRAFDMDY CVQLAFENIN SSEVNGYGVD HVKVAEGLGC KAIRVFKPED IAPAFEQAKA
LMAQYRVPVV VEVILERVTN ISMGSELDNV MEFEDIADNA ADAPTETCFM HYE