Gene EcSMS35_4866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4866 
SymbolgcxC 
ID6147418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4975189 
End bp4976928 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content48% 
IMG OID641619670 
Productglyoxylate carboligase 
Protein accessionYP_001746777 
Protein GI170682697 
COG category[R] General function prediction only 
COG ID[COG3960] Glyoxylate carboligase 
TIGRFAM ID[TIGR01504] glyoxylate carboligase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.553307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGGA TGCGTGCAAT CGAAGCTGCA GTAGAAATAC TGAAAAAAGA AGGCATTAGC 
GTTGCTTTTG GTGTACCCGG CGCAGCAATC AACCCACTCT ATGCCGCGAT GAAAAAATCA
GGCGGTATTG ATCATATTCT GGCCCGTCAT GTTGAGGGCG CCTCGCATAT GGCTGAAGGT
TACACCCGCA GTCAGAATGG CAATATTGGC GTGTGTATTG GCACATCAGG CCCTGCGGGC
ACTGATATGA TTACGGGGCT TTATTCCGCC TCCGCGGATT CCATTCCTAT TCTCTGCATT
ACTGGTCAAG CTCCCGTAGG AAAACTTCAT AAAGAAGATT TCCAGGCCGT CGATATTGAG
GCTATCGCCA CGCCAGTAAC CAAAATGGCT CGCACAATTC TCGAAGCGGG TCAGTTGCCG
GGGATATTCC AGAAAGCCTT CTGGGAAATG CGTTCAGGCC GCCCTGGCCC GGTTCTTTTA
GATCTCCCTT TTGATGTTCA GATGACCGAA ATTGAGTTTG ATATCGATCT CTATCAGCCA
CTAACCCCCT GGCAACCTCA GGCAACACGG GCCCAGGCCG TACGCGCATT AGAAATGCTA
AATGGCGCCG AAAAGCCCGT CATCATCGCT GGCGGTGGCA TTATTAACGC CGAAGCGAGT
GAATTGTTGC GTGAGTTTGT CGAGTTAACT GGCGTTCCCG TCATTCAAAC CTTACGCGGT
TGGGGCGCAT TATCAGATGA TCATCCCCTA ATGATTGGTC GCATGGGATG TCAGGCGGGT
CATCGCTATG GTAACGCCAG TTATCTGGCC TCCGATTTTG TTTTTGGTAT CGGTAACCGT
TGGGCAAACC GTCATACCGG AGCCATTGAG ACCTACACCG AAGGCCGCAA ATTCATTCAT
GTTGATATCG AACCTGCACA AATTGGCCGG ATATTTGCGC CGGATCTGGG CATTGTTTCT
GATGCTGAAA GCGCCTTAAC GCTATTTATT CAGGTCGCCC GTGATATGAA GTCACGTGGA
GAACTGAAAG ACCGCAGTCG CTGGATTGCC GAATGTGCCG AGCGTAAACG CACAATGCTT
CGTCGTTCAG ACTTTGACTG CAATCCAATT AAACCGCAGC GCGTGTATCA CGAAATGAAT
AAGGTTTTCG GACCGGAGAC ACGCTATATT TCGACAATTG GTCTGGCGCA AATCGCCGCA
AACCAATTTT TACATGTTTA CCGCCCACGC CACTGGATTA ATGCCTGTCA GGCTGGGCCG
CTAGGCTGGA CAATGCCCGC AGCACTGGGC GCGGTAAAAG CTGATCCATC TGTTCCCGTT
GTGGCAATAT CCGGTGATTA CGATTTTCAA TTCTTGATTG AAGAATTAGC CGTCGGCGCA
CAATTTAATC TTCCGTATAT CCATATTTTA CTTAACAACG CTTATCTTGG TTTGATTCGT
CAATCACAAC GTGCATTTGA TATTGATTAT TGCGTTCAAC TGTCATTTGA AAATATTAAT
GCTCCAGAAA TTAATGGTTA TGGCGTTGAT CATAAAGCTG TGGTTGAAGG ATTAGGATGT
AAAGCGATTC GCGTATTCGC ATCACAGGAT ATTGCGCCTG CACTGCAAGA AGCCCAGCGT
TTGCGTGACG AATTTCACGT ACCTGTTGTT GTGGAAATTA TTGCTGAACG CGTGACTAAT
ATCGCTATGG GACCTGACAT TAATAAAGTC ACAGAATTTG AAGAAATTCT CGATTTATAA
 
Protein sequence
MARMRAIEAA VEILKKEGIS VAFGVPGAAI NPLYAAMKKS GGIDHILARH VEGASHMAEG 
YTRSQNGNIG VCIGTSGPAG TDMITGLYSA SADSIPILCI TGQAPVGKLH KEDFQAVDIE
AIATPVTKMA RTILEAGQLP GIFQKAFWEM RSGRPGPVLL DLPFDVQMTE IEFDIDLYQP
LTPWQPQATR AQAVRALEML NGAEKPVIIA GGGIINAEAS ELLREFVELT GVPVIQTLRG
WGALSDDHPL MIGRMGCQAG HRYGNASYLA SDFVFGIGNR WANRHTGAIE TYTEGRKFIH
VDIEPAQIGR IFAPDLGIVS DAESALTLFI QVARDMKSRG ELKDRSRWIA ECAERKRTML
RRSDFDCNPI KPQRVYHEMN KVFGPETRYI STIGLAQIAA NQFLHVYRPR HWINACQAGP
LGWTMPAALG AVKADPSVPV VAISGDYDFQ FLIEELAVGA QFNLPYIHIL LNNAYLGLIR
QSQRAFDIDY CVQLSFENIN APEINGYGVD HKAVVEGLGC KAIRVFASQD IAPALQEAQR
LRDEFHVPVV VEIIAERVTN IAMGPDINKV TEFEEILDL