Gene EcHS_A0581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0581 
Symbolgcl 
ID5591961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp593493 
End bp595274 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content53% 
IMG OID640919765 
Productglyoxylate carboligase 
Protein accessionYP_001457348 
Protein GI157160030 
COG category[R] General function prediction only 
COG ID[COG3960] Glyoxylate carboligase 
TIGRFAM ID[TIGR01504] glyoxylate carboligase 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAA TGAGAGCCGT TGACGCGGCA ATGTATGTGC TGGAGAAAGA AGGTATCACC 
ACCGCCTTCG GTGTTCCGGG AGCTGCAATC AATCCGTTCT ACTCAGCGAT GCGTAAGCAC
GGCGGTATTC GTCACATTCT GGCGCGTCAT GTGGAAGGTG CTTCGCACAT GGCGGAAGGT
TATACCCGCG CAACGGCAGG AAATATCGGC GTATGTCTGG GGACTTCCGG TCCTGCGGGC
ACGGACATGA TCACCGCGCT CTATTCCGCT TCTGCTGATT CCATTCCTAT TCTGTGCATT
ACCGGCCAGG CACCGCGCGC CCGTCTGCAT AAAGAAGATT TTCAGGCCGT AGATATTGAA
GCAATTGCTA AACCGGTCAG CAAAATGGCG GTTACAGTTC GTGAAGCGGC GCTGGTGCCT
CGCGTGCTGC AACAGGCATT TCACCTGATG CGTTCTGGTC GTCCGGGTCC GGTACTGGTG
GATTTACCGT TCGACGTTCA GGTTGCGGAA ATCGAGTTTG ATCCTGACAT GTACGAACCG
CTGCCGGTCT ACAAACCTGC TGCCAGCCGT ATGCAGATCG AAAAAGCTGT AGAAATGTTA
ATCCAGGCCG AACGTCCGGT GATTGTTGCC GGGGGCGGGG TAATTAATGC TGACGCAGCT
GCACTGTTAC AACAGTTTGC TGAACTGACC AGCGTTCCGG TGATCCCAAC GCTGATGGGC
TGGGGCTGTA TCCCGGACGA TCATGAACTG ATGGCCGGGA TGGTGGGTCT GCAAACCGCG
CATCGTTACG GTAACGCAAC GTTGCTGGCG TCCGACATGG TGTTTGGTAT CGGTAACCGT
TTTGCTAACC GTCATACCGG TTCGGTAGAG AAATACACCG AAGGGCGCAA AATCGTTCAT
ATCGATATTG AGCCGACGCA AATTGGCCGC GTGCTGTGTC CGGATCTGGG GATTGTCTCT
GATGCTAAAG CGGCGCTGAC ACTGCTGGTT GAAGTGGCGC AGGAAATGCA AAAAGCAGGG
CGTCTGCCAT GCCGTAAAGA GTGGGTTGCT GAGTGCCAGC AGCGCAAACG TACTTTGTTG
CGTAAAACAC ACTTCGACAA CGTGCCGGTG AAACCGCAGC GCGTGTATGA AGAGATGAAC
AAAGCTTTTG GACGTGATGT TTGCTATGTC ACCACCATTG GTCTGTCGCA AATTGCCGCT
GCGCAAATGC TGCATGTCTT TAAAGACCGC CACTGGATCA ACTGTGGTCA GGCTGGTCCG
TTAGGCTGGA CGATTCCGGC TGCGCTAGGG GTTTGTGCCG CTGATCCGAA ACGCAATGTG
GTGGCGATTT CTGGCGACTT TGACTTCCAG TTCCTGATTG AAGAGTTAGC CGTTGGCGCG
CAGTTCAAAA TTCCGTACAT CCATGTACTG GTCAATAACG CTTATCTGGG GCTGATTCGC
CAGTCGCAGC GCGCGTTTGA TATGGACTAC TGCGTGCAAC TCGCTTTCGA GAATATCAAC
TCCAGCGAAG TGAACGGTTA CGGCGTCGAC CACGTAAAAG TAGCGGAAGG TTTAGGTTGT
AAAGCGATTC GCGTCTTCAA ACCGGAAGAT ATTGCGCCAG CCTTTGAACA GGCGAAAGCC
TTAATGGCGC AATATCGGGT ACCGGTAGTC GTGGAAGTTA TTCTCGAGCG TGTGACCAAT
ATTTCGATGG GCAGCGAACT GGATAACGTC ATGGAATTTG AAGATATCGC CGATAACGCA
GCGGACGCAC CGACTGAAAC CTGCTTCATG CACTATGAAT AA
 
Protein sequence
MAKMRAVDAA MYVLEKEGIT TAFGVPGAAI NPFYSAMRKH GGIRHILARH VEGASHMAEG 
YTRATAGNIG VCLGTSGPAG TDMITALYSA SADSIPILCI TGQAPRARLH KEDFQAVDIE
AIAKPVSKMA VTVREAALVP RVLQQAFHLM RSGRPGPVLV DLPFDVQVAE IEFDPDMYEP
LPVYKPAASR MQIEKAVEML IQAERPVIVA GGGVINADAA ALLQQFAELT SVPVIPTLMG
WGCIPDDHEL MAGMVGLQTA HRYGNATLLA SDMVFGIGNR FANRHTGSVE KYTEGRKIVH
IDIEPTQIGR VLCPDLGIVS DAKAALTLLV EVAQEMQKAG RLPCRKEWVA ECQQRKRTLL
RKTHFDNVPV KPQRVYEEMN KAFGRDVCYV TTIGLSQIAA AQMLHVFKDR HWINCGQAGP
LGWTIPAALG VCAADPKRNV VAISGDFDFQ FLIEELAVGA QFKIPYIHVL VNNAYLGLIR
QSQRAFDMDY CVQLAFENIN SSEVNGYGVD HVKVAEGLGC KAIRVFKPED IAPAFEQAKA
LMAQYRVPVV VEVILERVTN ISMGSELDNV MEFEDIADNA ADAPTETCFM HYE