Gene EcHS_A3152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3152 
SymbolglcD 
ID5595188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3164156 
End bp3165646 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content57% 
IMG OID640922272 
Productglycolate oxidase subunit GlcD 
Protein accessionYP_001459770 
Protein GI157162452 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID[TIGR00387] glycolate oxidase, subunit GlcD 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTACGAAG AGCGTCTTGA TGGCGCTTTA CCCGATGTCG ACCGCACATC GGTACTGATG 
GCACTGCGTG AGCATGTCCC TGGACTTGAG ATCCTGCATA CCGATGAGGA GATCATTCCT
TACGAGTGTG ACGGGTTGAG CGCGTATCGC ACGCGTCCAT TACTGGTTGT TCTGCCTAAG
CAAATGGAAC AGGTGACAGC GATTCTGGCT GTCTGCCATC GCCTGCGTGT ACCGGTGGTG
ACCCGTGGTG CAGGCACCGG GCTTTCTGGT GGCGCGCTGC CGCTGGAAAA AGGTGTGTTG
TTGGTGATGG CGCGCTTTAA AGAGATCCTC GACATTAACC CCGTTGGTCG CCGCGCGCGC
GTGCAGCCAG GCGTGCGTAA CCTGGCGATC TCCCAGGCCG TTGCACCGCA TAATCTCTAC
TACGCACCGG ACCCTTCCTC ACAAATCGCC TGTTCCATTG GCGGCAATGT GGCTGAAAAT
GCCGGCGGCG TCCACTGCCT GAAATATGGT CTGACCGTAC ATAACCTGCT GAAAATTGAA
GTGCAAACGC TGGACGGCGA GGCACTGACG CTGGGATCGG ACGCGCTGGA TTCACCTGGT
TTTGACCTGC TGGCGCTGTT CACCGGATCG GAAGGTATGC TCGGCGTGAC CACCGAAGTG
ACGGTAAAAC TGCTGCCGAA GCCGCCCGTG GCGCGGGTTC TGTTAGCCAG CTTTGACTCG
GTAGAAAAAG CCGGACTTGC GGTTGGTGAC ATCATCGCCA ATGGCATTAT CCCCGGCGGG
CTGGAGATGA TGGATAACCT GTCGATCCGC GCGGCGGAAG ATTTTATTCA TGCCGGTTAT
CCCGTCGACG CCGAAGCGAT TTTGTTATGC GAGCTGGACG GCGTGGAGTC TGACGTACAG
GAAGACTGCG AGCGGGTTAA CGACATCTTG TTGAAAGCGG GCGCGACTGA CGTCCGTCTG
GCACAGGACG AAGCAGAGCG CGTACGTTTC TGGGCCGGTC GCAAAAATGC GTTCCCGGCG
GTAGGACGTA TCTCCCCGGA TTACTACTGC ATGGATGGCA CCATCCCGCG TCGCGCCCTG
CCTGGCGTAC TGGAAGGCAT TGCCCGTTTA TCGCAGCAAT ATGATTTACG TGTTGCCAAC
GTCTTTCATG CCGGAGATGG CAACATGCAC CCGTTAATCC TTTTCGATGC CAACGAACCC
GGTGAATTTG CCCGCGCGGA AGAGCTGGGC GGGAAGATCC TCGAACTCTG CGTTGAAGTT
GGCGGCAGCA TCAGTGGCGA ACATGGCATC GGGCGAGAAA AAATCAATCA AATGTGCGCC
CAGTTCAACA GCGATGAAAT CACGACCTTC CATGCGGTCA AGGCGGCGTT TGACCCCGAT
GGTTTGCTGA ACCCTGGGAA AAACATTCCC ACGCTACACC GCTGTGCTGA ATTTGGTGCC
ATGCATGTGC ATCACGGTCA TTTACCTTTC CCTGAACTGG AGCGTTTCTG A
 
Protein sequence
MYEERLDGAL PDVDRTSVLM ALREHVPGLE ILHTDEEIIP YECDGLSAYR TRPLLVVLPK 
QMEQVTAILA VCHRLRVPVV TRGAGTGLSG GALPLEKGVL LVMARFKEIL DINPVGRRAR
VQPGVRNLAI SQAVAPHNLY YAPDPSSQIA CSIGGNVAEN AGGVHCLKYG LTVHNLLKIE
VQTLDGEALT LGSDALDSPG FDLLALFTGS EGMLGVTTEV TVKLLPKPPV ARVLLASFDS
VEKAGLAVGD IIANGIIPGG LEMMDNLSIR AAEDFIHAGY PVDAEAILLC ELDGVESDVQ
EDCERVNDIL LKAGATDVRL AQDEAERVRF WAGRKNAFPA VGRISPDYYC MDGTIPRRAL
PGVLEGIARL SQQYDLRVAN VFHAGDGNMH PLILFDANEP GEFARAEELG GKILELCVEV
GGSISGEHGI GREKINQMCA QFNSDEITTF HAVKAAFDPD GLLNPGKNIP TLHRCAEFGA
MHVHHGHLPF PELERF