Gene EcSMS35_3257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3257 
SymbolglcD 
ID6146214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3334633 
End bp3336123 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content57% 
IMG OID641618087 
Productglycolate oxidase subunit GlcD 
Protein accessionYP_001745237 
Protein GI170682557 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID[TIGR00387] glycolate oxidase, subunit GlcD 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTACGAAG AGCGTCTTGA TGGCGCTTTA CCCGATGTCG ACCGCACATC GGTACTGATG 
GCACTGCGTG AGCATGTCCC TGGACTGGAG ATCCTGCATA CCGATGAGGA GATCATTCCT
TACGAGTGTG ACGGGTTGAG CGCGTATCGC ACGCGTCCAT TACTGGTTGT TCTGCCTAAG
CAGATGGAAC AGGTGACGGC GATTCTGGCT GTCTGCCACC GCCTGCGTGT ACCGGTGGTG
ACCCGTGGTG CAGGCACCGG GCTTTCTGGT GGCGCGCTGC CGCTGGAAAA AGGTGTGTTG
TTGGTGATGG CGCGCTTTAA AGAGATCCTC GACATTAACC CCGTTGGTCG CCGTGCGCGC
GTGCAGCCGG GCGTGCGTAA CCTGGCGATC TCCCAGGCCG TTGCGCCGCA TAATCTCTAT
TACGCGCCGG ATCCGTCCTC ACAAATCGCC TGTTCGATTG GCGGCAATGT GGCGGAAAAT
GCCGGTGGCG TTCACTGCCT GAAATATGGT CTAACCGTAC ATAACCTGCT GAAAATTGAA
GTGCAAACGC TGGACGGCGA GGCGCTGACG CTGGGATCGG ACGCGCTGGA TTCGCCTGGT
TTTGACCTGC TGGCGCTGTT TACCGGATCG GAAGGTATGC TCGGCGTGAC CACCGAAGTG
ACGGTAAAAC TGCTGCCGAA GCCGCCCGTG GCGCGGGTGC TGTTAGCCAG CTTTGACTCG
GTAGAAAAAG CCGGACTTGC GGTTGGTGAC ATCATCGCCA ATGGCATTAT CCCCGGCGGG
CTGGAGATGA TGGATAACCT GTCGATTCGC GCGGCGGAAG ATTTTATTCA TGCCGGTTAT
CCCGTCGATG CCGAAGCGAT TTTGTTATGC GAACTGGACG GCGTGGAGTC TGACGTACAG
GAAGACTGCG AGCAGGTTAA CGACATCTTG TTGAACGCGG GTGCGACTGA CGTCCGTCTG
GCACAGGACG AAGCAGAGCG AGTACGTTTC TGGGCCGGTC GCAAAAATGC GTTCCCGGCG
GTAGGACGTA TCTCCCCGGA TTACTACTGC ATGGATGGCA CCATCCCGCG CCGCGCCCTG
CCTGGCGTAC TGGAAGGCAT TGCCCGTTTA TCGCAGCAAT ATGATTTACG CGTTGCCAAC
GTCTTTCATG CCGGAGACGG CAACATGCAC CCGTTAATCC TTTTCGATGC CAACGAACCC
GGTGAATTTG CCCGCGCGGA AGAGCTGGGC GGGAAGATCC TCGAACTCTG CGTTGAAGTT
GGCGGCAGCA TCAGTGGGGA ACATGGCATT GGGCGCGAAA AAATCAATCA AATGTGCGCC
CAGTTCAACA GCGATGAAAT CACGACCTTC CATGCGGTCA AGGCGGCGTT TGACCCCGAT
GGTTTGCTGA ACCCAGGAAA AAACATTCCC ACGCTACACC GCTGTGCTGA ATTTGGTGCC
ATGCATGTGC ATCACGGTCA TTTACCTTTT CCTGAACTGG AGCGTTTCTG A
 
Protein sequence
MYEERLDGAL PDVDRTSVLM ALREHVPGLE ILHTDEEIIP YECDGLSAYR TRPLLVVLPK 
QMEQVTAILA VCHRLRVPVV TRGAGTGLSG GALPLEKGVL LVMARFKEIL DINPVGRRAR
VQPGVRNLAI SQAVAPHNLY YAPDPSSQIA CSIGGNVAEN AGGVHCLKYG LTVHNLLKIE
VQTLDGEALT LGSDALDSPG FDLLALFTGS EGMLGVTTEV TVKLLPKPPV ARVLLASFDS
VEKAGLAVGD IIANGIIPGG LEMMDNLSIR AAEDFIHAGY PVDAEAILLC ELDGVESDVQ
EDCEQVNDIL LNAGATDVRL AQDEAERVRF WAGRKNAFPA VGRISPDYYC MDGTIPRRAL
PGVLEGIARL SQQYDLRVAN VFHAGDGNMH PLILFDANEP GEFARAEELG GKILELCVEV
GGSISGEHGI GREKINQMCA QFNSDEITTF HAVKAAFDPD GLLNPGKNIP TLHRCAEFGA
MHVHHGHLPF PELERF