Gene Cphy_1750 details


Gene Information

Locus tag: Cphy_1750
Symbol:
ID: 5741424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2158070
End bp: 2159029
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 41%
IMG OID: 641292850
Product: glycosyl hydrolase family protein
Protein accession: YP_001558861
Protein GI: 160879893
COG category: [R] General function prediction only
COG ID: [COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins
TIGRFAM ID:
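
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2158070, 2159029)
print(length)                  # 960
print(protein_length(length))  # 319
```

Under these assumptions the reported 960 bp gene length and 319 aa protein length agree with the listed coordinates.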


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTAGAT CGGAAAAATT AGAGAAAGTA AAAAAGGCTA CACTTGCAAT GCAACGCTGG 
CCTTGGGAGC AGGGGGTAGT AGCACAAGCA TTCTTCGAGG CTGGCGATAT TGAAATGGCC
ACTCTTATGG CTAGAGAAGC AGTGACCAAT CAATGTAAAG ATGGAAGACT CGGAATGAAG
TATGAAAGAG GAGCAGCTAC CGATCCTGCA GCAAACGGAG AGGTAGTGTT GCGTGCAGCT
GAAATTACAG GAGAGGAAAT TTTCAAGACT GCAGTTCAAA AAATGCTTAA TTATCTGCTT
TACAGGGCTC CAAAATCGAA AGACGGTATA ATTTATCACA ACGAGAATGA GGGCAAAATA
TGGGTTGATT CTTTCTATAT GGCACCACCA TTTTTAGCGG TAGCAGGATA TCCCAAAGAA
GCTGTAAAAC AGATAGAAGG TTATAGAAAA TATTTATGGA ATAACGAAAA GATGCTTTAT
GCTCACCAGT GGGATGATAA CCTTGGTCTA TTTTCTCGTG AACTCCACTG GGGAGTAGGA
AACGGTTGGG CAGCAGCCGG TATCGTCAGA GTATTAATAG CCTTGCCTGA GAAAATGCAA
ACTGAGAAAA GTAATCTAAT AGAGTATTTA AAGGAAATAA TAGATGGTTG CCTGGCTCAC
CAATGTGAAA ATGGTTTGTT TCATGATATT GTAGACGATC CCTCAACATT TATTGATTCA
AATCTAGCTC AAATGCTCAG TTATTCAATC TACAGGAGTG TTAAAAATGG CTGGTTAAAT
ACAGAGTATA TTGAATATGC TGACAAAATG CGAGAAGCTG CATGTCTAAG GGTGGATCAG
AATGGCCTAA TTCAAGGCTC CTGCGGAGTA CCTGATTTTA ACGCCCCAGC AACGGCACCA
GAAATTCAGG CGTTTTATAT TCTAATGGAG GCTGCTTATG ATGACTATTA CATGTCTTGA
 
Protein sequence
MLRSEKLEKV KKATLAMQRW PWEQGVVAQA FFEAGDIEMA TLMAREAVTN QCKDGRLGMK 
YERGAATDPA ANGEVVLRAA EITGEEIFKT AVQKMLNYLL YRAPKSKDGI IYHNENEGKI
WVDSFYMAPP FLAVAGYPKE AVKQIEGYRK YLWNNEKMLY AHQWDDNLGL FSRELHWGVG
NGWAAAGIVR VLIALPEKMQ TEKSNLIEYL KEIIDGCLAH QCENGLFHDI VDDPSTFIDS
NLAQMLSYSI YRSVKNGWLN TEYIEYADKM REAACLRVDQ NGLIQGSCGV PDFNAPATAP
EIQAFYILME AAYDDYYMS
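
The gene and protein sequences above are related through translation table 11, which assigns amino acids to codons in the same way as the standard genetic code (the tables differ in which start codons they permit). The following sketch shows one way to strip the 10-base grouping from the pasted sequence, compute a GC percentage, and translate the reading frame. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the code is an illustration under those assumptions rather than the method used to generate this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence as pasted above.
print(translate("ATGCTTAGAT CGGAAAAATT AGAGAAAGTA"))  # MLRSEKLEKV
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and the GC content field listed in this record.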