Gene Cphy_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0203 
Symbol 
ID5745071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp252199 
End bp253326 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content36% 
IMG OID641291293 
Productglycosy hydrolase family protein 
Protein accessionYP_001557329 
Protein GI160878361 
COG category[R] General function prediction only 
COG ID[COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000431819 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATGA TTGTAAAGTA CATTAATGAG TTACTTGATA AGAGTACACC AGAAGTACCG 
ATGTGGAACA TAGAAAAAAT TAAGAGCGGC GAAAAATCAG AATGGAACTA CATTGACGGT
TGTATGATTA AGGCTGTTCT TGAGATGTAC GCAATAACAA AAGAAGAGAA GTACCTAAAA
TTTGCAGATG ATTTTATTGA TTATCGTGTG GATGAGGAAG GTAATATTTC CGGGTATGAA
GTGGAAAAGT TCAACATTGA CGATGTAAAT GCAGGTAAAA CATTATTTGA ACTTTATGAT
TTAACTGGGA AAGAAAAGTA CCGCAAAGCA ATTGATATCA TTTATAAGCA AGTAAAAACA
CAGCCAAGAA CTAGAGAAGG TAACTTTTGG CATAAACTAA TTTATCCTCA ACAGGTATGG
TTGGATGGTT TATATATGGG TCAGCCATTT TACATGGAAT ATGAGACTCG TTTTAATAAT
AAAAAGAACT ATGAGGATAT CTTTCATCAG TTCTTTAATG TATATGAGAT GTTAAGAGAT
GAAAAGACTG GTTTATATTA TCATGCATTT GACTCTTCAA GAGAAATGTT CTGGTGTGAC
AAAGAAACAG GATTATCCAA GCATTTTTGG TTAAGAGCTC TTGGCTGGTA TGCGATGGCA
CTCTTAGATA CTTTAGATAA GTGCGAGCCA ACTGGTTATG AGAAAGAGTA TGAAAGATTA
AAGCAAATCT TTATTGAATA TATGGAAACA ATTTTAAAAT ATCAGGATGA AAGCGGTATG
TGGTATCAGA TTCCTGATAT GGGTGGACGT GAGCGCAACT ACCTTGAGAC AAGCGGAAGT
TCTATCATGG CATACGCATT ACTAAAGGGT GTACGTCTTG GTTTCTTACC AGAGAGCTAT
CGTGAGAATG CAAAGAAGGC AATGGACGGT ATCTGTGAGA AATACCTTCA TACAGAAGAA
GGCAAGATGA GCCTTGGAGG AATTTGTCTT GTGGCTGGTC TTGGCGGTAA GCAAATGAGA
GACGGTACTT ATGATTATTA CATGTCGGAG CCTATTGTAA AAGACGACGC TAAGGGTGTT
GGACCATTCC TATTAGCATA TACAGAATTA CTTCGTCTTC AGAAATAA
 
Protein sequence
MDMIVKYINE LLDKSTPEVP MWNIEKIKSG EKSEWNYIDG CMIKAVLEMY AITKEEKYLK 
FADDFIDYRV DEEGNISGYE VEKFNIDDVN AGKTLFELYD LTGKEKYRKA IDIIYKQVKT
QPRTREGNFW HKLIYPQQVW LDGLYMGQPF YMEYETRFNN KKNYEDIFHQ FFNVYEMLRD
EKTGLYYHAF DSSREMFWCD KETGLSKHFW LRALGWYAMA LLDTLDKCEP TGYEKEYERL
KQIFIEYMET ILKYQDESGM WYQIPDMGGR ERNYLETSGS SIMAYALLKG VRLGFLPESY
RENAKKAMDG ICEKYLHTEE GKMSLGGICL VAGLGGKQMR DGTYDYYMSE PIVKDDAKGV
GPFLLAYTEL LRLQK