Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0203 |
Symbol | |
ID | 5745071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 252199 |
End bp | 253326 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641291293 |
Product | glycosy hydrolase family protein |
Protein accession | YP_001557329 |
Protein GI | 160878361 |
COG category | [R] General function prediction only |
COG ID | [COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000431819 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATGA TTGTAAAGTA CATTAATGAG TTACTTGATA AGAGTACACC AGAAGTACCG ATGTGGAACA TAGAAAAAAT TAAGAGCGGC GAAAAATCAG AATGGAACTA CATTGACGGT TGTATGATTA AGGCTGTTCT TGAGATGTAC GCAATAACAA AAGAAGAGAA GTACCTAAAA TTTGCAGATG ATTTTATTGA TTATCGTGTG GATGAGGAAG GTAATATTTC CGGGTATGAA GTGGAAAAGT TCAACATTGA CGATGTAAAT GCAGGTAAAA CATTATTTGA ACTTTATGAT TTAACTGGGA AAGAAAAGTA CCGCAAAGCA ATTGATATCA TTTATAAGCA AGTAAAAACA CAGCCAAGAA CTAGAGAAGG TAACTTTTGG CATAAACTAA TTTATCCTCA ACAGGTATGG TTGGATGGTT TATATATGGG TCAGCCATTT TACATGGAAT ATGAGACTCG TTTTAATAAT AAAAAGAACT ATGAGGATAT CTTTCATCAG TTCTTTAATG TATATGAGAT GTTAAGAGAT GAAAAGACTG GTTTATATTA TCATGCATTT GACTCTTCAA GAGAAATGTT CTGGTGTGAC AAAGAAACAG GATTATCCAA GCATTTTTGG TTAAGAGCTC TTGGCTGGTA TGCGATGGCA CTCTTAGATA CTTTAGATAA GTGCGAGCCA ACTGGTTATG AGAAAGAGTA TGAAAGATTA AAGCAAATCT TTATTGAATA TATGGAAACA ATTTTAAAAT ATCAGGATGA AAGCGGTATG TGGTATCAGA TTCCTGATAT GGGTGGACGT GAGCGCAACT ACCTTGAGAC AAGCGGAAGT TCTATCATGG CATACGCATT ACTAAAGGGT GTACGTCTTG GTTTCTTACC AGAGAGCTAT CGTGAGAATG CAAAGAAGGC AATGGACGGT ATCTGTGAGA AATACCTTCA TACAGAAGAA GGCAAGATGA GCCTTGGAGG AATTTGTCTT GTGGCTGGTC TTGGCGGTAA GCAAATGAGA GACGGTACTT ATGATTATTA CATGTCGGAG CCTATTGTAA AAGACGACGC TAAGGGTGTT GGACCATTCC TATTAGCATA TACAGAATTA CTTCGTCTTC AGAAATAA
|
Protein sequence | MDMIVKYINE LLDKSTPEVP MWNIEKIKSG EKSEWNYIDG CMIKAVLEMY AITKEEKYLK FADDFIDYRV DEEGNISGYE VEKFNIDDVN AGKTLFELYD LTGKEKYRKA IDIIYKQVKT QPRTREGNFW HKLIYPQQVW LDGLYMGQPF YMEYETRFNN KKNYEDIFHQ FFNVYEMLRD EKTGLYYHAF DSSREMFWCD KETGLSKHFW LRALGWYAMA LLDTLDKCEP TGYEKEYERL KQIFIEYMET ILKYQDESGM WYQIPDMGGR ERNYLETSGS SIMAYALLKG VRLGFLPESY RENAKKAMDG ICEKYLHTEE GKMSLGGICL VAGLGGKQMR DGTYDYYMSE PIVKDDAKGV GPFLLAYTEL LRLQK
|
| |