Gene Cphy_3207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3207 
Symbol 
ID5741985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3910130 
End bp3911275 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content39% 
IMG OID641294307 
Productglycoside hydrolase family protein 
Protein accessionYP_001560300 
Protein GI160881332 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3405] Endoglucanase Y 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGTG CAATAAAAAG TGGGTATTAC CGGAATGTGT TTACCGAATT AGGCTATAAA 
GAAGAGGACG TAACTAAGAA AGTTGAGGAC AGCTTTCAGA CCTTGTTCTA TGGTCTACCA
GAAGAACGCA TATATTATCC TGTGGGTGAA GATTTAGGAT ATATCGTGGA TACCGGAAAT
CATGATGTTC GAACAGAAGG AATGTCTTAT GGAATGATGA TGTGTTTGCA ATTAGATAAA
AAAGAGGAAT TTGATCGCTT ATGGAAATGG GCTAAGACAT ATATGTTTAT GGATTCTGGT
GTTAACAAAG GCTATTTTGC TTGGTCATGT AAAACAGATG GTACGAAAAA TTCCTATGGA
CCAGCACCGG ATGGTGAGGA GTATTTTGCT TTAGCATTAT TCTTTGCTTC CAACCGCTGG
GGTGATGGAA ATGGAATTTT CGAATATAGT AAACAAGCGA GAGAACTTCT TCATGAATGT
ATCCATAAAG GAGAAGAGGA TGGAATCGGA GAACCTATGT GGGAGCCATC GAACTACCTG
ATAAAATTCA TACCTAATTG CAATTTTACA GATCCATCGT ATCATTTACC ACACTTTTAC
GGGTTATTTG CACTTTGGGC ATATGAAGAA GATAGAGAAT TCTTTAAAAA GGCAGCGGAA
GCGAGTCGTT CGTATCTAAA ACTTGCATGT CATGAAAAGA CTGGGCTTTG CGCAGAATAC
ACAGAGTACG ATGGAACTGC TCATAGTGGA GATCAAGAAA TATTTGGACG CCATGATTGG
TATTACAGCG ATGCTTATCG AACCATTGCA AACATTGGTC TTGATTATCT CTGGTTTGCA
GCTGATGAAT GGCAAGTAAC ATGCGCTAAT CATTTACAGC AATTCTTTTG TGAGACAGTA
AAAGAGCATG CAAGTGGTAT TTATCAAGTA GATGGAACGA TAATCAAGGG CGAGGCTTTA
CACCCTGTTG CAATAATCGC TACCAATGCG CAAGCATCTT TAGCAGCAAA TGGTCCCTTT
GCGAAAGAAT GTGTCGACAA GTTTTATCAT ACAGAATTAA GGACTGGCGA TAGAAGATAT
TACGACAATT GCTTGTATAT GTTTGCACTC TTAGCTTTAA GTGGAAAGTA TCGTATGTGG
ATGTAA
 
Protein sequence
MNGAIKSGYY RNVFTELGYK EEDVTKKVED SFQTLFYGLP EERIYYPVGE DLGYIVDTGN 
HDVRTEGMSY GMMMCLQLDK KEEFDRLWKW AKTYMFMDSG VNKGYFAWSC KTDGTKNSYG
PAPDGEEYFA LALFFASNRW GDGNGIFEYS KQARELLHEC IHKGEEDGIG EPMWEPSNYL
IKFIPNCNFT DPSYHLPHFY GLFALWAYEE DREFFKKAAE ASRSYLKLAC HEKTGLCAEY
TEYDGTAHSG DQEIFGRHDW YYSDAYRTIA NIGLDYLWFA ADEWQVTCAN HLQQFFCETV
KEHASGIYQV DGTIIKGEAL HPVAIIATNA QASLAANGPF AKECVDKFYH TELRTGDRRY
YDNCLYMFAL LALSGKYRMW M