Gene Cphy_1937 details


Gene Information

Locus tag: Cphy_1937
Symbol:
ID: 5744616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2394710
End bp: 2395993
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 42%
IMG OID: 641293033
Product: glycoside hydrolase family protein
Protein accession: YP_001559044
Protein GI: 160880076
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
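
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2394710, 2395993)
print(length, protein_length_aa(length))  # 1284 427
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of this record.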


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.607025
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTTC AATTATTACC TAATATGCAG CTTGGAGTAG CTAGTGCACC TGCCCAGATT 
GAAGGCGGAG ATGTTAATCA TAACTGGAAC AATTGGTACC ACCTTGGGCA TATTAAGGAT
GCTTCAAGTC CACAGCGGGC GAATCAGCAC TGGGAACACT GGCAGGAAGA CATTGAATTA
ATGCATAGCA TGGGAGTCAA GAGATATCGC CTCGGCATCG AATGGGCCCG AATAGAACCT
AGCGAGGGCA ATTGGAATAA AGAAGTGATA AAGCATTATC GTAAGTTGCT GACCTTTATG
AAAAGCCAAG GAATTGAGCC GTTGTTAACC CTGCACCATT TTACAAATCC AATGTGGTTT
GAAAAAAAGG AAGGATTTAC GAAAGAACAA AACATTCCTG CTTTTCTACG CTATGTCTCC
TATGCTGTCC ATTCGTTTGG TGATTTGGTT TCAGAATATA TTACCATCAA CGAACCAAAT
GTCTATGCAA CTTTGGGGTA TTACGGCGGA GGATTTCCTC CGGGAGATAA TTCCGTCCAA
TTGACTTCTA AGGTAATGTC CGTCATGGCA ACCTGCCATA TCAAATCCTA CCGTATGATT
CATAGAATCC GCAGTAAGAT GGGATATACC GACACTAAGG TTTCGTTTGC CCACCATGCA
CGCGTATTTG CACCAGAAAA TCCAAGGAAT CCTCTTCACA TCTCCTATAC AGTACTTTCC
AAATGGATGT TTCAGGGTGC CCTTGCAAAG GCTTGTTTGA CCGGACAATT TTTACCTCCA
CTAAGAAATA TTAATCATGT CCCTCGCGGA CAATATGCTG ATTTTCTTGG GTTAAATTAC
TACACCCGTT CCACGATCAG TAAGCTGGGT GACGGAGTAG CAAATGATGG TCCGAAGAAT
GATCTTGGAT GGGAAATATA TCCTCATGGA ATCGTTTCTT GTGCACAAGA ATTGTATTCT
ATTCTGAAAC GCCCAATCTA TATTACCGAA AATGGTACCT GTGACAATCA GGATACCTTT
CGTTCCCGCT ATATTTATGA GCATTTGAAA GCCTTGTGTG CAAGTAATCT TCCTATAACT
CGCTACTATC ATTGGTGCTT CTGTGACAAT TTTGAATGGC TAGAGGGGGA AAGTGCACGC
TTTGGAATTG TACATATTGA TTACGAAACA CAGAAACGAA CCATAAAGCA GAGCGGTCGC
TTTTACAATG AAATCATAGA ACAGGGAGGC GTAACGGAAC AACTCTATGA AAAATACGTC
CATGAGGAGG AATACCATTC ATGA
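
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. A minimal sketch, assuming the text contains only A, C, G, T and whitespace; the helper names are hypothetical, and the example uses just the first two blocks of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = clean_sequence("ATGGCATTTC AATTATTACC")
print(len(first_blocks), gc_percent(first_blocks))  # 20 30.0
```

Applying the same two functions to the full sequence text gives values to compare against the Gene Length and GC content fields.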
 
Protein sequence
MAFQLLPNMQ LGVASAPAQI EGGDVNHNWN NWYHLGHIKD ASSPQRANQH WEHWQEDIEL 
MHSMGVKRYR LGIEWARIEP SEGNWNKEVI KHYRKLLTFM KSQGIEPLLT LHHFTNPMWF
EKKEGFTKEQ NIPAFLRYVS YAVHSFGDLV SEYITINEPN VYATLGYYGG GFPPGDNSVQ
LTSKVMSVMA TCHIKSYRMI HRIRSKMGYT DTKVSFAHHA RVFAPENPRN PLHISYTVLS
KWMFQGALAK ACLTGQFLPP LRNINHVPRG QYADFLGLNY YTRSTISKLG DGVANDGPKN
DLGWEIYPHG IVSCAQELYS ILKRPIYITE NGTCDNQDTF RSRYIYEHLK ALCASNLPIT
RYYHWCFCDN FEWLEGESAR FGIVHIDYET QKRTIKQSGR FYNEIIEQGG VTEQLYEKYV
HEEEYHS
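
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the compact amino-acid string that NCBI lists for translation table 11 (its codon assignments match the standard code; the alternative start codons that table 11 allows are not handled here, which is an assumption that suffices for a gene starting with ATG). The `translate` helper is hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in the NCBI table listing.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for unrecognised codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGCATTTC AATTATTACC TAATATGCAG"))  # MAFQLLPNMQ
```

The output matches the first ten residues of the protein sequence listed above.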