Gene Cphy_3571 details


Gene Information

Locus tag: Cphy_3571
Symbol:
ID: 5742975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4409134
End bp: 4411029
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 37%
IMG OID: 641294682
Product: glycoside hydrolase family protein
Protein accession: YP_001560659
Protein GI: 160881691
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
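
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4409134, 4411029)
print(length, "bp")                      # 1896 bp
print(protein_length_aa(length), "aa")   # 631 aa
```

Both values agree with the Gene Length and Protein Length fields of this record.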


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.475227
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCCGT TTCATGTATT GGCAGATCAG GCATGGCAAT ATGAACTTAT TGAGGAATTG 
AAAAAATCCT TACCATTTAC CCTTTCAGAA GATGGCTTAC CTGTTGAGGT AAAGCATCAA
GGTATTGGAA TTCAAGTAAC AAAGAAGGAT AATGTAAGAC AAATTACAGT GGGAGATAAA
AACCAGTTAG CAAGAGCATG GTACCTCATG CTTCGGAATG AATCTCTTAA TGAATACGAT
ATCAAGGAAC AATGTAGCTT TGGAGATTTG GGTGTGATGC TAGATTGTTC TAGAAATGCA
GTGATGAAAC TGCCTACCCT ATTAGAATAC ATAAGGCAGT TAGCAATATT TGGATATCAT
AGTTTACAAC TATATACTGA GGATACGATT AGAATTGAGG AAGAACCGTA CTTTGGTTAT
ATGCGTGGAG CTTATACAAA AGAAGAAATA AAAAAGGTTG ATGCTTACTG CGCAAAATTC
GGCATAGAAT TAGTGCCATG CGTACAAACA TTAGCTCACA TTAATCAGAT TACAAGATAT
GAGCGTTACC AAGACATCAT CGATGTGAAT GATATTTTCT TAGTAGGAAA TGAAAAAACA
TATCAATTAA TTGAACGTAT CATATCGACT GCATCCGAGT GTTTTACTTC AAGAAGAATC
AACATCGGTA TGGATGAAGC TCATATGCTT GGTTTAGGAA AATACCTAGA TCAGAACGGT
TATCAGAACC GTTTCACCAT TATGGTTGAG CATTTAAAGA GAGTACAAGA AATTTTGCAT
CGTTATGGAT TTACTGCAAT GATGTGGAGT GATATGTTTT TTAGATTGCT TGCAAACGGT
GAATATTATT CCCTTAAGGA AGAGCAGTTA AAAAGTGATT TATTACAACA TGTACCGAAG
GATATCGAGT TAGTTTACTG GGATTATTAT TCAAGAGACT ATCATCATTA CGAACAGAAC
TTAGTGACAC ATTTTAGAAT TTCAGACCGC ATAGGTTTCG CTGCCGGTGC ATGGAAATGG
ACTGGATTTG CACCAGAGAA TAGCTTTAGC CAGGTGGCTG GTAAAGAAGC AATGAAGGCT
TGTGTAGAAA AGGGAGTAAA TACTTTCTTA GTAACCTGTT GGGGAGATAA CGGCGCAGAA
GCAAGTGCCC TCTCGGTGTT ACCAACCCTG TTTTATTATG CGGAATTAGC TTATCAGTAT
GAAAGCCTTG TAGATAAAGA GTTTACCAAA GAGAAAGATT ATTCCGAGTA TTTTAAAGTT
GCAACAGGAA TTTCTTTTGC AGAATTTATG CTACTGGATA GTCCAAATGA AGTATTTGCA
GAAACTACTT ATACACACAG TAATGCTTGT AAGTTCTTAT TATACAACGA TGTGTTAATC
GGAACATTTG ATTCCATTGT GAAGAGTGAA ACAAGGATAG CTTACGAAGA CAAGAAGAAT
CAATTAGCTG CAGTAGCGAA GGCGGGTACA AGATATTCCT ATCTATTTCA GACCCTATCA
AGTCTATGTT CCCTATTAGA GGAGAAAGCA GACTTGGGAG TAGAGATAAA AAATGCATAT
GATAAAAAGG ATTTTGATAG GTTACGTGAA ATTGCTGAAT CAAAAATTCC CGAAGTATTG
AAGAGACTCG ACCAGTTTAT ACGAGATTTT CGGTATCAAT GGCATAAGGA GAACAAATCC
TTTGGCTTTG AAATACAATT GATACGTCTT GGCGGATTAA AGGAAAGACT TTACTACGCA
AAGGAACAAA TTTTACTATG GACAGAAGGA GATATTGAAC GGATCGACGA ATTAGAAGAA
GAGAGACTTC CATTTGCTTA TTTTGAACAA GAGGATGGAA GTAGACTAAA TTATAATTTG
TGGAACGTTA TTGTATCACC GGCTGTTATG GGATGA
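
The gene sequence is listed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content field of this record. The helper names are hypothetical, and the rounding used by the database is not known.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First listed row of the gene sequence, as it appears above.
first_row = "ATGATTCCGT TTCATGTATT GGCAGATCAG GCATGGCAAT ATGAACTTAT TGAGGAATTG"
seq = clean_sequence(first_row)
print(len(seq))          # 60 bases in one listed row
print(gc_percent(seq))   # GC percentage of this row only
```

Passing the full listing to `clean_sequence` and then to `gc_percent` gives the value for the whole gene.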
 
Protein sequence
MIPFHVLADQ AWQYELIEEL KKSLPFTLSE DGLPVEVKHQ GIGIQVTKKD NVRQITVGDK 
NQLARAWYLM LRNESLNEYD IKEQCSFGDL GVMLDCSRNA VMKLPTLLEY IRQLAIFGYH
SLQLYTEDTI RIEEEPYFGY MRGAYTKEEI KKVDAYCAKF GIELVPCVQT LAHINQITRY
ERYQDIIDVN DIFLVGNEKT YQLIERIIST ASECFTSRRI NIGMDEAHML GLGKYLDQNG
YQNRFTIMVE HLKRVQEILH RYGFTAMMWS DMFFRLLANG EYYSLKEEQL KSDLLQHVPK
DIELVYWDYY SRDYHHYEQN LVTHFRISDR IGFAAGAWKW TGFAPENSFS QVAGKEAMKA
CVEKGVNTFL VTCWGDNGAE ASALSVLPTL FYYAELAYQY ESLVDKEFTK EKDYSEYFKV
ATGISFAEFM LLDSPNEVFA ETTYTHSNAC KFLLYNDVLI GTFDSIVKSE TRIAYEDKKN
QLAAVAKAGT RYSYLFQTLS SLCSLLEEKA DLGVEIKNAY DKKDFDRLRE IAESKIPEVL
KRLDQFIRDF RYQWHKENKS FGFEIQLIRL GGLKERLYYA KEQILLWTEG DIERIDELEE
ERLPFAYFEQ EDGSRLNYNL WNVIVSPAVM G
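
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a plain-Python illustration, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; table 11 differs in its alternative start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here). The function name is hypothetical, and codons containing characters other than A, C, G, T raise a KeyError.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(block: str) -> str:
    """Translate a blocked DNA listing, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("ATGATTCCGT TTCATGTATT GGCAGATCAG"))  # MIPFHVLADQ
```

The output matches the first block of the protein sequence; the final gene codons GGA TGA translate to G followed by a stop, matching the last listed residue.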