Gene Cphy_3239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3239 
Symbol 
ID5742017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3943604 
End bp3945511 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content35% 
IMG OID641294339 
Productglycoside hydrolase family protein 
Protein accessionYP_001560332 
Protein GI160881364 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTATAA ATATGAACAT AATATTAAAG GGATTTACAG ATGAGATAAA AGAAGGACTA 
AATCTACTAT TACAGGTGCA TTCGTTTGTA AATCAGGACC AGAGAGAAAT CGAGATAACC
GTAGAAAAGA TATTAGAGGG GAATACTTCT CTTATTGTAT CGGGAAAGGT TGGTTGGCAC
ATTCAATATA AGGAAACGGT ACATTTTTTC CGCGCCCTTG GTATCCTTAT TGAAAACATT
GAGAATACAT CCTTTGAAAC GACGGAAACA GCACTTTTTG AAGGTTGCTC TAATATGATT
GACCTTTCAA GAAACGCTGT CTACACTGTG GATGAAATGA AAAGAATGCT ATGTTACCTT
GCACTTACTG GGCATAACAA ATGCTATTTA TATATGGAAG ATACATATGA GCTACCAGAC
TATCCTTATT TCGGTTATTT AAGAGGTAGA TATTCCATCG CAGAGATGAA GGAAATCGAT
GATTTTGCAT ATGCTTTAGG AATCGAGGCA ATCCCATGTA TTCAAACATT AGCACATTTG
AAGACAACGT TAAAATGGAA TTATGCTTTA TCAATGAAAG ATACTGCGGA TATCCTACTT
GTGGGAGAAG AAAAAACTTA TCAATTTATT GAGGCGATGT TTGTCTCCCT GAAGCAGACC
TTCCGTTCTA GAACCATCCA TATAGGTATG GATGAGGCTA TGGATCTTGG TAGTGGTAAA
TATCTAAGAG AAAATGGTTA CCGCGTTCAA TATGATTTAA TGACAGAACA TTTAGCTAAG
GTCAATGAGA TTGCGATAAA ACATGGTATC AAGCCGTTGA TTTGGGATGA TATGTTTTAC
CGTTCCTTAA ATAAAGATCA TGAGTATTAT GATACAACAA TTCCTCTTAC AGACGAGCAT
ATAAAAAAGG TCCCTAGCAA TATTGGATTG GTATATTGGG ATTATTATCA TAATAATAAA
GAGGATTATG AGACCTTACT TACGATGAGA GACCGTTTTC CAAATGATAT TATCTTTGCA
GGAGGAATCT GGCGTTGGAT GGGTTATGTT CCAGGGTATA CAAAAACATT TGCAACAACA
AATGCAGCAC TCGACCGATG CAAGCATCAT AAAGTAAAAG AAATTATGTC CACTTGTTGG
GGCGATGACG GAGCGGAAAC ACCAATCGAA ACCATTATAC CAGGTCTAAT TCTTTTTGGT
GAGCACGGTT ATGGACAGGA TACTTCTATG GATGCGATTA GTAGTAAGTG TAAATTCTTA
ACTGGTGTTT CTTTATTTGA CTTTATGAAA ATTGAAGAAA TTGATATTAT TCCAGGATGT
GAAGCACAAA ATATCAAGAC TAGAAATCCA TCCAAACATA TTTTATTTCA AGACTTGCTA
CTAGGTGCCT TTGATACCTA TTTTGATAGA GAAGGACTTG AGGAACATTA TGCGAAAGTA
AAAGAAGAGC TATATACGAT CTCAAAAACT GCAGGTAAGT TTGAACAACT TTTTGTTATG
TATGCAAAGC TTGCAACGGT TCTAGAGAAA AAGGTGAAGC TTGGAATTAA AATAAGAAAG
GCATATCAAT TAAAAGATAA AGATACATTA AAGACAATCT GTGAACAGAT ATTGCCGGTT
CTAAAAGAAG ATGTGGAAAA CTTTAAAAAG GAATATACAA AGGTGTGGTT CAATGAAAGT
AAGGGACATG GCTTTGAAGT TATTGATGTT AGACTTGGTG GTTTAATGAG CAGAATTGAT
ACGGTAAAAT ATCGATTGGA AGACTATATA ACTGGTGATA TTTTAAAAGT TGAAGAGTTA
GAGGAAACAA TTTTACCATA TGAACTTGGC GGATATCCAG AAGGACCGTA TTTAGCTTAT
AACAAATACA AAGATATTGT AACTCAGAAT CTGCTTTCTC ATCACTAA
 
Protein sequence
MVINMNIILK GFTDEIKEGL NLLLQVHSFV NQDQREIEIT VEKILEGNTS LIVSGKVGWH 
IQYKETVHFF RALGILIENI ENTSFETTET ALFEGCSNMI DLSRNAVYTV DEMKRMLCYL
ALTGHNKCYL YMEDTYELPD YPYFGYLRGR YSIAEMKEID DFAYALGIEA IPCIQTLAHL
KTTLKWNYAL SMKDTADILL VGEEKTYQFI EAMFVSLKQT FRSRTIHIGM DEAMDLGSGK
YLRENGYRVQ YDLMTEHLAK VNEIAIKHGI KPLIWDDMFY RSLNKDHEYY DTTIPLTDEH
IKKVPSNIGL VYWDYYHNNK EDYETLLTMR DRFPNDIIFA GGIWRWMGYV PGYTKTFATT
NAALDRCKHH KVKEIMSTCW GDDGAETPIE TIIPGLILFG EHGYGQDTSM DAISSKCKFL
TGVSLFDFMK IEEIDIIPGC EAQNIKTRNP SKHILFQDLL LGAFDTYFDR EGLEEHYAKV
KEELYTISKT AGKFEQLFVM YAKLATVLEK KVKLGIKIRK AYQLKDKDTL KTICEQILPV
LKEDVENFKK EYTKVWFNES KGHGFEVIDV RLGGLMSRID TVKYRLEDYI TGDILKVEEL
EETILPYELG GYPEGPYLAY NKYKDIVTQN LLSHH