Gene Ccel_3207 details


Gene Information

Locus tag: Ccel_3207
Symbol:
ID: 7311790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3742975
End bp: 3744762
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 34%
IMG OID: 643610109
Product: sulfatase
Protein accession: YP_002507477
Protein GI: 220930568
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
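
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3742975
end_bp = 3744762

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1788
print(protein_length)  # 595
```

Under these assumptions both values agree with the reported Gene Length (1788 bp) and Protein Length (595 aa).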


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.803481
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAATA ATCTTTTTAG TACATTTTCC AATGCTTTCA AAAATAAACC ATACCGAAAA 
GCAGCTATTG TTTCGGCTAT TTTTATAATT TTAAATGCAT TTAAGACAAG TCTTTTCAAC
TACCTTTTGC TGCCCAAAAC CGGTGTACAT TTATTTACTT ACAAATTCTG GATATCCTTA
CTTGTTTGTA TTATAGTCTT TTCATTTGTT CTCAGCTTGA AGTCCAGATA CGTTTTTTTA
ATCGTTTATA TAATACAGGG TATATATTGT TTTACTAATA TATCATACTT CCTTTATTAT
CATAGTTACC TACATTTTCT TCAGTGGATT TCCTTGTTTA AGGAAGCCAT GATTTCAGCT
TCACACTTTG CTAATCCAAT AAGCGTTCAG CTGTTGGTAG TTTTTATTGA TGTTCCTGTA
GCTTTATTTA TTTTCTTTAA ATGTTTTAAA CGGGAAGTTA AAAGTACAAG GCTTCCTTTA
CTGCGTAACG TTCTTATTGC ATTATCTGTG GCGATTCTGA TAATTATTGA GATATTTAAT
TTTTCAAACA GACAGTCTGT TGTACAGTTC ATGAACGACA GGTATTCTGG AGAAACCCGT
ATAGTCGAAA GGTACGGTAC TGTTGCAAAC GGAATTGTTA GTATAGTTCA AAACAACACT
GAAGAAAAAC TGATTAAACA AATCAATTAC GGTAAAAATA TTTCTTCCCC CAGTAATGTA
ACTGCTTCCA AAAGTACTGT TGAGCAGCCT AATTACGTTG TTATACAAGT TGAGTCAATG
GACTCAAATA TTGTTAAACA AAAGTATAAA GGCTCATATA TCATGCCTTA TATGAGTTCT
TTGATGAATA ACAGTGTTTA TTATCCGTAC ACACTTAGCT ATCATAAGGG TGGAGGTACA
TCGGATGCCG AATTTTCGGT TATTAACAGT GCTGAAACTC TTGATTCCTT TCCTGCTATA
AAGCTGTCGT CGTATAATTA TCCCAACTCC GTTGTCTCAA AGCTTGCAAA AGCTTCATAC
AACACAATGG CTTTTCATGG CAACGTGGGA ACATTTTATA ACAGAAATAT AGCATTTTCA
AAAATGGGCT TTAAAAAGTT TTATGATATT AACTCAATGA ACTTCGATGA CGAGGGCTGG
GGTGCACCGG ATGATAAGGT TTTCTCATTT GCTTTTGGAA AGATTGATAA AAGTACAAGA
CCCTTTTATG CTCATATTAT TACAATGACG AGTCACGGCC CTTTCGAAAG TGCTAGAAAT
TACTATAATA ACAAGGCTTA TGATGATATT GAAAATGAAA TAGTGAAGAA CTTTTATAAT
TCCTTCAGTT ATGTTGATGA ATCAATAAAA GATTTTGTGG AAAAGATTCA GACAAAATAC
AGCAACACAT ATATAATTAT TTACGGTGAC CACACTCCAA ATATCAGCTC AAAGGATTTT
GCCCAAGCTT CATTTATAGA CGATGGCAAG TACTTTGAAT TTGTTCCCAT GTTTGTAATT
ACACCTGACC ATAAGAAGTA TGTGGAAGAT TCTGTTGTGG CCTCCTTCCT TGATGTTTCC
CCCACCATAA TGGCTACATC AAAGCTGGCA TATAACATTA AAACCGATGG AAGAAGCCTG
TTGGACACAC AAACTACCCC AGCAGATATC CCTTTTAAAG GTGGTTCCTT TGACCGCATC
CAGTTGTATA ATAAGATATC AACCCATAAG TATGTACAGG AAGAGCCTTT ATGGCGTAAA
TATTTGCCAT CCTTTATTTC TTCCAGTCTT ATAGAGCGGC ATAAATAA
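
The GC content field reported for this gene (34%) is the fraction of G and C bases in a sequence like the one above. A minimal sketch of that calculation follows. The function name `gc_content` is hypothetical, and the example input is only the first 10-base block of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence: one G and one C out of ten bases.
first_block = "ATGTTCAATA"
print(gc_content(first_block))  # 0.2
```

Because whitespace is stripped first, the full sequence can be pasted in with its 10-base spacing and line breaks intact.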
 
Protein sequence
MFNNLFSTFS NAFKNKPYRK AAIVSAIFII LNAFKTSLFN YLLLPKTGVH LFTYKFWISL 
LVCIIVFSFV LSLKSRYVFL IVYIIQGIYC FTNISYFLYY HSYLHFLQWI SLFKEAMISA
SHFANPISVQ LLVVFIDVPV ALFIFFKCFK REVKSTRLPL LRNVLIALSV AILIIIEIFN
FSNRQSVVQF MNDRYSGETR IVERYGTVAN GIVSIVQNNT EEKLIKQINY GKNISSPSNV
TASKSTVEQP NYVVIQVESM DSNIVKQKYK GSYIMPYMSS LMNNSVYYPY TLSYHKGGGT
SDAEFSVINS AETLDSFPAI KLSSYNYPNS VVSKLAKASY NTMAFHGNVG TFYNRNIAFS
KMGFKKFYDI NSMNFDDEGW GAPDDKVFSF AFGKIDKSTR PFYAHIITMT SHGPFESARN
YYNNKAYDDI ENEIVKNFYN SFSYVDESIK DFVEKIQTKY SNTYIIIYGD HTPNISSKDF
AQASFIDDGK YFEFVPMFVI TPDHKKYVED SVVASFLDVS PTIMATSKLA YNIKTDGRSL
LDTQTTPADI PFKGGSFDRI QLYNKISTHK YVQEEPLWRK YLPSFISSSL IERHK
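
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows one way to reproduce that. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. It does not handle alternative start codons, which is enough here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTTCAATA ATCTTTTTAG TACATTTTCC"))  # MFNNLFSTFS
```

The printed residues match the start of the protein sequence listed above.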