Gene Ccel_3382 details

Gene Information

Locus tag: Ccel_3382
Symbol:
ID: 7311947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3920352
End bp: 3922829
Gene Length: 2478 bp
Protein Length: 825 aa
Translation table: 11
GC content: 35%
IMG OID: 643610286
Product: sulfatase
Protein accession: YP_002507650
Protein GI: 220930741
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
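
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Neither assumption is stated in the record, and the helper names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3920352
end_bp = 3922829
gene_length_bp = 2478
protein_length_aa = 825


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the ones listed in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the coordinates.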


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.636616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA TAAGAAATAA TTTAATTATC TTTTTTTTAG TTGCATTAAT AAGTACAGTG 
GATATATTTA ATACCGGAAA CGGCTTTGTT CAATTGGCTT TAGTTCTTTT GGCTTCAATT
GTAATATTCA ATATGTCTGC GGTTAAATTT AAACATAAGA GGCTGATATT TACTATTCTC
TTTATTGCCA ATGTCGCTGT TTTGGCGTTG ACTTTCACTC AAAGAATATA CCTGTTAAAT
ATGCTTTGTC TCTCTTATGT GGTATGGAGT ATTATTACTC CTAAATTGCA CCGGTTCAAA
TTATTGTTCC CTATAACCTT TTTACTTTCG TTTATTTCCG TGTATGCCGC AGCCTTGCTG
GATTTTAGTT TACCGATTTT TATTATATCG TTTTATCCTG TATATGTATT CGGCTCCATG
TTAGATGGAA AAAATATAAT CAAATCTAAA AAGATATGGA GTTTTGCATT TTTAAATCTA
GCAATTTCTG CCTTAGGCTT GGCGATCTTT TTGTTTAAAT CTCTTGGCCG TTCAATTATT
GACAGTGCAC TTTTACTTAA TTTTGTAAAA CTTGGTTCGG TAACAGCTGT TTATTTACCG
TTTATTACTC TTTTTATTAT TTGGTTTGTA ATATCCGCAA ATACGATTTT CCGTATGCTT
ATTTCGCGAT TTATAAACGG TAGTTCAGCT TTTGATCTTA CTGATTATAT TAAACCCGTT
ATAGATCTTA TATCTTTCTT TGCAGTTACA GGTATTACAA CCTTTATTTG TGAATTTTCA
ATAAGGCGGG ATTTTAAGGA AACAGTTAAG GACGTTCTTG ACCCTAATCT TCTGTTTAAT
ATGCTGGTTC TATGCGGTAT CTATTTGTGC CTGATGGCAT TGATCGGAAA GGGTATTCCT
AAAATTATTA TTGGTATATT AACAATTTTC CTTACAGTTG CCAACTATAT TAAGTTCACC
TATTTTGATG AACCTTTTTA CCCATGGGAC TTATATCTGG TACGGAATTT AATCGGAATT
TCCAGAGAGT ATCTGAATAT ACCTATTATA ATTGGTGTCG TTGCAGCTAT TGCATTTCTG
GTTTTTCTTG TAATACGTTT CAGAAAGGCT ATAGGAAAAT ATCTTAAACC TAGGTTTACT
TTATTCCTGC TTCCTTTTGC AGCTGCACTG TTTCTATTGA ATTCAGTAAT TCTGACAAAT
ACTCCTCTGT CAGTACAGTT GGGAATACAG AAGTCATGGT ATATCGGCAA GGATGAAATA
ATGGCTAACG GTATGTTTGC CCAGAACTAT TTTTATCTTA CAGAGCTTGA TAAGTATCTT
AATCCTAAGC CTCAAGGATA CAATGAAAAT AAAATGGCTG AAATTAATTC AAAGTACGGT
AAAACAGGCG AAAGTGTGGC CGCCTCTGCT GTTGTTTCAA AAGAAAAGCC TAACGTTATT
GTGATTATGA GTGAGAGCTT TTGGGATATT ACCAAGCTGA ACGATGTAAA ATTCAGCAAA
GATATAACAG AGTATACCCG TAAATATCAT AAAGGTCAGA TTGCCGCACC TATTATCGGT
GGAGGAACAG CCAACACTGA ATTCGAAGCA CTTACAGGTA TGTCGATTTC TTCTTTAAGT
CCGGGTATTA TCGTTTACAA TGCATACCTT AGGACAGAAA CATCCAGCAT TGCATCTGTT
TTCAAGGACA ACGGCTACAG CACCACTGCT ATCCACCCTA ATTACGGATG GTTTTACAAT
AGGGACAAGG TATACAATTA CTTCGGATTC GATAATTTCT ACGATGTTGA TAGCTTTAGT
CTTAGCACCC AGTGTAAAGG TCCGTATATA TCCGACTATG CATTAGTTGA TAAAATTGTA
GATACTCTGA ATAACTCTAA TAAGCCGGCA TTTGTCTTCG GAGTTTCAAT GCAGAACCAT
GATCCGTACA TTGATAAATA CAGTTCTCAT GATGTAACTG TGGAATCAAC CAAATTGGAT
AGTGAGCAGA AAAATATTGT AGGTAACTTT GCTCAAGGTA TTTACGATGC TGATCAGTCT
TTTGGAAAGC TTATACAGGA ATTAGGCAAG ATTAATAAGC CTACGTTGGT ATATTTCTTT
GGGGACCATG CACCTAGACT TGGAAGTCTT AACGATTATT ACAAGGTGTA CGATCTGCTG
GGTACTGACG ACAGGTCTGC CCAGAACCAA GGCCTGGAAA AACTTAAATA TTACACGACT
CCGTTTGCTG CATGGTCAAA CTATAAAGAT ATTGATTCCT TCTCCGAAAT TGTTTCTCCA
TCGCATTTAG CATATAAGAT ACTTAAGGAC ACAGGTATTA AATATCCAAA CTATTTTAAT
ATACTTTCTG AGCTGGAGAA AAGCTTCCCT GTTCTCCATC AGCAGACCAT AAATACAGTT
GACAATAACA ATGATCTTAT AAAGGATTAC CGTTTAATCC AATATGATCT CTTATTTGGA
AAAAAATATT TGTATTAG
 
Protein sequence
MKNIRNNLII FFLVALISTV DIFNTGNGFV QLALVLLASI VIFNMSAVKF KHKRLIFTIL 
FIANVAVLAL TFTQRIYLLN MLCLSYVVWS IITPKLHRFK LLFPITFLLS FISVYAAALL
DFSLPIFIIS FYPVYVFGSM LDGKNIIKSK KIWSFAFLNL AISALGLAIF LFKSLGRSII
DSALLLNFVK LGSVTAVYLP FITLFIIWFV ISANTIFRML ISRFINGSSA FDLTDYIKPV
IDLISFFAVT GITTFICEFS IRRDFKETVK DVLDPNLLFN MLVLCGIYLC LMALIGKGIP
KIIIGILTIF LTVANYIKFT YFDEPFYPWD LYLVRNLIGI SREYLNIPII IGVVAAIAFL
VFLVIRFRKA IGKYLKPRFT LFLLPFAAAL FLLNSVILTN TPLSVQLGIQ KSWYIGKDEI
MANGMFAQNY FYLTELDKYL NPKPQGYNEN KMAEINSKYG KTGESVAASA VVSKEKPNVI
VIMSESFWDI TKLNDVKFSK DITEYTRKYH KGQIAAPIIG GGTANTEFEA LTGMSISSLS
PGIIVYNAYL RTETSSIASV FKDNGYSTTA IHPNYGWFYN RDKVYNYFGF DNFYDVDSFS
LSTQCKGPYI SDYALVDKIV DTLNNSNKPA FVFGVSMQNH DPYIDKYSSH DVTVESTKLD
SEQKNIVGNF AQGIYDADQS FGKLIQELGK INKPTLVYFF GDHAPRLGSL NDYYKVYDLL
GTDDRSAQNQ GLEKLKYYTT PFAAWSNYKD IDSFSEIVSP SHLAYKILKD TGIKYPNYFN
ILSELEKSFP VLHQQTINTV DNNNDLIKDY RLIQYDLLFG KKYLY
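
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal, hedged illustration and not the pipeline used to produce this record. It builds the codon table from the standard NCBI ordering, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here; the `translate` helper is a hypothetical name.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAAATA TAAGAAATAA TTTAATTATC"))
# Last 18 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("AAAAAATATT TGTATTAG"))
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above (MKNIRNNLII and KKYLY).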