Gene Ccel_0704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0704 
Symbol 
ID7309562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp814396 
End bp816282 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content37% 
IMG OID643607643 
Productsulfatase 
Protein accessionYP_002505063 
Protein GI220928154 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000394083 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAATT TATTTACGGT CAAACCCCAA AAGAGGATTG AATGGGGGTA TGTATGTACT 
GCCATTCTGA TAGCTGTTTT TACTGTATTT TTGACAATAA CGGGTTTCCT CCTTCAACCA
AATGACTTTC ATACCATAGC ATCTAATGTA CTTAGGCATC CTACTTTATT TATACTTAAC
AGTTTTCCTG TTTTTGCAGG CATATTATTT TTCTATCTGG TGTTCAACAA TGTATTTTTC
TCAGCTGCAC TGGTTTCAAC AGTGTTCCAC GCAGCATCTC TTGTAAACCG CTTTAAGATT
ATTTTTAGAG ATGACCCTTT TGTACCACAG GATGTGTTTC TTGGTGCAGA GGCGGCAAAT
ATAATGTCTG ATACAAAAAT GAAGTTGGAT TATACTCTAA TCATAGCAAT ACTGCTTTTT
TCTTTGGTCA TGTTAATTTT AGGAGTATTT ATAAAGTTTA AAAAAGTGAA ATTGCCCCTT
AAGGTAGCAG GATGTCTCGC TATAGTGGGG ATAGCAATTT TATCAAATAC TTTTGTGTAC
AGCTCCAGGG AAATATACGA TAGAATGCCA TCAAAGTCAA AAGCGTATGT AGCTGGTGTC
TTTAATGATG TTGGTTTTAA TTATTGCTTT TTGTATAATT TGAGTGCATA TAAGATGGAA
GTGCCTGAAA ACTACAGTAA GTCAGCGGCT GGAAAACTTG TAAAAAAATA TACAAAAAGT
AAAGCTGAGC CAAAGGTTAG GGCCAATGTA ATAATAGTTA TGAACGAAGC TTTTTCAGAT
ATAAGTAGGG AAAAAGTATT TAATTTCAGC CCTGAGGATG ATCCCCTTAG ATATTTTAAG
AACATAGGAC AGGAAAATAA TTCGATAACT GGGCAGGTGG TAGTACCTAA TTTTGGGGGC
GGAACAGCAA ATACGGAATT TGATGTAATG ACAGGAATGC AGACATTAAA CCTGAACACT
GTACCCACTT CTGCTTTCAG ACTTGTTCAC CGGAATATCA AATCTATAGC CGGAGTACTG
GGAAGCGATT CATATAAAAA ATATTTTATG CATCCCGGCG ACAGTTGGTT TTATAACAGG
GCAAACGTAT ACAGGTTCCT CGGAGTGGAA GAGCAGATAT TTATAAATCA ATTCAAAAAG
CCGGAAGACT TAAAGGGGAC GTTGATATCA GATAAGGCAG CAGGTGAAAA GATTATAAGC
CTGTATGAAA AAAATATGGC ATCCGGCAAA AATCCTGTAT TCAATTACAA TGTAACTATT
CAGAACCATA TGCCTTATAC TGCCAACAAG TACTACGGCT TAAAGATCAA AGAAGTACCT
ACAGACAAAA AGCTGTCCTT TCAGGCTAAG ACTCTTTTAG CAAATTATTT TGAGGGAGTT
AGGGATGGGG ACAAGCTGCT GAAAACCCTC ACAGATTACT TCAGGGAGAG ACAGGAACCT
GTCATTCTTG TGTTCTTTGG AGATCACAAG CCGTCCTTAG GTGACAGTTA TCTTGCATAC
AGGGAAGCGG GTATAGATAT AAATGAAAGC GGAACAATTG AGCAGGCCAT GAAGAGTCGT
GAAGTACCTT ATATCATATG GGCAAATAAA AGTGCGGGCA ATATTCTGGA TTTCCCTAAC
GTTGTCAGAA ACCTTGATTT ACCCAAGGAT AATACAATAA GTGCAAACTA TCTTGGGCCA
ATTATTCTGC AATTGATGGG TTACGAGGGG TATGATCCTT ATTTTGATTT TCTTAATGAA
TTGAGAAAGG AACTTCCGGT TATAACAAGA TACAATTTTA AAACAAACAG CGGTTATACT
GATAAGCTTT CCGAAAAACA GCAGGCTATG GTTAATGACT TCAGAATATG GCAGTATTAC
AGAATGACCA GTGATAAGGC AGACTGA
 
Protein sequence
MVNLFTVKPQ KRIEWGYVCT AILIAVFTVF LTITGFLLQP NDFHTIASNV LRHPTLFILN 
SFPVFAGILF FYLVFNNVFF SAALVSTVFH AASLVNRFKI IFRDDPFVPQ DVFLGAEAAN
IMSDTKMKLD YTLIIAILLF SLVMLILGVF IKFKKVKLPL KVAGCLAIVG IAILSNTFVY
SSREIYDRMP SKSKAYVAGV FNDVGFNYCF LYNLSAYKME VPENYSKSAA GKLVKKYTKS
KAEPKVRANV IIVMNEAFSD ISREKVFNFS PEDDPLRYFK NIGQENNSIT GQVVVPNFGG
GTANTEFDVM TGMQTLNLNT VPTSAFRLVH RNIKSIAGVL GSDSYKKYFM HPGDSWFYNR
ANVYRFLGVE EQIFINQFKK PEDLKGTLIS DKAAGEKIIS LYEKNMASGK NPVFNYNVTI
QNHMPYTANK YYGLKIKEVP TDKKLSFQAK TLLANYFEGV RDGDKLLKTL TDYFRERQEP
VILVFFGDHK PSLGDSYLAY REAGIDINES GTIEQAMKSR EVPYIIWANK SAGNILDFPN
VVRNLDLPKD NTISANYLGP IILQLMGYEG YDPYFDFLNE LRKELPVITR YNFKTNSGYT
DKLSEKQQAM VNDFRIWQYY RMTSDKAD