Gene Ccel_1807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1807 
Symbol 
ID7310538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2160862 
End bp2162874 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content35% 
IMG OID643608739 
Productsulfatase 
Protein accessionYP_002506137 
Protein GI220929228 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0290811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACATGG ATAAAAAAAT AGGCCTTTTT CAAAGGTATG GTTCTATACT GTTTTTTCCT 
ATAACTATAA TATATCTTGA ATCGATATTT AAAATAGTGG TATTTAAAGA ACTATTCAAC
ATCGGTATAG TATATATGAT ATTATTTTCG ATTCCGGCTG GGATATTATT ATATTTAGTA
AGCAATTTGT TCAGCAGTAG GGTAAACAGA ACTATATCCA TAGTTCTTAC TGTTTTTCTG
ACATTTATAT TCATAGTACA AATTGTTTAT TTCCATATAT TTAAAACATT TCTTGCTATA
TATTCAATAA ATGGAACCGG CCAGGTTCTT CAGTTTTGGC AGGAAGTGCT TTCAGCAGTC
AAAAGTAAAG CAGCAGTTAT CTTATTACTG TTGGTACCTT TATTACTGAT TATATCCGGT
AAAAGAGTTT TATTTGTCAA AAAGGTCTCT ATTAAAACCA AAGCATGGCT TGCATTTACA
ATGGTGTGTA TACAAATAAC AGCCACAATT TTAGTATTTG CTTCAGGCAC AGGTGAGCTT
AGTACCGGCT TCATTTATTC AAAGGCTGTA ATACCTGACT TATCTATGAA CAGATTCGGT
ATGCTTACAA CTTCACGGTT GGATGTCAAG CATCTTGTTT TTAGGGTTAA CAGCCCTCGT
ACTGAGGAGA AAGAAGAAAT AACCGCTATT GCTGACAATA CGGAAATTTT AAACAAACCA
CAGCCGGAAA AGACAGTAGA AACGCAAGAT TTGCAAAAGC CCGAGATAAA TAATGATGAC
AATATAATGA ACATCGATTT TGACAAACTT ATAGCAAGTG AAAGTGACCC GAATATTGTT
TCAATGCATA GGTATTTTAA ATCTGTCAAA CCAACGAAAA AGAACAACTA TACAGGAATG
TTCAAGGATA AAAACCTTAT AATGATAACA GCTGAGGGCT TTTCACCGTA TGCAGTAAAC
AAGGATTTAA CACCTACATT GTATAAAATG TATCAGGAAG GCTTCAGATT TACCAACTTT
TATACGCCTA TGTGGGGTGT GAGTACATCT GATGGTGAAT ACGTTGCGTG TAATTCGTTA
ATACCAAAAT CTGGAATTTG GAGCTTTTAT ATTTCGGGAA AAAACTATAT GCCGTTCTGT
ATGGGAAACC AGCTTAAAAA GCTTGGATAT GGTACACGTG CATACCATGA CCACTCTTAT
ACGTATTATC ACAGGGATGT ATCCCACCCG AACATGGGGT ACGATTTTAA GGCAGTTGGT
AACGGTCTTA ATATAAAGAA ATCTTGGCCG GAATCAGACC TTGAAATGAT TCAAAAAACT
GCTGATGAAT ATATGGGAAA AACACCGTTT CATACATACT ATATGACTGT GAGCGGACAC
TTGATGTATA CTTTTAACGG AAATGCGATG TCGGCAAAAA ACAGAGAGCT AGTAAAAAAC
TTACCATATT CATCTGGAGT AAAAGCTTAC CTTGCATGCA ATATAGAATT TGACAGAGCT
ATGGGAGAGT TAATCGCCCT TCTCGAACAA TCAGGTATTG CAGATGATAC ACTGATTGCA
ATAAGCCCTG ATCACTACCC TTACGGACTT ACAAATAAAG AAATAAGCGA ACTTGCAGGA
CACCAGATAG AGACAAACTT TGAACTTTAC AAGGGAATAT TTATACTATG GTCAAAAGGA
ATTAAATCCG AAGAGATAAG TAAACCATGT GCAAGCATGG ATATACTGCC TACAATATCA
AACCTGATGG GTGTTGAATA TGACTCCAGA TTACTAATGG GGAGAGATAT TTTTTCTGAC
GCTCCGCCGC TGGTCGTATT CTCAAACTGG AGCTGGCTAA CTGACAAAGC ACGATACAAT
TCAAAGAACG GTAAATTTCT TCTTGCAGAA GGAGAAACAA ATGAATCTGT CAATAAACAA
TACAGGACTG AGATTTCCAA ACGTGTAAAT GACATGTTTA CCTATTCGGA AAAGATATTG
GAGAATAATT ATTATAAAAA AGTTATCAGA TAG
 
Protein sequence
MYMDKKIGLF QRYGSILFFP ITIIYLESIF KIVVFKELFN IGIVYMILFS IPAGILLYLV 
SNLFSSRVNR TISIVLTVFL TFIFIVQIVY FHIFKTFLAI YSINGTGQVL QFWQEVLSAV
KSKAAVILLL LVPLLLIISG KRVLFVKKVS IKTKAWLAFT MVCIQITATI LVFASGTGEL
STGFIYSKAV IPDLSMNRFG MLTTSRLDVK HLVFRVNSPR TEEKEEITAI ADNTEILNKP
QPEKTVETQD LQKPEINNDD NIMNIDFDKL IASESDPNIV SMHRYFKSVK PTKKNNYTGM
FKDKNLIMIT AEGFSPYAVN KDLTPTLYKM YQEGFRFTNF YTPMWGVSTS DGEYVACNSL
IPKSGIWSFY ISGKNYMPFC MGNQLKKLGY GTRAYHDHSY TYYHRDVSHP NMGYDFKAVG
NGLNIKKSWP ESDLEMIQKT ADEYMGKTPF HTYYMTVSGH LMYTFNGNAM SAKNRELVKN
LPYSSGVKAY LACNIEFDRA MGELIALLEQ SGIADDTLIA ISPDHYPYGL TNKEISELAG
HQIETNFELY KGIFILWSKG IKSEEISKPC ASMDILPTIS NLMGVEYDSR LLMGRDIFSD
APPLVVFSNW SWLTDKARYN SKNGKFLLAE GETNESVNKQ YRTEISKRVN DMFTYSEKIL
ENNYYKKVIR