Gene Information |
Locus tag | Ccel_3207 |
Symbol | |
ID | 7311790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 3742975 |
End bp | 3744762 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643610109 |
Product | sulfatase |
Protein accession | YP_002507477 |
Protein GI | 220930568 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
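The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3742975
end_bp = 3744762

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1788 bp
print(protein_length, "aa")  # 595 aa
```

Under these assumptions the computed values agree with the Gene Length (1788 bp) and Protein Length (595 aa) fields.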
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.803481 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAATA ATCTTTTTAG TACATTTTCC AATGCTTTCA AAAATAAACC ATACCGAAAA GCAGCTATTG TTTCGGCTAT TTTTATAATT TTAAATGCAT TTAAGACAAG TCTTTTCAAC TACCTTTTGC TGCCCAAAAC CGGTGTACAT TTATTTACTT ACAAATTCTG GATATCCTTA CTTGTTTGTA TTATAGTCTT TTCATTTGTT CTCAGCTTGA AGTCCAGATA CGTTTTTTTA ATCGTTTATA TAATACAGGG TATATATTGT TTTACTAATA TATCATACTT CCTTTATTAT CATAGTTACC TACATTTTCT TCAGTGGATT TCCTTGTTTA AGGAAGCCAT GATTTCAGCT TCACACTTTG CTAATCCAAT AAGCGTTCAG CTGTTGGTAG TTTTTATTGA TGTTCCTGTA GCTTTATTTA TTTTCTTTAA ATGTTTTAAA CGGGAAGTTA AAAGTACAAG GCTTCCTTTA CTGCGTAACG TTCTTATTGC ATTATCTGTG GCGATTCTGA TAATTATTGA GATATTTAAT TTTTCAAACA GACAGTCTGT TGTACAGTTC ATGAACGACA GGTATTCTGG AGAAACCCGT ATAGTCGAAA GGTACGGTAC TGTTGCAAAC GGAATTGTTA GTATAGTTCA AAACAACACT GAAGAAAAAC TGATTAAACA AATCAATTAC GGTAAAAATA TTTCTTCCCC CAGTAATGTA ACTGCTTCCA AAAGTACTGT TGAGCAGCCT AATTACGTTG TTATACAAGT TGAGTCAATG GACTCAAATA TTGTTAAACA AAAGTATAAA GGCTCATATA TCATGCCTTA TATGAGTTCT TTGATGAATA ACAGTGTTTA TTATCCGTAC ACACTTAGCT ATCATAAGGG TGGAGGTACA TCGGATGCCG AATTTTCGGT TATTAACAGT GCTGAAACTC TTGATTCCTT TCCTGCTATA AAGCTGTCGT CGTATAATTA TCCCAACTCC GTTGTCTCAA AGCTTGCAAA AGCTTCATAC AACACAATGG CTTTTCATGG CAACGTGGGA ACATTTTATA ACAGAAATAT AGCATTTTCA AAAATGGGCT TTAAAAAGTT TTATGATATT AACTCAATGA ACTTCGATGA CGAGGGCTGG GGTGCACCGG ATGATAAGGT TTTCTCATTT GCTTTTGGAA AGATTGATAA AAGTACAAGA CCCTTTTATG CTCATATTAT TACAATGACG AGTCACGGCC CTTTCGAAAG TGCTAGAAAT TACTATAATA ACAAGGCTTA TGATGATATT GAAAATGAAA TAGTGAAGAA CTTTTATAAT TCCTTCAGTT ATGTTGATGA ATCAATAAAA GATTTTGTGG AAAAGATTCA GACAAAATAC AGCAACACAT ATATAATTAT TTACGGTGAC CACACTCCAA ATATCAGCTC AAAGGATTTT GCCCAAGCTT CATTTATAGA CGATGGCAAG TACTTTGAAT TTGTTCCCAT GTTTGTAATT ACACCTGACC ATAAGAAGTA TGTGGAAGAT TCTGTTGTGG CCTCCTTCCT TGATGTTTCC CCCACCATAA TGGCTACATC AAAGCTGGCA TATAACATTA AAACCGATGG AAGAAGCCTG TTGGACACAC AAACTACCCC AGCAGATATC CCTTTTAAAG GTGGTTCCTT TGACCGCATC CAGTTGTATA ATAAGATATC AACCCATAAG TATGTACAGG AAGAGCCTTT ATGGCGTAAA TATTTGCCAT CCTTTATTTC TTCCAGTCTT ATAGAGCGGC ATAAATAA
|
Protein sequence | MFNNLFSTFS NAFKNKPYRK AAIVSAIFII LNAFKTSLFN YLLLPKTGVH LFTYKFWISL LVCIIVFSFV LSLKSRYVFL IVYIIQGIYC FTNISYFLYY HSYLHFLQWI SLFKEAMISA SHFANPISVQ LLVVFIDVPV ALFIFFKCFK REVKSTRLPL LRNVLIALSV AILIIIEIFN FSNRQSVVQF MNDRYSGETR IVERYGTVAN GIVSIVQNNT EEKLIKQINY GKNISSPSNV TASKSTVEQP NYVVIQVESM DSNIVKQKYK GSYIMPYMSS LMNNSVYYPY TLSYHKGGGT SDAEFSVINS AETLDSFPAI KLSSYNYPNS VVSKLAKASY NTMAFHGNVG TFYNRNIAFS KMGFKKFYDI NSMNFDDEGW GAPDDKVFSF AFGKIDKSTR PFYAHIITMT SHGPFESARN YYNNKAYDDI ENEIVKNFYN SFSYVDESIK DFVEKIQTKY SNTYIIIYGD HTPNISSKDF AQASFIDDGK YFEFVPMFVI TPDHKKYVED SVVASFLDVS PTIMATSKLA YNIKTDGRSL LDTQTTPADI PFKGGSFDRI QLYNKISTHK YVQEEPLWRK YLPSFISSSL IERHK
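The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It assumes the sequences are pasted as plain strings with spaces between 10-character blocks, as shown above. The helper names (clean, translate, gc_fraction) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons; its alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGTTCAATA ATCTTTTTAG TACATTTTCC"
print(translate(first_blocks))  # MFNNLFSTFS
```

Passing the full gene sequence to translate should reproduce the protein sequence shown above, and gc_fraction on the same input gives the value to compare against the GC content field.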