Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0704 |
Symbol | |
ID | 7309562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 814396 |
End bp | 816282 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643607643 |
Product | sulfatase |
Protein accession | YP_002505063 |
Protein GI | 220928154 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000394083 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAATT TATTTACGGT CAAACCCCAA AAGAGGATTG AATGGGGGTA TGTATGTACT GCCATTCTGA TAGCTGTTTT TACTGTATTT TTGACAATAA CGGGTTTCCT CCTTCAACCA AATGACTTTC ATACCATAGC ATCTAATGTA CTTAGGCATC CTACTTTATT TATACTTAAC AGTTTTCCTG TTTTTGCAGG CATATTATTT TTCTATCTGG TGTTCAACAA TGTATTTTTC TCAGCTGCAC TGGTTTCAAC AGTGTTCCAC GCAGCATCTC TTGTAAACCG CTTTAAGATT ATTTTTAGAG ATGACCCTTT TGTACCACAG GATGTGTTTC TTGGTGCAGA GGCGGCAAAT ATAATGTCTG ATACAAAAAT GAAGTTGGAT TATACTCTAA TCATAGCAAT ACTGCTTTTT TCTTTGGTCA TGTTAATTTT AGGAGTATTT ATAAAGTTTA AAAAAGTGAA ATTGCCCCTT AAGGTAGCAG GATGTCTCGC TATAGTGGGG ATAGCAATTT TATCAAATAC TTTTGTGTAC AGCTCCAGGG AAATATACGA TAGAATGCCA TCAAAGTCAA AAGCGTATGT AGCTGGTGTC TTTAATGATG TTGGTTTTAA TTATTGCTTT TTGTATAATT TGAGTGCATA TAAGATGGAA GTGCCTGAAA ACTACAGTAA GTCAGCGGCT GGAAAACTTG TAAAAAAATA TACAAAAAGT AAAGCTGAGC CAAAGGTTAG GGCCAATGTA ATAATAGTTA TGAACGAAGC TTTTTCAGAT ATAAGTAGGG AAAAAGTATT TAATTTCAGC CCTGAGGATG ATCCCCTTAG ATATTTTAAG AACATAGGAC AGGAAAATAA TTCGATAACT GGGCAGGTGG TAGTACCTAA TTTTGGGGGC GGAACAGCAA ATACGGAATT TGATGTAATG ACAGGAATGC AGACATTAAA CCTGAACACT GTACCCACTT CTGCTTTCAG ACTTGTTCAC CGGAATATCA AATCTATAGC CGGAGTACTG GGAAGCGATT CATATAAAAA ATATTTTATG CATCCCGGCG ACAGTTGGTT TTATAACAGG GCAAACGTAT ACAGGTTCCT CGGAGTGGAA GAGCAGATAT TTATAAATCA ATTCAAAAAG CCGGAAGACT TAAAGGGGAC GTTGATATCA GATAAGGCAG CAGGTGAAAA GATTATAAGC CTGTATGAAA AAAATATGGC ATCCGGCAAA AATCCTGTAT TCAATTACAA TGTAACTATT CAGAACCATA TGCCTTATAC TGCCAACAAG TACTACGGCT TAAAGATCAA AGAAGTACCT ACAGACAAAA AGCTGTCCTT TCAGGCTAAG ACTCTTTTAG CAAATTATTT TGAGGGAGTT AGGGATGGGG ACAAGCTGCT GAAAACCCTC ACAGATTACT TCAGGGAGAG ACAGGAACCT GTCATTCTTG TGTTCTTTGG AGATCACAAG CCGTCCTTAG GTGACAGTTA TCTTGCATAC AGGGAAGCGG GTATAGATAT AAATGAAAGC GGAACAATTG AGCAGGCCAT GAAGAGTCGT GAAGTACCTT ATATCATATG GGCAAATAAA AGTGCGGGCA ATATTCTGGA TTTCCCTAAC GTTGTCAGAA ACCTTGATTT ACCCAAGGAT AATACAATAA GTGCAAACTA TCTTGGGCCA ATTATTCTGC AATTGATGGG TTACGAGGGG TATGATCCTT ATTTTGATTT TCTTAATGAA TTGAGAAAGG AACTTCCGGT TATAACAAGA TACAATTTTA AAACAAACAG CGGTTATACT GATAAGCTTT CCGAAAAACA GCAGGCTATG GTTAATGACT TCAGAATATG GCAGTATTAC AGAATGACCA GTGATAAGGC AGACTGA
|
Protein sequence | MVNLFTVKPQ KRIEWGYVCT AILIAVFTVF LTITGFLLQP NDFHTIASNV LRHPTLFILN SFPVFAGILF FYLVFNNVFF SAALVSTVFH AASLVNRFKI IFRDDPFVPQ DVFLGAEAAN IMSDTKMKLD YTLIIAILLF SLVMLILGVF IKFKKVKLPL KVAGCLAIVG IAILSNTFVY SSREIYDRMP SKSKAYVAGV FNDVGFNYCF LYNLSAYKME VPENYSKSAA GKLVKKYTKS KAEPKVRANV IIVMNEAFSD ISREKVFNFS PEDDPLRYFK NIGQENNSIT GQVVVPNFGG GTANTEFDVM TGMQTLNLNT VPTSAFRLVH RNIKSIAGVL GSDSYKKYFM HPGDSWFYNR ANVYRFLGVE EQIFINQFKK PEDLKGTLIS DKAAGEKIIS LYEKNMASGK NPVFNYNVTI QNHMPYTANK YYGLKIKEVP TDKKLSFQAK TLLANYFEGV RDGDKLLKTL TDYFRERQEP VILVFFGDHK PSLGDSYLAY REAGIDINES GTIEQAMKSR EVPYIIWANK SAGNILDFPN VVRNLDLPKD NTISANYLGP IILQLMGYEG YDPYFDFLNE LRKELPVITR YNFKTNSGYT DKLSEKQQAM VNDFRIWQYY RMTSDKAD
|
| |