Gene Information |
Locus tag | CPR_0575 |
Symbol | |
ID | 4206111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 683456 |
End bp | 685282 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 26% |
IMG OID | 642565135 |
Product | sulfatase family protein |
Protein accession | YP_697902 |
Protein GI | 110803822 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
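The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in Protein Length. The function name `cds_lengths` is hypothetical and not part of the source database.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated by the source record):
    - coordinates are 1-based and inclusive;
    - the CDS includes exactly one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # Subtract the stop codon, which does not encode an amino acid.
    return gene_length, codons - 1


# Values copied from the record for CPR_0575.
gene_len, protein_len = cds_lengths(683456, 685282)
print(gene_len, protein_len)  # 1827 608
```

Under these assumptions the computed values agree with the Gene Length (1827 bp) and Protein Length (608 aa) fields listed for this locus.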
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.122436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA ATGAATATAG ATTTTTATCA ATACTTAATA ACAAATTTTT ATTATTTTTC CTCTTTTCTT TATTGATAAT AATAAAAGAA GTATTTTTTA CTTGGATATG GTCAAGTAGT GATAGTATTG CTAAATTGCA AATTTTTAAT ATGTATATGT ACTGGCCAAA GCTATTAATA CATATAGTTT TTGCTATGAC TATTGCAAGT GGAATTTTTT TATTTAATAA AAATGGAAGA ATAATTTATA TATTAATAGC AGATATTGTT ATAACAATTC TTATGTTTTT AGATATTTCA TATTATAGAA ATTACGGGAA TTTTTTATCA ATAAGACATT TATTTCATGA AGAGTTGTTT AATCCTTTAA ATAAGGAATT ATTTAATTTT TATAAAAGAG ATATTCTTCT TTTAATTGAT TTTATAATTT TAGTTCCTCT ATCAATTTTT TCATTAAAGA ATGATAGTGG TAAAAAGAGT AGAAGAAGTA TAAAGATATT TATACTAAGC TGGATAATAA ATGGAATAAT TATATACACA AGCCATTCTT TAGTAGATAT AAAAGGGGTT ACTAATGGTA AATTAACATT GTTTGAAAAA TCATTTGCTC CACAAGTTAA TATGGATGAT TTGGGAATGG TAGGATATCA TGAATATGAC TTAACAAGCT ATATTTTAAA AAAAGACAAA AAGCTAAGTA CAGAAGAAAA GGTTGAAATA AATAAATGGT TTGAAGAGAA TAAAGAGACT TTACCTGATA ATAAATATAA AGGACTTGGT AAAGGAAAAA ATCTTATAAT TATACAGTGG GAATCTTTAG AAAATTTTGC TATTAATTAT AAAGTTGATG GTCAAGAAAT AACTCCTAAC ATGAACAAAT TATTAAGTAA TTCACTATGT TTTGACAATA TTTATGAGCA AAATAAAAAT GGGACAACTT CTGATGCTGA ATTAATGGCT AATACATCAT TATTACCAAT AAGTGAAAGT GCGTATTTTA TACAATATCC ATGGAAAAAA CAAAATACAC TTCAAAGACT TTTAGAGAAA CACGGGTATA ATACAGCTAC AGCGATTGCT GATAAGGGTG GAGTATGGAA CTGGTTAGAA AATCATAAAA GTTTTGGTGT ACAGACTATA TGGGATAGCA GTTATTTCAA TAGAGATGAA TTGATAGGAT CAAATATTAC AGACGGAAGT TTATTTAGAC AAACAGAAGA AAAAATAAAA ACATTAAAAA GACCATATTA TTTGTTTATG GCAACAGCAA CATCTCACGG TCCATTTGAT CTTCCTACTA ACTATAGAGA ACTTAAATTA CCTAAAGAAA TTGATGATAC TAAATTAGGT GGTTATCTTC AAAGTTTAAG ATATACTGAT AAAATGCTTG GAGAATTCTT GAATAAACTT AAAGGTGATG GAGTGCTAGA TAATAGTATT ATTGTCATTT ATGGAGATCA TGGAGGAATA AATAAATATT ATAAAAAAGA GTTAGAAAAT ATAGATTTTG CTAATAACAA TTGGAAGCAG GAATACTTAA AGGTACCTAT GTTGATATAT AATCCAGAAA TTAAAGGTGA AGTTATAAAT ACATATGGTG GATTGGTAGA TCTTTTACCT ACTGTTGGAT ATATCATGGG AGTTGACAAA AGTGATTTTG AAAAAACAGC AATGGGAAGA GTTTTAGTAA ATACGAATGT AAATGCTACA ATAAATTCAA GTGGTCAAAT TTTAGGTAAT CCTAAAGATG AAAAAGAGAT AAGGCATCTA CAAGATATGT ATAAAATTAG TAATAATATA 
ATAGAGAGCA ATTATTTTAA TAATTAA
|
Protein sequence | MKKNEYRFLS ILNNKFLLFF LFSLLIIIKE VFFTWIWSSS DSIAKLQIFN MYMYWPKLLI HIVFAMTIAS GIFLFNKNGR IIYILIADIV ITILMFLDIS YYRNYGNFLS IRHLFHEELF NPLNKELFNF YKRDILLLID FIILVPLSIF SLKNDSGKKS RRSIKIFILS WIINGIIIYT SHSLVDIKGV TNGKLTLFEK SFAPQVNMDD LGMVGYHEYD LTSYILKKDK KLSTEEKVEI NKWFEENKET LPDNKYKGLG KGKNLIIIQW ESLENFAINY KVDGQEITPN MNKLLSNSLC FDNIYEQNKN GTTSDAELMA NTSLLPISES AYFIQYPWKK QNTLQRLLEK HGYNTATAIA DKGGVWNWLE NHKSFGVQTI WDSSYFNRDE LIGSNITDGS LFRQTEEKIK TLKRPYYLFM ATATSHGPFD LPTNYRELKL PKEIDDTKLG GYLQSLRYTD KMLGEFLNKL KGDGVLDNSI IVIYGDHGGI NKYYKKELEN IDFANNNWKQ EYLKVPMLIY NPEIKGEVIN TYGGLVDLLP TVGYIMGVDK SDFEKTAMGR VLVNTNVNAT INSSGQILGN PKDEKEIRHL QDMYKISNNI IESNYFNN
|
| |