Gene CPR_0575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0575 
Symbol 
ID4206111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp683456 
End bp685282 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content26% 
IMG OID642565135 
Productsulfatase family protein 
Protein accessionYP_697902 
Protein GI110803822 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.122436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA ATGAATATAG ATTTTTATCA ATACTTAATA ACAAATTTTT ATTATTTTTC 
CTCTTTTCTT TATTGATAAT AATAAAAGAA GTATTTTTTA CTTGGATATG GTCAAGTAGT
GATAGTATTG CTAAATTGCA AATTTTTAAT ATGTATATGT ACTGGCCAAA GCTATTAATA
CATATAGTTT TTGCTATGAC TATTGCAAGT GGAATTTTTT TATTTAATAA AAATGGAAGA
ATAATTTATA TATTAATAGC AGATATTGTT ATAACAATTC TTATGTTTTT AGATATTTCA
TATTATAGAA ATTACGGGAA TTTTTTATCA ATAAGACATT TATTTCATGA AGAGTTGTTT
AATCCTTTAA ATAAGGAATT ATTTAATTTT TATAAAAGAG ATATTCTTCT TTTAATTGAT
TTTATAATTT TAGTTCCTCT ATCAATTTTT TCATTAAAGA ATGATAGTGG TAAAAAGAGT
AGAAGAAGTA TAAAGATATT TATACTAAGC TGGATAATAA ATGGAATAAT TATATACACA
AGCCATTCTT TAGTAGATAT AAAAGGGGTT ACTAATGGTA AATTAACATT GTTTGAAAAA
TCATTTGCTC CACAAGTTAA TATGGATGAT TTGGGAATGG TAGGATATCA TGAATATGAC
TTAACAAGCT ATATTTTAAA AAAAGACAAA AAGCTAAGTA CAGAAGAAAA GGTTGAAATA
AATAAATGGT TTGAAGAGAA TAAAGAGACT TTACCTGATA ATAAATATAA AGGACTTGGT
AAAGGAAAAA ATCTTATAAT TATACAGTGG GAATCTTTAG AAAATTTTGC TATTAATTAT
AAAGTTGATG GTCAAGAAAT AACTCCTAAC ATGAACAAAT TATTAAGTAA TTCACTATGT
TTTGACAATA TTTATGAGCA AAATAAAAAT GGGACAACTT CTGATGCTGA ATTAATGGCT
AATACATCAT TATTACCAAT AAGTGAAAGT GCGTATTTTA TACAATATCC ATGGAAAAAA
CAAAATACAC TTCAAAGACT TTTAGAGAAA CACGGGTATA ATACAGCTAC AGCGATTGCT
GATAAGGGTG GAGTATGGAA CTGGTTAGAA AATCATAAAA GTTTTGGTGT ACAGACTATA
TGGGATAGCA GTTATTTCAA TAGAGATGAA TTGATAGGAT CAAATATTAC AGACGGAAGT
TTATTTAGAC AAACAGAAGA AAAAATAAAA ACATTAAAAA GACCATATTA TTTGTTTATG
GCAACAGCAA CATCTCACGG TCCATTTGAT CTTCCTACTA ACTATAGAGA ACTTAAATTA
CCTAAAGAAA TTGATGATAC TAAATTAGGT GGTTATCTTC AAAGTTTAAG ATATACTGAT
AAAATGCTTG GAGAATTCTT GAATAAACTT AAAGGTGATG GAGTGCTAGA TAATAGTATT
ATTGTCATTT ATGGAGATCA TGGAGGAATA AATAAATATT ATAAAAAAGA GTTAGAAAAT
ATAGATTTTG CTAATAACAA TTGGAAGCAG GAATACTTAA AGGTACCTAT GTTGATATAT
AATCCAGAAA TTAAAGGTGA AGTTATAAAT ACATATGGTG GATTGGTAGA TCTTTTACCT
ACTGTTGGAT ATATCATGGG AGTTGACAAA AGTGATTTTG AAAAAACAGC AATGGGAAGA
GTTTTAGTAA ATACGAATGT AAATGCTACA ATAAATTCAA GTGGTCAAAT TTTAGGTAAT
CCTAAAGATG AAAAAGAGAT AAGGCATCTA CAAGATATGT ATAAAATTAG TAATAATATA
ATAGAGAGCA ATTATTTTAA TAATTAA
 
Protein sequence
MKKNEYRFLS ILNNKFLLFF LFSLLIIIKE VFFTWIWSSS DSIAKLQIFN MYMYWPKLLI 
HIVFAMTIAS GIFLFNKNGR IIYILIADIV ITILMFLDIS YYRNYGNFLS IRHLFHEELF
NPLNKELFNF YKRDILLLID FIILVPLSIF SLKNDSGKKS RRSIKIFILS WIINGIIIYT
SHSLVDIKGV TNGKLTLFEK SFAPQVNMDD LGMVGYHEYD LTSYILKKDK KLSTEEKVEI
NKWFEENKET LPDNKYKGLG KGKNLIIIQW ESLENFAINY KVDGQEITPN MNKLLSNSLC
FDNIYEQNKN GTTSDAELMA NTSLLPISES AYFIQYPWKK QNTLQRLLEK HGYNTATAIA
DKGGVWNWLE NHKSFGVQTI WDSSYFNRDE LIGSNITDGS LFRQTEEKIK TLKRPYYLFM
ATATSHGPFD LPTNYRELKL PKEIDDTKLG GYLQSLRYTD KMLGEFLNKL KGDGVLDNSI
IVIYGDHGGI NKYYKKELEN IDFANNNWKQ EYLKVPMLIY NPEIKGEVIN TYGGLVDLLP
TVGYIMGVDK SDFEKTAMGR VLVNTNVNAT INSSGQILGN PKDEKEIRHL QDMYKISNNI
IESNYFNN