Gene CPR_0570 details


Gene Information

Locus tag: CPR_0570
Symbol:
ID: 4205462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 678647
End bp: 680455
Gene Length: 1809 bp
Protein Length: 602 aa
Translation table: 11
GC content: 29%
IMG OID: 642565130
Product: sulfatase family protein
Protein accession: YP_697897
Protein GI: 110803087
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
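
The coordinates and lengths listed above are mutually consistent, which the short sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(678647, 680455)
print(length, protein_length(length))  # prints: 1809 602
```

Both values match the Gene Length and Protein Length fields of this record.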


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGAAA AAGTAAAAAG TTTTTTTAAG TGTAACTGGG TTTTTATTTT CTTAGTGATA 
ACACTTCAAA TAAAATCAAT GTTACTTTTA TCTATGCTTA GAACTCCGGG GTCTAGAGGA
ATTAATTTTG ACTTAATGTA TTTTACTCCC CCTGCTTGGT GGGCTCATAT AGCAATAGTT
ACGTTAATAG CGAGTTTTGT TTATCTATTT AAAGGTAAAG GAAGAATATG GGCCGGTATA
GTTATTGATA TTCTAGTAAC AATTTTATTT GTGGCTGATA TTTGGTATTA CAGAGTAAAT
GGAACTTTTT TATCAATAAG ACATATAATT GAGCCAGGAA TATTTAATCC TGTAGGAAAG
AGCTTATTTA ATTTAGCTAA AGTAGATGTA GTGTTCATAG TTGATTTTAT AATATTATTT
TTAGTTTATA ACTTTACAGG CTTAAAAAAT GTTAAATATA AAAATAATAT AAAAACTAGA
TTAATTGCAT TTATATGCCT TTTTGGAATA AGTGCAACAG TAATAGGGGT ATCACACTAC
TATATAGATA TTGAAAAAAA ATCAGATAAA AGTTTTTTAA GAATATCATG GGCGCCTTTT
CAAACAATTA GTGATTCAAG TCCATTAGGA TATCATGGAT ATGATATTTA TTATTATGCT
AACAGAAAAG AAACTTTAAC TGATGCTCAA AAAAATGAAA TTAAAACTTG GTTTGATGAA
AACAAAGAAG ATTTACCAGA TAACAAGTAT AAAGGTATGC TTCAAGGAAA AAATGTTATA
GCCTTGCAGG TTGAGTCTCT TGAAAACTTT GTTATAGGTA AAAAAGTTTA TGGTCAAGAG
ATTACTCCAA ACATAAATAA ACTTTTAAAG AATAGTTTAT ATTTTGATAA TATAAAAGAA
CAAAACAATT CAGGAACAAG TTCAGATTGC GATATAATGG TTAATACATC AATACTTCCT
GTTAGAGAAG GAACTACAGT ATTTGGTTAT CCATGGGCTG AATATAATAC TTTACAGAAA
ATATTAAAGA GTAAGGGATA TTCAACAGTT TCAACACATC CAGAGGTTCC TGGAAATTGG
AACTGGGCAG AGGTTCATAA GGCATTTAAA GCTGATGAAA TATGGGATGC TCATCAGTTT
GATCAAAGTG AAATCATAGG ACTTGGAATG TCAGATGAAT CTTATTTAAG ACAGGTTGGA
GAAAGATTAA AGAGTCAAAA ACAACCTTTC TATACATTCT TAGTAACATT AACAAGTCAT
GGACCATTTG ATATGCCTAA GGATAAACAA TATTTAAATT TACCAGAAGA TTTAAATGAA
AATATGTTAG GTGCATATTT CCAAAGTGTT AGATATACAG ATGAAGCTAT TGGAGAATTT
ATAAATCAAT TGAAAGAAGA AGGTCTTTTA GACAATACTG TAATAATGAT TTATGGAGAT
CATGGTGGAG TTCATAAATT CTATGAAGAT AAAATAAAAG ATGCTCCTTT AGAGGGAGAT
TGGTGGAAAG ATGATGAAAA GGAAATACCT TTCTTAATAT ACAATCCAAG TATTAATGGA
GAGACTATAT CAAAAGAAGG TGGTCAAATA GATTTCTTAC CAACTATAGC ATACCTTTTA
GGATTTAATA GAGATACTTT TGATAATACA GCAATGGGAA GAGTATTAGT AAATACTAAT
AGAAATGCAA GCATATTAAA TAATGGAGAA ATAGTTGGAA ATCCAACACC TGAAGAAAAA
GCTCATTTAG AGAAATCATT TAATATTGCA GATATGATTA TACAAGGAAA TTATTTTAAA
AATAATTAA
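
The GC content field of this record can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here; `gc_percent` is a hypothetical helper name, and the short input in the example is only the first two blocks of the gene, not the full sequence.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence (illustration only).
print(gc_percent("ATGAAGGAAA AAGTAAAAAG"))  # prints: 25.0
```

Passing the complete 1809 bp sequence instead of the two-block excerpt gives the value to compare against the GC content field.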
 
Protein sequence
MKEKVKSFFK CNWVFIFLVI TLQIKSMLLL SMLRTPGSRG INFDLMYFTP PAWWAHIAIV 
TLIASFVYLF KGKGRIWAGI VIDILVTILF VADIWYYRVN GTFLSIRHII EPGIFNPVGK
SLFNLAKVDV VFIVDFIILF LVYNFTGLKN VKYKNNIKTR LIAFICLFGI SATVIGVSHY
YIDIEKKSDK SFLRISWAPF QTISDSSPLG YHGYDIYYYA NRKETLTDAQ KNEIKTWFDE
NKEDLPDNKY KGMLQGKNVI ALQVESLENF VIGKKVYGQE ITPNINKLLK NSLYFDNIKE
QNNSGTSSDC DIMVNTSILP VREGTTVFGY PWAEYNTLQK ILKSKGYSTV STHPEVPGNW
NWAEVHKAFK ADEIWDAHQF DQSEIIGLGM SDESYLRQVG ERLKSQKQPF YTFLVTLTSH
GPFDMPKDKQ YLNLPEDLNE NMLGAYFQSV RYTDEAIGEF INQLKEEGLL DNTVIMIYGD
HGGVHKFYED KIKDAPLEGD WWKDDEKEIP FLIYNPSING ETISKEGGQI DFLPTIAYLL
GFNRDTFDNT AMGRVLVNTN RNASILNNGE IVGNPTPEEK AHLEKSFNIA DMIIQGNYFK
NN
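
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own method: it relies on the fact that NCBI translation table 11 assigns codons to amino acids exactly as the standard genetic code does (the tables differ in permitted start codons), and it assumes the input contains only unambiguous A, C, G, T bases. The name `translate` is a hypothetical helper.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence in this record.
print(translate("ATGAAGGAAA AAGTAAAAAG TTTTTTTAAG"))  # prints: MKEKVKSFFK
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (AAT AAT TAA) translate to the terminal NN followed by a stop.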