Gene CPR_2207 details


Gene Information

Locus tag: CPR_2207
Symbol:
ID: 4204389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2435726
End bp: 2437609
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 27%
IMG OID: 642566757
Product: sulfatase
Protein accession: YP_699507
Protein GI: 110801579
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
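
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the record itself states neither assumption. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2435726
end_bp = 2437609
reported_gene_length = 1884     # bp
reported_protein_length = 627   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length == reported_gene_length)           # 1884 True
print(protein_length, protein_length == reported_protein_length)  # 627 True
```

Under these assumptions the four fields agree: 1884 bp is 628 codons, of which 627 encode amino acids.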


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGAAA AAGTTAAATT AAATAACGGG CTTTCTAAAT TTAAGGAGTC TCTTAATAAA 
AATGCTTTAA TAAGACTAGG TTTTTACATA TTTACTCTTA TAGCCATAGT TTTAAAAGGA
GCACTTTTTT TAGGTTTCTC TTTAAACCAA AATCTTTACA CACTTAATTT TGGTTTAGGA
TATAGACAAG CTTCTTATTT TATTAATTAT TACATAGCAT TTACAGCAAT ATTTGTAAGT
ATATGTTTTT TATTTAAAAA CAAAGGAAAG TTTTTCTCAT TAATAATTGT AGATTTATTT
ATAACTTTAA TTACAGTAAT GGATATTTGG TATTTCAGAG GATTCCAAAC AGTTCCATCA
CTAATGCTAT TAAAACAAAC TGCTAACTTA GATAATCTTG GAGATAGTAT TTTTTCAATG
GCTAGTCCAT ATGACTTACT ATTCTTCGTA GACTTTATGA TTTTAATAAT AGCTTTTATA
ATTTTTAGAA AAAGCTTTAA AAACTGTAAA TCCAATTGGA AAGGTACCTT AATTGTTTTA
CTTATTTCAA TTTGTTATAT AGGATATGTT CCTTTTAACA TTAATGTATT AAAAAGAGAA
AATGTTAAAA ATTCATACTT ATTTAGTAAT TATGACCCAA CTAACACAGT AGAATACTTC
TCACCAATTG GTTATCATAT TTTTGATATA TATAATGTTT ATAAGAATTC TAAACCTTAT
AAAATGACAG CTGACGATGA AGCAAAAATA AAAGAATATT ATGATTTCAA AAATGAGAAT
CTTCCTGATA ATAAATTTAA GGGAATGTTC AAAGGAAAAA ATTTAATAGT AATACAAGTT
GAGTCCCTTG AAGACTTTGT TATAAATAAA AAAGTAGATG GACAAGAAAT AACTCCAAAC
ATAAATAAAT TATTAAATAA TTCAATTTAC TTACCTAATA TATTTGAACA AGTTAACGAA
GGTACAAGCT CTGATTCTGA CTTAATGGTT AATACTTCTA TGTTACCATT AAGACAAGGA
AGTACTTTCT TTAGAAATCC AGCTACAACT TATAACTCAT TACCTAATAT ATTAGAAAAG
GATGGCTATA GTACTATTGC TATCCATTCA GATAAAGGAT CTTTCTGGAA CTATGCTCAA
GGTTTAAATG GTATAGGTTT TGATAAATTT GTAGATTACT ATTCATTTGA TCGTGATGAA
AATATAGGTC TTGGATTAAG TGACGGAAGC TACTTTAGAC AAATTGAACC AATGATTAAA
GAATTAAAAC AACCATTCTA TGCATTTACA GTTACTTTAA CAAGCCACGG ACCATTTGAT
TTACCAAAGG AATACCGTCA ATTAAAACTT ACTCCTGAAC TTGATGACAA TGTTTTAGGA
GGATACTTCC AAAGTGTTCA CTATACAGAT GCTAAAATAG GAATGTTCAT AGAATCACTA
AAAAAAGATG GTCTTTTAGA TAACACTGTT ATTGCAATAG AAGGTGACCA TACTGGTCCT
CATAAATACT ATAACAGTAA GATAGAATCA CTACCTAATC CTGAACCTTG GTGGTTAGAC
AATGGAAATC ATACAGTTCC ATTAATTATC TATAATCCAA GCATTAAGAC ACCTGTAAAA
GATGATGTTT ACGGTGGTCA AATAGATATA ATGCCAACTC TTTTATATCT ATTAGGCGTA
GATAATAATG TATATCAAAA TACAGCTTTA GGTAGAAATC TATTAAACAC TAAGAGATCT
TACGCTGTTT TAACTGATAA AACAATTAAG GGTGAACTTA CAGATAAAGA AAAAGAAATA
GTAGGAAATG TATTAGATCT ATCTGATAAA ATGATTAGAG CTGATTACTT TAAAGATAAA
ATACCTAATT ATAATTCTAA TTAA
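
A GC content figure such as the one reported in the Gene Information section can be recomputed from a sequence in this 10-base block layout. The following is a minimal sketch, not the method used by the database: `gc_content` is a hypothetical helper, and it is run here only on the first line of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)

# First line of the gene sequence above (60 bases), spaces left in place.
first_line = "ATGCAAGAAA AAGTTAAATT AAATAACGGG CTTTCTAAAT TTAAGGAGTC TCTTAATAAA"
print(f"{gc_content(first_line):.0%}")  # 25% (15 G/C bases out of 60)
```

Passing the full gene sequence to the same function would give the whole-gene fraction for comparison with the reported value.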
 
Protein sequence
MQEKVKLNNG LSKFKESLNK NALIRLGFYI FTLIAIVLKG ALFLGFSLNQ NLYTLNFGLG 
YRQASYFINY YIAFTAIFVS ICFLFKNKGK FFSLIIVDLF ITLITVMDIW YFRGFQTVPS
LMLLKQTANL DNLGDSIFSM ASPYDLLFFV DFMILIIAFI IFRKSFKNCK SNWKGTLIVL
LISICYIGYV PFNINVLKRE NVKNSYLFSN YDPTNTVEYF SPIGYHIFDI YNVYKNSKPY
KMTADDEAKI KEYYDFKNEN LPDNKFKGMF KGKNLIVIQV ESLEDFVINK KVDGQEITPN
INKLLNNSIY LPNIFEQVNE GTSSDSDLMV NTSMLPLRQG STFFRNPATT YNSLPNILEK
DGYSTIAIHS DKGSFWNYAQ GLNGIGFDKF VDYYSFDRDE NIGLGLSDGS YFRQIEPMIK
ELKQPFYAFT VTLTSHGPFD LPKEYRQLKL TPELDDNVLG GYFQSVHYTD AKIGMFIESL
KKDGLLDNTV IAIEGDHTGP HKYYNSKIES LPNPEPWWLD NGNHTVPLII YNPSIKTPVK
DDVYGGQIDI MPTLLYLLGV DNNVYQNTAL GRNLLNTKRS YAVLTDKTIK GELTDKEKEI
VGNVLDLSDK MIRADYFKDK IPNYNSN
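
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration rather than the database's own pipeline. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation; table 11 differs in the alternative start codons it permits, which does not matter here because the gene starts with ATG. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    Bases other than A, C, G, T raise KeyError.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGCAAGAAA AAGTTAAATT AAATAACGGG CTTTCTAAAT TTAAGGAGTC TCTTAATAAA"
print(translate(first_line))  # MQEKVKLNNGLSKFKESLNK
```

The output matches the first two 10-residue blocks of the protein sequence above (MQEKVKLNNG LSKFKESLNK).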