Gene CPF_2501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2501 
Symbol 
ID4201937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2772216 
End bp2774105 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content27% 
IMG OID638083366 
Productsulfatase 
Protein accessionYP_696915 
Protein GI110801025 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGAAA AAGTTAAATT AAATAATGGG CTTTCTAAAT TTAAGGAGTC TCTTAATAAA 
AATGCTTTAA TAAGACTAGG TTTTTACATA TTTACTCTTA TAGCCATAGT TTTAAAAGGA
GCACTTTTTT TAGGTTTCTC TTTAAACCAA AACCTTTACA CACTTAATTT TGGTTTAGGA
TATAGACAAG CTTCTTATTT TATTAATTAT TACATAGCAT TTGCAGCAAT ATTTGTAAGT
ATATGTTTTT TATTTAAAAA CAAAGGTAAA TTTTTCTCAT TAATAATTGT AGATTTATTT
ATAACCTTAA TTACAGTAAT GGATATTTGG TATTTTAGAG GATTCCAAAC AGTTCCATCA
GTAATGCTAT TAAAACAAAC TGCTAACTTA GATAATCTTG GAGATAGTAT TTTTTCAATG
GCTAGTCCAT ATGACTTACT ATTCTTCGTA GACTTTATTA TTTTAATAAT AGCTTTTATA
ATTTTTAGAA AAAGCTTTAA AAACTGTAAG TCTAATTGGA AAGGTACTTT AATTGTTTTA
CTTGTATCAA TTTGTTATAT AGGCTATGTT CCTTTTAACG TTAATGTATT AAAAAGAGAA
AATGTTAAAA ATTCATACTT ATTTAGTAAC TATGATCCAA CTAACACAGT AGAATACTTC
TCACCAATTG GTTATCATAT TTTTGATATA TATAATGTTT ATAAGAATTC TAAACCTTAT
AAAATGACAG CTGATGATGA AGCAAAAATA AAAGAATATT ATGATTTCAA AAATGAGAAT
CTTCCTGATA ATGAATTTAA GGGAATGTTC AAAGGAAAGA ACTTAATAGT AATACAAGTT
GAGTCCCTTG AAGACTTTGT TATAAATAAA AAAGTAGATG GACAAGAAAT AACTCCAAAC
ATAAATAAGT TATTAAATAA TTCAATTTAC TTACCTAATA TATTTGAACA AGTTAATGAA
GGTACAAGCT CTGATTCTGA CTTAATGGTT AATACTTCTA TGTTACCATT AAGACAAGGA
AGTACTTTCT TTAGAAATCC AGCTACAACT TATAACTCAT TACCTAATAT CTTAGAAAAA
GATGGCTATA GCACTATTGC TATCCATTCA GATAAAGGTT CTTTCTGGAA CTATGCTCAA
GGTTTAAATG GTATAGGTTT TGATAAATTT GTAGATTACT ATTCATTTGA TCGTGATGAA
AATATAGGTC TTGGATTAAG TGACGGAAGC TACTTTAGAC AAATTGAACC AATGATTAAA
GAATTAAAAC AACCATTCTA TGCATTTACA GTTACTTTAA CAAGCCACGG ACCATTTGAT
TTACCAAAGG AATACCGTGA ATTAAAACTT AGCCCTGAAC TTGATGACAA TGTTTTAGGA
GGATATTTCC AAAGTATTCA TTATACAGAT GCTAAAATAG GAATGTTCAT AGAATCACTA
AAAAAAGATG GTCTTTTAGA TAATACTGTT ATTGCAATAG AAGGTGACCA TGCTGGTCCT
CATAAATACT ATAACAGTAA GATAGAATCC TTATCTAATC CTGAATCTTG GTGGTTAGAC
AATGGAAATC ATACAGTTCC ATTAATTATC TATAATCCAA GCATTAAGAC ACCTGTAAAA
GACGATGTTT ACGGTGGTCA AATAGATATA ATGCCAACTC TTTTATATCT ATTAGGCGTA
GATAATAATG TATATCAAAA TACAGCTTTA GGTAGAAATC TATTAAACAC TAAGAGATCT
TACGCTGTTT TAACTGATAA AACAATTAAA GGTGAACTTA CAGATAAAGA AAAAGAAATA
GTAGGAAATG TATTAGATCT ATCTGATAAA ATGATTAGAG CAGATTATTT TAAAGATAAA
ATACCTAATG ATAATTCTAA AAATAATTAA
 
Protein sequence
MQEKVKLNNG LSKFKESLNK NALIRLGFYI FTLIAIVLKG ALFLGFSLNQ NLYTLNFGLG 
YRQASYFINY YIAFAAIFVS ICFLFKNKGK FFSLIIVDLF ITLITVMDIW YFRGFQTVPS
VMLLKQTANL DNLGDSIFSM ASPYDLLFFV DFIILIIAFI IFRKSFKNCK SNWKGTLIVL
LVSICYIGYV PFNVNVLKRE NVKNSYLFSN YDPTNTVEYF SPIGYHIFDI YNVYKNSKPY
KMTADDEAKI KEYYDFKNEN LPDNEFKGMF KGKNLIVIQV ESLEDFVINK KVDGQEITPN
INKLLNNSIY LPNIFEQVNE GTSSDSDLMV NTSMLPLRQG STFFRNPATT YNSLPNILEK
DGYSTIAIHS DKGSFWNYAQ GLNGIGFDKF VDYYSFDRDE NIGLGLSDGS YFRQIEPMIK
ELKQPFYAFT VTLTSHGPFD LPKEYRELKL SPELDDNVLG GYFQSIHYTD AKIGMFIESL
KKDGLLDNTV IAIEGDHAGP HKYYNSKIES LSNPESWWLD NGNHTVPLII YNPSIKTPVK
DDVYGGQIDI MPTLLYLLGV DNNVYQNTAL GRNLLNTKRS YAVLTDKTIK GELTDKEKEI
VGNVLDLSDK MIRADYFKDK IPNDNSKNN