Gene Information |
Locus tag | CPF_2501 |
Symbol | |
ID | 4201937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 2772216 |
End bp | 2774105 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 638083366 |
Product | sulfatase |
Protein accession | YP_696915 |
Protein GI | 110801025 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
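The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon which is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the Gene Information table for CPF_2501.
length = gene_length_bp(2772216, 2774105)
print(length)                     # 1890
print(protein_length_aa(length))  # 629
```

Both results match the Gene Length (1890 bp) and Protein Length (629 aa) fields of the record.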
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAAA AAGTTAAATT AAATAATGGG CTTTCTAAAT TTAAGGAGTC TCTTAATAAA AATGCTTTAA TAAGACTAGG TTTTTACATA TTTACTCTTA TAGCCATAGT TTTAAAAGGA GCACTTTTTT TAGGTTTCTC TTTAAACCAA AACCTTTACA CACTTAATTT TGGTTTAGGA TATAGACAAG CTTCTTATTT TATTAATTAT TACATAGCAT TTGCAGCAAT ATTTGTAAGT ATATGTTTTT TATTTAAAAA CAAAGGTAAA TTTTTCTCAT TAATAATTGT AGATTTATTT ATAACCTTAA TTACAGTAAT GGATATTTGG TATTTTAGAG GATTCCAAAC AGTTCCATCA GTAATGCTAT TAAAACAAAC TGCTAACTTA GATAATCTTG GAGATAGTAT TTTTTCAATG GCTAGTCCAT ATGACTTACT ATTCTTCGTA GACTTTATTA TTTTAATAAT AGCTTTTATA ATTTTTAGAA AAAGCTTTAA AAACTGTAAG TCTAATTGGA AAGGTACTTT AATTGTTTTA CTTGTATCAA TTTGTTATAT AGGCTATGTT CCTTTTAACG TTAATGTATT AAAAAGAGAA AATGTTAAAA ATTCATACTT ATTTAGTAAC TATGATCCAA CTAACACAGT AGAATACTTC TCACCAATTG GTTATCATAT TTTTGATATA TATAATGTTT ATAAGAATTC TAAACCTTAT AAAATGACAG CTGATGATGA AGCAAAAATA AAAGAATATT ATGATTTCAA AAATGAGAAT CTTCCTGATA ATGAATTTAA GGGAATGTTC AAAGGAAAGA ACTTAATAGT AATACAAGTT GAGTCCCTTG AAGACTTTGT TATAAATAAA AAAGTAGATG GACAAGAAAT AACTCCAAAC ATAAATAAGT TATTAAATAA TTCAATTTAC TTACCTAATA TATTTGAACA AGTTAATGAA GGTACAAGCT CTGATTCTGA CTTAATGGTT AATACTTCTA TGTTACCATT AAGACAAGGA AGTACTTTCT TTAGAAATCC AGCTACAACT TATAACTCAT TACCTAATAT CTTAGAAAAA GATGGCTATA GCACTATTGC TATCCATTCA GATAAAGGTT CTTTCTGGAA CTATGCTCAA GGTTTAAATG GTATAGGTTT TGATAAATTT GTAGATTACT ATTCATTTGA TCGTGATGAA AATATAGGTC TTGGATTAAG TGACGGAAGC TACTTTAGAC AAATTGAACC AATGATTAAA GAATTAAAAC AACCATTCTA TGCATTTACA GTTACTTTAA CAAGCCACGG ACCATTTGAT TTACCAAAGG AATACCGTGA ATTAAAACTT AGCCCTGAAC TTGATGACAA TGTTTTAGGA GGATATTTCC AAAGTATTCA TTATACAGAT GCTAAAATAG GAATGTTCAT AGAATCACTA AAAAAAGATG GTCTTTTAGA TAATACTGTT ATTGCAATAG AAGGTGACCA TGCTGGTCCT CATAAATACT ATAACAGTAA GATAGAATCC TTATCTAATC CTGAATCTTG GTGGTTAGAC AATGGAAATC ATACAGTTCC ATTAATTATC TATAATCCAA GCATTAAGAC ACCTGTAAAA GACGATGTTT ACGGTGGTCA AATAGATATA ATGCCAACTC TTTTATATCT ATTAGGCGTA GATAATAATG TATATCAAAA TACAGCTTTA GGTAGAAATC TATTAAACAC TAAGAGATCT TACGCTGTTT TAACTGATAA AACAATTAAA GGTGAACTTA CAGATAAAGA AAAAGAAATA 
GTAGGAAATG TATTAGATCT ATCTGATAAA ATGATTAGAG CAGATTATTT TAAAGATAAA ATACCTAATG ATAATTCTAA AAATAATTAA
|
Protein sequence | MQEKVKLNNG LSKFKESLNK NALIRLGFYI FTLIAIVLKG ALFLGFSLNQ NLYTLNFGLG YRQASYFINY YIAFAAIFVS ICFLFKNKGK FFSLIIVDLF ITLITVMDIW YFRGFQTVPS VMLLKQTANL DNLGDSIFSM ASPYDLLFFV DFIILIIAFI IFRKSFKNCK SNWKGTLIVL LVSICYIGYV PFNVNVLKRE NVKNSYLFSN YDPTNTVEYF SPIGYHIFDI YNVYKNSKPY KMTADDEAKI KEYYDFKNEN LPDNEFKGMF KGKNLIVIQV ESLEDFVINK KVDGQEITPN INKLLNNSIY LPNIFEQVNE GTSSDSDLMV NTSMLPLRQG STFFRNPATT YNSLPNILEK DGYSTIAIHS DKGSFWNYAQ GLNGIGFDKF VDYYSFDRDE NIGLGLSDGS YFRQIEPMIK ELKQPFYAFT VTLTSHGPFD LPKEYRELKL SPELDDNVLG GYFQSIHYTD AKIGMFIESL KKDGLLDNTV IAIEGDHAGP HKYYNSKIES LSNPESWWLD NGNHTVPLII YNPSIKTPVK DDVYGGQIDI MPTLLYLLGV DNNVYQNTAL GRNLLNTKRS YAVLTDKTIK GELTDKEKEI VGNVLDLSDK MIRADYFKDK IPNDNSKNN
|
| |
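The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that formatting, compute GC content, and translate a CDS. It is a minimal illustration and not the database's own pipeline. The helper names are hypothetical. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares for elongation codons; alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGCAAGAAA AAGTTAAATT AAATAATGGG"
print(translate(prefix))  # MQEKVKLNNG
print(gc_percent(prefix))
```

The translated prefix matches the first block of the Protein sequence field. Applying `gc_percent` to the full gene sequence would be the way to compare against the GC content field.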