Gene Information |
Locus tag | CPR_2207 |
Symbol | |
ID | 4204389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 2435726 |
End bp | 2437609 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 642566757 |
Product | sulfatase |
Protein accession | YP_699507 |
Protein GI | 110801579 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
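The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the Start bp and End bp values are 1-based and inclusive (which the reported Gene Length is consistent with) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2435726
END_BP = 2437609
GENE_LENGTH_BP = 1884
PROTEIN_LENGTH_AA = 627


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the table if the assumptions hold.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2437609 - 2435726 + 1 gives 1884 bp, and 1884 / 3 - 1 gives 627 aa, matching the Gene Length and Protein Length rows.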
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAAA AAGTTAAATT AAATAACGGG CTTTCTAAAT TTAAGGAGTC TCTTAATAAA AATGCTTTAA TAAGACTAGG TTTTTACATA TTTACTCTTA TAGCCATAGT TTTAAAAGGA GCACTTTTTT TAGGTTTCTC TTTAAACCAA AATCTTTACA CACTTAATTT TGGTTTAGGA TATAGACAAG CTTCTTATTT TATTAATTAT TACATAGCAT TTACAGCAAT ATTTGTAAGT ATATGTTTTT TATTTAAAAA CAAAGGAAAG TTTTTCTCAT TAATAATTGT AGATTTATTT ATAACTTTAA TTACAGTAAT GGATATTTGG TATTTCAGAG GATTCCAAAC AGTTCCATCA CTAATGCTAT TAAAACAAAC TGCTAACTTA GATAATCTTG GAGATAGTAT TTTTTCAATG GCTAGTCCAT ATGACTTACT ATTCTTCGTA GACTTTATGA TTTTAATAAT AGCTTTTATA ATTTTTAGAA AAAGCTTTAA AAACTGTAAA TCCAATTGGA AAGGTACCTT AATTGTTTTA CTTATTTCAA TTTGTTATAT AGGATATGTT CCTTTTAACA TTAATGTATT AAAAAGAGAA AATGTTAAAA ATTCATACTT ATTTAGTAAT TATGACCCAA CTAACACAGT AGAATACTTC TCACCAATTG GTTATCATAT TTTTGATATA TATAATGTTT ATAAGAATTC TAAACCTTAT AAAATGACAG CTGACGATGA AGCAAAAATA AAAGAATATT ATGATTTCAA AAATGAGAAT CTTCCTGATA ATAAATTTAA GGGAATGTTC AAAGGAAAAA ATTTAATAGT AATACAAGTT GAGTCCCTTG AAGACTTTGT TATAAATAAA AAAGTAGATG GACAAGAAAT AACTCCAAAC ATAAATAAAT TATTAAATAA TTCAATTTAC TTACCTAATA TATTTGAACA AGTTAACGAA GGTACAAGCT CTGATTCTGA CTTAATGGTT AATACTTCTA TGTTACCATT AAGACAAGGA AGTACTTTCT TTAGAAATCC AGCTACAACT TATAACTCAT TACCTAATAT ATTAGAAAAG GATGGCTATA GTACTATTGC TATCCATTCA GATAAAGGAT CTTTCTGGAA CTATGCTCAA GGTTTAAATG GTATAGGTTT TGATAAATTT GTAGATTACT ATTCATTTGA TCGTGATGAA AATATAGGTC TTGGATTAAG TGACGGAAGC TACTTTAGAC AAATTGAACC AATGATTAAA GAATTAAAAC AACCATTCTA TGCATTTACA GTTACTTTAA CAAGCCACGG ACCATTTGAT TTACCAAAGG AATACCGTCA ATTAAAACTT ACTCCTGAAC TTGATGACAA TGTTTTAGGA GGATACTTCC AAAGTGTTCA CTATACAGAT GCTAAAATAG GAATGTTCAT AGAATCACTA AAAAAAGATG GTCTTTTAGA TAACACTGTT ATTGCAATAG AAGGTGACCA TACTGGTCCT CATAAATACT ATAACAGTAA GATAGAATCA CTACCTAATC CTGAACCTTG GTGGTTAGAC AATGGAAATC ATACAGTTCC ATTAATTATC TATAATCCAA GCATTAAGAC ACCTGTAAAA GATGATGTTT ACGGTGGTCA AATAGATATA ATGCCAACTC TTTTATATCT ATTAGGCGTA GATAATAATG TATATCAAAA TACAGCTTTA GGTAGAAATC TATTAAACAC TAAGAGATCT TACGCTGTTT TAACTGATAA AACAATTAAG GGTGAACTTA CAGATAAAGA AAAAGAAATA 
GTAGGAAATG TATTAGATCT ATCTGATAAA ATGATTAGAG CTGATTACTT TAAAGATAAA ATACCTAATT ATAATTCTAA TTAA
|
Protein sequence | MQEKVKLNNG LSKFKESLNK NALIRLGFYI FTLIAIVLKG ALFLGFSLNQ NLYTLNFGLG YRQASYFINY YIAFTAIFVS ICFLFKNKGK FFSLIIVDLF ITLITVMDIW YFRGFQTVPS LMLLKQTANL DNLGDSIFSM ASPYDLLFFV DFMILIIAFI IFRKSFKNCK SNWKGTLIVL LISICYIGYV PFNINVLKRE NVKNSYLFSN YDPTNTVEYF SPIGYHIFDI YNVYKNSKPY KMTADDEAKI KEYYDFKNEN LPDNKFKGMF KGKNLIVIQV ESLEDFVINK KVDGQEITPN INKLLNNSIY LPNIFEQVNE GTSSDSDLMV NTSMLPLRQG STFFRNPATT YNSLPNILEK DGYSTIAIHS DKGSFWNYAQ GLNGIGFDKF VDYYSFDRDE NIGLGLSDGS YFRQIEPMIK ELKQPFYAFT VTLTSHGPFD LPKEYRQLKL TPELDDNVLG GYFQSVHYTD AKIGMFIESL KKDGLLDNTV IAIEGDHTGP HKYYNSKIES LPNPEPWWLD NGNHTVPLII YNPSIKTPVK DDVYGGQIDI MPTLLYLLGV DNNVYQNTAL GRNLLNTKRS YAVLTDKTIK GELTDKEKEI VGNVLDLSDK MIRADYFKDK IPNYNSN
|
| |
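The relationship between the Gene sequence and Protein sequence rows can be illustrated by translating the first and last few codons. This is a minimal sketch, not the pipeline used to produce the record: it uses the standard codon assignments, which translation table 11 shares for sense codons (table 11 differs mainly in which start codons are permitted). The `translate` helper and the `GENE_PREFIX` and `GENE_SUFFIX` names are hypothetical; the two fragments are copied from the start and end of the Gene sequence row.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 24 nt of the gene sequence shown above (both in frame).
GENE_PREFIX = "ATGCAAGAAA AAGTTAAATT AAATAACGGG"
GENE_SUFFIX = "ATACCTAATT ATAATTCTAA TTAA"

print(translate(GENE_PREFIX))  # MQEKVKLNNG
print(translate(GENE_SUFFIX))  # IPNYNSN
```

The two outputs match the first ten and last seven residues of the Protein sequence row, with the final TAA read as the stop codon.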