Gene Information |
Locus tag | CPR_1459 |
Symbol | |
ID | 4205568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1639273 |
End bp | 1640928 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642566013 |
Product | SulP family sulfate permease |
Protein accession | YP_698778 |
Protein GI | 110802804 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
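The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1639273
END_BP = 1640928


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1656
print(protein_length(length_bp))  # 551
```

Under these assumptions the results agree with the listed Gene Length (1656 bp) and Protein Length (551 aa).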
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.032999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACAAAC CAAAATTAAT TTCTCTTTTA GATGATAAAG AGAGCGGATT TTCAAAAGAA CAATTTTTAA AAGATTTAAT CGCTGGTATA ATAGTTGCTA TTATAGCACT TCCCCTATCC ATTGCATTAG GTATTTCTTC AGGGGTATCT CCTGAAAAAG GATTAATAAC TGCAATCATA GCTGGATTCA TAATTTCATT ATTAGGAGGA AGTAGAGTTC AAATTGGTGG TCCTACTGGC GCCTTTGTTG TTATAGTATT TGGTATTATA CAAAACCATG GAGTTGATGG ACTAATAATT GCCACATTTA TGGCTGGTAT TATTCTTGTT TTATTTGGTT TATTACGATT TGGTAGCTTA ATAAAATATA TACCTTATCC AATAACGGTA GGATTTACCT CTGGTATAGC TATAACTCTT TTATCAACAC AAGTTAAGGA TTTTTTAGGA CTTTCAATTA CTAAAACCCC CTCTGAGTTT ATACCTAAGT GGGAAGCTTA CATATCTCAT ATGAATACTA CAAACCTTTA TACCTTAGCT ATAGGATTAC TAGCACTTAT TATTTTAATC TTTTGGCCAA AAATAAATAA AAAGATTCCA GGATCTTTAA TAGCCTTAAT AGTAACAACT TTAGTAGTAT TTATATTTAA TCTACCAGTT GCGACAATAG GAAGTCAATT TGGTAAAATA AGCTCAAATA TTCCAATACC TCATATTCCT AATCTAAATC TTAATACATT AAAAGCATTA ATAGGACCTG CTTTTACAAT AGCTCTTTTA GGTGGAATTG AATCTTTATT ATCTGCTGTT GTTTCAGATG GTATGATTGG AGACAAGCAT AATTCAAATG CAGAACTTAT AGCACAAGGA TTAGCTAATA TGGGTTCTTC TTTATTTGGA GGAATTCCTG CTACTGGAGC AATTGCTAGA ACTGCTGCCA ATGTTAAAAA CGGGGGAAGA ACTCCTATTT CTGGTATGGT TCACTCAATA ACTTTATTAC TTATAATGCT TGTATTTATG CCTCTTGCTA AATTCATTCC ATTAACTACT TTATCAGCAA TATTAATAAT TGTTTCATAT AACATGAGTG AATGGAGAAC TTTTAAAGCA ATACTTAAGG CTCCTAAAAG TGATATAGCT ATATTACTAA TAACATTTTT CTTGACAGTA TTATTTGATT TAGTAATTGC TATAGGGATA GGAATGATAG TTTCTATGTG CTTATTTATA AGAAGAGTTG CTACTTCTAT AGAAGTAAAT GAATTAAATG AAAGTGACTG TTCTTATAAA TCTAATATAG ATACTGATAT GGAAAATCTT AAAGTTGGAG AAAATGTCTT AGTTTATGAT ATAAGAGGTC ACCTTTTCTT TGGTGCTGTA GATACATTTA TGAATACAAT GAAGGAAATA AATGATGATG CAAAGGTTCT TGTTTTAAGA ATGAGACATA CTAAGACTTT AGATGTTACA GGCTATAAAC AAATAAAAAA TATAGCTCTA AGTTGTAAGT CTCGTAATAT GACTTTAATA ATATCTGAAT TACAAGAACA ACCAAAAAAA GTTATGAGAC TTATGGGATT TATAGATACT TTAGGTGAAG ATCACTTTGC TACAAATTTT GATGAGGCTT TAGAAAAAGC AAATTCTTTA ATTTAG
|
Protein sequence | MYKPKLISLL DDKESGFSKE QFLKDLIAGI IVAIIALPLS IALGISSGVS PEKGLITAII AGFIISLLGG SRVQIGGPTG AFVVIVFGII QNHGVDGLII ATFMAGIILV LFGLLRFGSL IKYIPYPITV GFTSGIAITL LSTQVKDFLG LSITKTPSEF IPKWEAYISH MNTTNLYTLA IGLLALIILI FWPKINKKIP GSLIALIVTT LVVFIFNLPV ATIGSQFGKI SSNIPIPHIP NLNLNTLKAL IGPAFTIALL GGIESLLSAV VSDGMIGDKH NSNAELIAQG LANMGSSLFG GIPATGAIAR TAANVKNGGR TPISGMVHSI TLLLIMLVFM PLAKFIPLTT LSAILIIVSY NMSEWRTFKA ILKAPKSDIA ILLITFFLTV LFDLVIAIGI GMIVSMCLFI RRVATSIEVN ELNESDCSYK SNIDTDMENL KVGENVLVYD IRGHLFFGAV DTFMNTMKEI NDDAKVLVLR MRHTKTLDVT GYKQIKNIAL SCKSRNMTLI ISELQEQPKK VMRLMGFIDT LGEDHFATNF DEALEKANSL I
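The sequence fields above are printed in space-separated blocks of ten residues. A minimal sketch of how such a field might be cleaned up and its GC fraction computed is shown below. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the whole-gene GC content listed in the table.

```python
def clean_sequence(field: str) -> str:
    """Remove the whitespace used to group residues into blocks of ten."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Toy input: the first three blocks of the gene sequence field above.
example = clean_sequence("ATGTACAAAC CAAAATTAAT TTCTCTTTTA")
print(len(example))                    # 30
print(round(gc_fraction(example), 2))  # 0.2
```

Applying the same two functions to the full gene sequence field would give a value that can be compared with the GC content row.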