Gene CPR_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1459 
Symbol 
ID4205568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1639273 
End bp1640928 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content30% 
IMG OID642566013 
ProductSulP family sulfate permease 
Protein accessionYP_698778 
Protein GI110802804 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.032999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAAAC CAAAATTAAT TTCTCTTTTA GATGATAAAG AGAGCGGATT TTCAAAAGAA 
CAATTTTTAA AAGATTTAAT CGCTGGTATA ATAGTTGCTA TTATAGCACT TCCCCTATCC
ATTGCATTAG GTATTTCTTC AGGGGTATCT CCTGAAAAAG GATTAATAAC TGCAATCATA
GCTGGATTCA TAATTTCATT ATTAGGAGGA AGTAGAGTTC AAATTGGTGG TCCTACTGGC
GCCTTTGTTG TTATAGTATT TGGTATTATA CAAAACCATG GAGTTGATGG ACTAATAATT
GCCACATTTA TGGCTGGTAT TATTCTTGTT TTATTTGGTT TATTACGATT TGGTAGCTTA
ATAAAATATA TACCTTATCC AATAACGGTA GGATTTACCT CTGGTATAGC TATAACTCTT
TTATCAACAC AAGTTAAGGA TTTTTTAGGA CTTTCAATTA CTAAAACCCC CTCTGAGTTT
ATACCTAAGT GGGAAGCTTA CATATCTCAT ATGAATACTA CAAACCTTTA TACCTTAGCT
ATAGGATTAC TAGCACTTAT TATTTTAATC TTTTGGCCAA AAATAAATAA AAAGATTCCA
GGATCTTTAA TAGCCTTAAT AGTAACAACT TTAGTAGTAT TTATATTTAA TCTACCAGTT
GCGACAATAG GAAGTCAATT TGGTAAAATA AGCTCAAATA TTCCAATACC TCATATTCCT
AATCTAAATC TTAATACATT AAAAGCATTA ATAGGACCTG CTTTTACAAT AGCTCTTTTA
GGTGGAATTG AATCTTTATT ATCTGCTGTT GTTTCAGATG GTATGATTGG AGACAAGCAT
AATTCAAATG CAGAACTTAT AGCACAAGGA TTAGCTAATA TGGGTTCTTC TTTATTTGGA
GGAATTCCTG CTACTGGAGC AATTGCTAGA ACTGCTGCCA ATGTTAAAAA CGGGGGAAGA
ACTCCTATTT CTGGTATGGT TCACTCAATA ACTTTATTAC TTATAATGCT TGTATTTATG
CCTCTTGCTA AATTCATTCC ATTAACTACT TTATCAGCAA TATTAATAAT TGTTTCATAT
AACATGAGTG AATGGAGAAC TTTTAAAGCA ATACTTAAGG CTCCTAAAAG TGATATAGCT
ATATTACTAA TAACATTTTT CTTGACAGTA TTATTTGATT TAGTAATTGC TATAGGGATA
GGAATGATAG TTTCTATGTG CTTATTTATA AGAAGAGTTG CTACTTCTAT AGAAGTAAAT
GAATTAAATG AAAGTGACTG TTCTTATAAA TCTAATATAG ATACTGATAT GGAAAATCTT
AAAGTTGGAG AAAATGTCTT AGTTTATGAT ATAAGAGGTC ACCTTTTCTT TGGTGCTGTA
GATACATTTA TGAATACAAT GAAGGAAATA AATGATGATG CAAAGGTTCT TGTTTTAAGA
ATGAGACATA CTAAGACTTT AGATGTTACA GGCTATAAAC AAATAAAAAA TATAGCTCTA
AGTTGTAAGT CTCGTAATAT GACTTTAATA ATATCTGAAT TACAAGAACA ACCAAAAAAA
GTTATGAGAC TTATGGGATT TATAGATACT TTAGGTGAAG ATCACTTTGC TACAAATTTT
GATGAGGCTT TAGAAAAAGC AAATTCTTTA ATTTAG
 
Protein sequence
MYKPKLISLL DDKESGFSKE QFLKDLIAGI IVAIIALPLS IALGISSGVS PEKGLITAII 
AGFIISLLGG SRVQIGGPTG AFVVIVFGII QNHGVDGLII ATFMAGIILV LFGLLRFGSL
IKYIPYPITV GFTSGIAITL LSTQVKDFLG LSITKTPSEF IPKWEAYISH MNTTNLYTLA
IGLLALIILI FWPKINKKIP GSLIALIVTT LVVFIFNLPV ATIGSQFGKI SSNIPIPHIP
NLNLNTLKAL IGPAFTIALL GGIESLLSAV VSDGMIGDKH NSNAELIAQG LANMGSSLFG
GIPATGAIAR TAANVKNGGR TPISGMVHSI TLLLIMLVFM PLAKFIPLTT LSAILIIVSY
NMSEWRTFKA ILKAPKSDIA ILLITFFLTV LFDLVIAIGI GMIVSMCLFI RRVATSIEVN
ELNESDCSYK SNIDTDMENL KVGENVLVYD IRGHLFFGAV DTFMNTMKEI NDDAKVLVLR
MRHTKTLDVT GYKQIKNIAL SCKSRNMTLI ISELQEQPKK VMRLMGFIDT LGEDHFATNF
DEALEKANSL I