Gene CPF_1731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1731 
Symbol 
ID4201928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1952947 
End bp1954602 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content30% 
IMG OID638082603 
ProductSulP family sulfate permease 
Protein accessionYP_696167 
Protein GI110801327 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.281721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAAAC CAAAGTTAAT TTCTCTTTTA GATGATAAAG AGAACGGATT TTCAAAAGAA 
CAATTTTTTA AAGATTTAAT CGCTGGGATA ATAGTTGCTA TTATAGCACT TCCCCTATCT
ATTGCATTAG GTATTTCTTC AGGGGTGTCT CCTGAGAAAG GATTAATAAC TGCAATCATA
GCTGGATTTA TAATTTCATT ATTAGGAGGA AGTAGAGTTC AAATTGGTGG GCCTACTGGT
GCCTTTGTTG TTATAGTATT TGGTATTATA CAAAATCATG GAGTTGATGG ACTAATAATT
GCCACATTTA TGGCTGGTAT TATTCTTGTT TTATTTGGTT TATTACGATT TGGTAGCTTA
ATAAAATACA TACCTTATCC AATAACGGTA GGATTTACTT CTGGTATAGC TGTAACTCTT
TTCTCAACAC AGGTTAAGGA TTTTTTAGGA CTTTCAATGA CTAAAACCCC TTCTGAGTTT
ATACCTAAGT GGGAAGCTTA CATATCTCAT ATGAACACTA CAAACCTTTA TACATTAGCT
ATAGGATTAC TAGCACTTAT TATTTTAATC TTTTGGCCAA AAATAAATAA AAAGATTCCA
GGATCTTTAA TAGCTTTAAT AGTAACAACT TTAGTAGTAT TTATATTTAA TTTACCAGTT
GCAACAATAG GAAGTCAATT TGGTAAAATA AGCTCAAATA TTCCAATGCC TCATATTCCT
AACCTAAATC TTAATACATT AAAAGCATTA ATCGGACCTG CTTTTACAAT AGCGCTTTTA
GGTGGAATTG AATCTTTATT ATCTGCTGTT GTTTCAGATG GTATGATTGG AGACAAGCAT
AATTCAAATG CAGAACTTAT AGCACAAGGA ATAGCTAATA TGGGTTCTTC TCTATTTGGA
GGAATTCCTG CTACTGGAGC AATTGCTAGA ACTGCTGCCA ATGTTAAAAA TGGGGGAAGA
ACTCCTATTT CGGGTATAGT TCATTCAATA ACTTTATTAC TTATAATGCT TGTATTTATG
CCTCTTGCTA AATTCATTCC ATTAACTACT TTATCAGCAA TATTAATAAT TGTTTCATAT
AACATGAGTG AATGGAGAAC TTTTAAAGCA ATACTTAAGG CTCCTAAAAG TGATATAGCT
ATATTACTAA CAACATTTTT CTTAACAGTA TTATTTGATT TAGTAATTGC TATAGGAATA
GGAATGGTAG TTTCTATGTG CTTATTTATG AGAAGAGTTG CTACTTCTAT AGAAGTAAAT
GAATTAAATG AAAGTGACTG TTCTGATAAA TCTAATATAG ATACTGATAT GGAAAATCTT
AAGGTTGGAG AAAATGTCTT AGTCTATGAT ATAAGAGGTC ACCTTTTCTT TGGTGCTGTA
GATACATTTA TGAATACAAT GAAGGAAATA AATGATGATG CAAAGGTTCT TGTTTTAAGA
ATGAGACATA CTAAGACTTT AGATGTTACA GGATATAAAC AAATAAAAAA TATAGCTCTA
AGTTGTAAGT CTCGTAATAT GACTTTAATA ATATCTGAAT TACAAGAACA GCCAAAAAAA
GTTATGAGAC TTATGGGATT TATAGATACT TTAGGTGAAG ATCACTTTGC TACAAATTTT
GATGAAGCTT TAGAAAAAGC AAATTCTTTA ATTTAA
 
Protein sequence
MYKPKLISLL DDKENGFSKE QFFKDLIAGI IVAIIALPLS IALGISSGVS PEKGLITAII 
AGFIISLLGG SRVQIGGPTG AFVVIVFGII QNHGVDGLII ATFMAGIILV LFGLLRFGSL
IKYIPYPITV GFTSGIAVTL FSTQVKDFLG LSMTKTPSEF IPKWEAYISH MNTTNLYTLA
IGLLALIILI FWPKINKKIP GSLIALIVTT LVVFIFNLPV ATIGSQFGKI SSNIPMPHIP
NLNLNTLKAL IGPAFTIALL GGIESLLSAV VSDGMIGDKH NSNAELIAQG IANMGSSLFG
GIPATGAIAR TAANVKNGGR TPISGIVHSI TLLLIMLVFM PLAKFIPLTT LSAILIIVSY
NMSEWRTFKA ILKAPKSDIA ILLTTFFLTV LFDLVIAIGI GMVVSMCLFM RRVATSIEVN
ELNESDCSDK SNIDTDMENL KVGENVLVYD IRGHLFFGAV DTFMNTMKEI NDDAKVLVLR
MRHTKTLDVT GYKQIKNIAL SCKSRNMTLI ISELQEQPKK VMRLMGFIDT LGEDHFATNF
DEALEKANSL I