Gene CPR_2601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2601 
Symbol 
ID4206087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2835774 
End bp2837126 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content30% 
IMG OID642567151 
Productputative cellobiose phosphotransferase enzyme IIC component 
Protein accessionYP_699848 
Protein GI110803837 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00359] phosphotransferase system, cellobiose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCTA TGGAGAAATT TCAATCTCAA ATAGAAAAGG TATTAGTGCC GTTAGCTAGT 
AAGTTAAATT CACAAAGGCA TATTTGTGCA GTAAGGGATT CATTTATATT AACATTTCCA
CTAACTATGG CAGGATCTTT AATGGTATTA CTTAACTTCG TTTTATTATC ACCAGATGGG
TTCATTTGTA AATTATTAAG GTTAAATAAA ATATTCCCTA ATATAGGTGA ATGTCAAGCT
ATATTTAGCC CAGTATTAAA AGGCTCAACA GATATATTAG CTATATTAAT TGTATTTTTA
ATTGCTAGAA ATTTGGCAAA ACAACTTAAA TCAGATGATT TATTATCAGG ATTAACAGCA
GTATCAGTTT ATTTTATAAT TTACTCAGAT TATGTAAATG TGGATGGCGT AAACTATTTA
ACTACAAAGT TTATGGGGGC ACAAGGATTA TTTGTAGCTA TAATTGTTGG GTTAGTAGTT
GGAGAACTTA TGTCAGTTTT ATCAAAGTCT AAAAGGTTAG AAATTAAGAT GCCAGAACAA
GTTCCACCAG CAGTAGCTAG AACATTTAAA TTATTATTAC CAATAGTTAT AATTACAGTT
TCATTTTCAA TATTAAATTT CTTTATTAAG AAATTTGCAC CAGGTGGATT ACATGAATTA
GTTTATACTG TAATTCAAAC TCCATTAACA CAATTAGGTC AAAATGTAGG ATCAGTATTA
ATATTAACTC TTATATCTCA ATCACTTTGG GTTATGGGAA TCCATGGTCC AAATACTATT
GCAGCAGTAC GTGATACTAT GTTTGCAGAG GCAACAAATG CAAATCTTTC ATATGCAGCA
GCAAATGGTA CTGCATGGGG AGCACCTTAT CCAGTAACAT TTAATGGATT ATATGATGCT
TTTGGAGCAT ATGGTGGTTC AGGAGCAACA TTAGGATTAA TAATTGCAAT ATTTATATTT
AGTAAAGCAA AAGAACAAAA AAGTATAGCA AAGCTTTCAT TTGCACCAGG ACTATTTAAT
ATAAATGAGA TGGTTATATT TGGATTACCT ATAGTATTAA ATCCTATATA TATAATACCA
TTTATATTAA CTCCATTAGT AAATATAACA ATTGGGTATT TAGCAACATC AGTTATGAGG
ATTATACCAC CAGTAGCATA TGGAGTGCCT TGGACAACAC CAGGACCATT AGCACCATTC
TTAGGAACTG GGGGAAATAT TATGGGATTA GTAATAGGAT TAATTTGTTT AGCAGTTAGC
GTATTTACTT ATGCACCATT TGTAATAGCT GCAAGTAAAG CTGAATTAAA AAATGAAGAA
GAAGCAACTT TAAATGAATT TAATAATGTT TAA
 
Protein sequence
MSAMEKFQSQ IEKVLVPLAS KLNSQRHICA VRDSFILTFP LTMAGSLMVL LNFVLLSPDG 
FICKLLRLNK IFPNIGECQA IFSPVLKGST DILAILIVFL IARNLAKQLK SDDLLSGLTA
VSVYFIIYSD YVNVDGVNYL TTKFMGAQGL FVAIIVGLVV GELMSVLSKS KRLEIKMPEQ
VPPAVARTFK LLLPIVIITV SFSILNFFIK KFAPGGLHEL VYTVIQTPLT QLGQNVGSVL
ILTLISQSLW VMGIHGPNTI AAVRDTMFAE ATNANLSYAA ANGTAWGAPY PVTFNGLYDA
FGAYGGSGAT LGLIIAIFIF SKAKEQKSIA KLSFAPGLFN INEMVIFGLP IVLNPIYIIP
FILTPLVNIT IGYLATSVMR IIPPVAYGVP WTTPGPLAPF LGTGGNIMGL VIGLICLAVS
VFTYAPFVIA ASKAELKNEE EATLNEFNNV