Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_2601 |
Symbol | |
ID | 4206087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2835774 |
End bp | 2837126 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642567151 |
Product | putative cellobiose phosphotransferase enzyme IIC component |
Protein accession | YP_699848 |
Protein GI | 110803837 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1455] Phosphotransferase system cellobiose-specific component IIC |
TIGRFAM ID | [TIGR00359] phosphotransferase system, cellobiose specific, IIC component [TIGR00410] PTS system, lactose/cellobiose family IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCTA TGGAGAAATT TCAATCTCAA ATAGAAAAGG TATTAGTGCC GTTAGCTAGT AAGTTAAATT CACAAAGGCA TATTTGTGCA GTAAGGGATT CATTTATATT AACATTTCCA CTAACTATGG CAGGATCTTT AATGGTATTA CTTAACTTCG TTTTATTATC ACCAGATGGG TTCATTTGTA AATTATTAAG GTTAAATAAA ATATTCCCTA ATATAGGTGA ATGTCAAGCT ATATTTAGCC CAGTATTAAA AGGCTCAACA GATATATTAG CTATATTAAT TGTATTTTTA ATTGCTAGAA ATTTGGCAAA ACAACTTAAA TCAGATGATT TATTATCAGG ATTAACAGCA GTATCAGTTT ATTTTATAAT TTACTCAGAT TATGTAAATG TGGATGGCGT AAACTATTTA ACTACAAAGT TTATGGGGGC ACAAGGATTA TTTGTAGCTA TAATTGTTGG GTTAGTAGTT GGAGAACTTA TGTCAGTTTT ATCAAAGTCT AAAAGGTTAG AAATTAAGAT GCCAGAACAA GTTCCACCAG CAGTAGCTAG AACATTTAAA TTATTATTAC CAATAGTTAT AATTACAGTT TCATTTTCAA TATTAAATTT CTTTATTAAG AAATTTGCAC CAGGTGGATT ACATGAATTA GTTTATACTG TAATTCAAAC TCCATTAACA CAATTAGGTC AAAATGTAGG ATCAGTATTA ATATTAACTC TTATATCTCA ATCACTTTGG GTTATGGGAA TCCATGGTCC AAATACTATT GCAGCAGTAC GTGATACTAT GTTTGCAGAG GCAACAAATG CAAATCTTTC ATATGCAGCA GCAAATGGTA CTGCATGGGG AGCACCTTAT CCAGTAACAT TTAATGGATT ATATGATGCT TTTGGAGCAT ATGGTGGTTC AGGAGCAACA TTAGGATTAA TAATTGCAAT ATTTATATTT AGTAAAGCAA AAGAACAAAA AAGTATAGCA AAGCTTTCAT TTGCACCAGG ACTATTTAAT ATAAATGAGA TGGTTATATT TGGATTACCT ATAGTATTAA ATCCTATATA TATAATACCA TTTATATTAA CTCCATTAGT AAATATAACA ATTGGGTATT TAGCAACATC AGTTATGAGG ATTATACCAC CAGTAGCATA TGGAGTGCCT TGGACAACAC CAGGACCATT AGCACCATTC TTAGGAACTG GGGGAAATAT TATGGGATTA GTAATAGGAT TAATTTGTTT AGCAGTTAGC GTATTTACTT ATGCACCATT TGTAATAGCT GCAAGTAAAG CTGAATTAAA AAATGAAGAA GAAGCAACTT TAAATGAATT TAATAATGTT TAA
|
Protein sequence | MSAMEKFQSQ IEKVLVPLAS KLNSQRHICA VRDSFILTFP LTMAGSLMVL LNFVLLSPDG FICKLLRLNK IFPNIGECQA IFSPVLKGST DILAILIVFL IARNLAKQLK SDDLLSGLTA VSVYFIIYSD YVNVDGVNYL TTKFMGAQGL FVAIIVGLVV GELMSVLSKS KRLEIKMPEQ VPPAVARTFK LLLPIVIITV SFSILNFFIK KFAPGGLHEL VYTVIQTPLT QLGQNVGSVL ILTLISQSLW VMGIHGPNTI AAVRDTMFAE ATNANLSYAA ANGTAWGAPY PVTFNGLYDA FGAYGGSGAT LGLIIAIFIF SKAKEQKSIA KLSFAPGLFN INEMVIFGLP IVLNPIYIIP FILTPLVNIT IGYLATSVMR IIPPVAYGVP WTTPGPLAPF LGTGGNIMGL VIGLICLAVS VFTYAPFVIA ASKAELKNEE EATLNEFNNV
|
| |