Gene CPR_2124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2124 
SymbolptsG 
ID4205802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2353502 
End bp2355031 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content34% 
IMG OID642566674 
ProductPTS system, glucose-specific IIBC component 
Protein accessionYP_699433 
Protein GI110802584 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR02002] PTS system, glucose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAG CTTTTGGAGT TTTACAAAGA ATAGGTAAAG CTTTGATGTT ACCAGTTGCT 
TTATTACCAG CGGCAGGTTT ATTACTTGGA TTTGGATCAA TGTTTCAAGA TCCTCAGTTT
TTAAGCTTAG TGCCAGCATT AAATGCAGGG TGGTTTCAAT TAATAGCTAC AATAATGTCT
GATGCAGGTA ATATAATATT TTCAAACTTA GCATTATTAT TCGCAGTTGG GGTAGCTGTA
GGATTATCAG AAGGAGACGG TGTTGCAGGT CTTGCAGCAA TAGTTGGTTT TTTAATACTT
AATGTAACAA TGGGAATTGG TGGAGGAATA ACTGCCGACG TGGTTGCTAG TACTAAAAAT
GGTATGTATG CCACAGTTCT TGGAATACCA ACACTTCAAA CTGGAGTATT TGGTGGAATT
ATTATGGGAG TTATAGCCGC AGCTTTATAT AAAAGGTTCT ATAAGATAGA ATTACCTTCA
TACTTAGGAT TCTTTGCAGG TAAAAGATTT GTCCCAATAA TAACTGCTAT TTCTGCCTTA
GTGCTTGGAG GAATAATGGT ATTTGTATGG CCGCCAATAC AAGGTGCCTT ATTGTACTTC
TCACAAAATA TGATAAGTGC TAATCAAACC CTATCAGCTT TATTATTTGG AATAATAGAA
AGAGCATTAA TTCCATTTGG ATTACATCAT ATTTGGTATA ATCCATTCTG GTTCCAATTT
GGAGAATACA CAAATAAAGC AGGTCAATTA ATAATGGGAG ATAATCAAAT ATTTTTTGCC
CAATTAAGAG ATGGAGGTCC ATTTACAGCA GGTACCTTTA TGACTGGTAA GTTCCCGTTT
ATGATGTTTG GATTACCAGC AGCAGCCTTA GCAATGTATC ATGAAGCTAA ACCAGGTAAA
AAGAAAATAG CTTCAGGTAT TTTATTCTCA GCTGCCTTAA CTTCCTTCTT AACAGGAATT
ACTGAGCCTT TAGAGTTTGC ATTCTTATTT GTTGCCCCAG TATTATTTGT AATCCACTGC
GTATTAGCAG GATTATCCTT TATGATAATG CAATTATTAA ATGTTAAAAT AGGTATGACT
TTCTCAGGTG GACTTTTAGA CTTTATACTA TTAGGAGTAA TTCCAAATAG AACTAGATGG
TGGTTAGTAA TACTTGTAGG ATTAGGCTTT GGTGTAATTT ACTATTTCTT ATTTAGATTC
TTTATTAGAA AATTTGATCT TAAAACTCCA GGTAGAGAAG ATGATGACAT ATTTGATGAT
ATGGATAACC AAACTGTAAA TGATGACTTA GCTGCTGAAA TTTTAATAGC TTTAGGTGGT
GCAGGCAATA TAAATAAATT AGATGCTTGC ATAACTAGAT TAAGAGTTAC AGTTAACGAT
TCAAGCAAAG TTAATAAAGA AAGATTAAAA GAGCTTGGAG CAGCTGGTGT AATGCAAGTT
GGAGAAAATA TTCAAGCTAT ATTTGGAGGA AAATCAGATA TACTTAAAAC TCAAATTAGA
GAAATTATGA ATGGCATGGG AGAAAAGTAG
 
Protein sequence
MKKAFGVLQR IGKALMLPVA LLPAAGLLLG FGSMFQDPQF LSLVPALNAG WFQLIATIMS 
DAGNIIFSNL ALLFAVGVAV GLSEGDGVAG LAAIVGFLIL NVTMGIGGGI TADVVASTKN
GMYATVLGIP TLQTGVFGGI IMGVIAAALY KRFYKIELPS YLGFFAGKRF VPIITAISAL
VLGGIMVFVW PPIQGALLYF SQNMISANQT LSALLFGIIE RALIPFGLHH IWYNPFWFQF
GEYTNKAGQL IMGDNQIFFA QLRDGGPFTA GTFMTGKFPF MMFGLPAAAL AMYHEAKPGK
KKIASGILFS AALTSFLTGI TEPLEFAFLF VAPVLFVIHC VLAGLSFMIM QLLNVKIGMT
FSGGLLDFIL LGVIPNRTRW WLVILVGLGF GVIYYFLFRF FIRKFDLKTP GREDDDIFDD
MDNQTVNDDL AAEILIALGG AGNINKLDAC ITRLRVTVND SSKVNKERLK ELGAAGVMQV
GENIQAIFGG KSDILKTQIR EIMNGMGEK