Gene CPR_0093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0093 
SymbolnagE 
ID4205299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp115034 
End bp116479 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content32% 
IMG OID642564645 
ProductPTS system, N-acetylglucosamine-specific IIBC component 
Protein accessionYP_697431 
Protein GI110802112 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA AAGGTTCAAA AGTTTTAGGC TTTCTTCAAA GAATAGGTAA ATCTTTAATG 
GTTCCTATAG CAGTAATGCC AGCTTTAGGA TTATTATTAA GACTTGGAGA TAAAGACTTA
TTAAATATCC CTTGGATCAG CGCTGCTGGT GGAGCTGCCT TTGGAGATAA TATGGCAATG
CTTTTTGCTG TAGGTATAGG ATTTGGACTT TCAGATGAAA ATAATGGAGT TGGAGGATTA
GCAGGTCTTT TAGGATTTTT AGTTATGAAA AATGTTGCAA CTTCATTTGA TCCAAGTATA
AATATGGGAG CTTTTGGTGG AGTTGTAGCC GGAGTTGTTG GAGGACTTTT ATATAATAAA
TTTAAAGATA TTAAAGTTCC TCAATTTTTA GGATTCTTTG GTGGAAAAAG ATTTGTTCCA
ATAATAACAT CAGCGGTATG TCTTATTTTA GGTGTATTCT TTGGATATAC TTGGCCAACA
TTCCAAGCTG GATTAGACGG ATTTGCTAAT ATAATGGTAG CAGTAGGTGC TATTGGTGCT
GGAATATATG GAATATTAAA TAGATTATTA ATACCAATTG GATTACACCA TGTAATGAAC
ACAGTAATTT GGTTCCAATT AGGAAGTTTT ACAGATCCAG TTTCAGGTCA AATTGCAACT
GGAGATATTG CAAGATTTTT AGCTGGAGAT CCAACAGCTG GGGTTTACAC AGCAGGCTTT
TATCCAATAA TGATGTTTGG TTTACCAGCT GCATGTTTAG CAATGTATGT TTGTGCTAAG
AAGAAAAACA AGGCAGTGGT TGGTGGAATG TTCTTATCAT TAGCATTAAC AGCTATAATA
ACAGGTATTA CAGAACCAAT AGAATTTGCT TTTATGTTCT TATCACCAAT ACTTTATGTT
ATACATGCAA TTTTAACAGG TATATCCTTA GCAGTGGCAT ATGCTCTTAA TGTACATCTA
GCATTTAGTT TCTCAGGTGG ATTAATTGAC TATATTTTAT ATTTTGGAAA AGGCCAAAAT
CAATTAATCA TATTATTAAT GGGGCTTGTT GCATTTGTAG TTTATTATTT CTTATTTATG
TTCTTTATTA AGAAGTTTAA TCTTAAAACA CCTGGTAGAG AAGATGATTT TGATGATGAA
AATGAAGACG TAGAAAATAA TTCAAAGACA GTACCAAAAT TAGAAGACAA TCCATCTAAG
GGTGGTACTT TAGCTGAAAA GGCAGAAGTT GTTTTAGTAG CTCTTGGAGG AAAAGAAAAT
ATTGAAGTTC TAGACAATTG TATAACAAGA TTAAGATTAA CTTTAAAAGA TGCTTCTAAA
ATAGATGAAG TTACTTTAAA AAAGGCTGGA GCTAGTGGAA TAATGAAATT AGATGGAAAG
AATGTTCAAG TAATTATGGG AACTTTAGCA GATCCTTTAG CTAGCCAAAT GAAAAAATTA
CTTTAA
 
Protein sequence
MSKKGSKVLG FLQRIGKSLM VPIAVMPALG LLLRLGDKDL LNIPWISAAG GAAFGDNMAM 
LFAVGIGFGL SDENNGVGGL AGLLGFLVMK NVATSFDPSI NMGAFGGVVA GVVGGLLYNK
FKDIKVPQFL GFFGGKRFVP IITSAVCLIL GVFFGYTWPT FQAGLDGFAN IMVAVGAIGA
GIYGILNRLL IPIGLHHVMN TVIWFQLGSF TDPVSGQIAT GDIARFLAGD PTAGVYTAGF
YPIMMFGLPA ACLAMYVCAK KKNKAVVGGM FLSLALTAII TGITEPIEFA FMFLSPILYV
IHAILTGISL AVAYALNVHL AFSFSGGLID YILYFGKGQN QLIILLMGLV AFVVYYFLFM
FFIKKFNLKT PGREDDFDDE NEDVENNSKT VPKLEDNPSK GGTLAEKAEV VLVALGGKEN
IEVLDNCITR LRLTLKDASK IDEVTLKKAG ASGIMKLDGK NVQVIMGTLA DPLASQMKKL
L