Gene Information |
Locus tag | CPR_0093 |
Symbol | nagE |
ID | 4205299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 115034 |
End bp | 116479 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642564645 |
Product | PTS system, N-acetylglucosamine-specific IIBC component |
Protein accession | YP_697431 |
Protein GI | 110802112 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific; [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component; [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component |
|
|
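The coordinate and length fields in the record above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 115034
END_BP = 116479


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1446
print(protein_length(length_bp))  # 481
```

Both results agree with the Gene Length (1446 bp) and Protein Length (481 aa) fields listed above.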
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA AAGGTTCAAA AGTTTTAGGC TTTCTTCAAA GAATAGGTAA ATCTTTAATG GTTCCTATAG CAGTAATGCC AGCTTTAGGA TTATTATTAA GACTTGGAGA TAAAGACTTA TTAAATATCC CTTGGATCAG CGCTGCTGGT GGAGCTGCCT TTGGAGATAA TATGGCAATG CTTTTTGCTG TAGGTATAGG ATTTGGACTT TCAGATGAAA ATAATGGAGT TGGAGGATTA GCAGGTCTTT TAGGATTTTT AGTTATGAAA AATGTTGCAA CTTCATTTGA TCCAAGTATA AATATGGGAG CTTTTGGTGG AGTTGTAGCC GGAGTTGTTG GAGGACTTTT ATATAATAAA TTTAAAGATA TTAAAGTTCC TCAATTTTTA GGATTCTTTG GTGGAAAAAG ATTTGTTCCA ATAATAACAT CAGCGGTATG TCTTATTTTA GGTGTATTCT TTGGATATAC TTGGCCAACA TTCCAAGCTG GATTAGACGG ATTTGCTAAT ATAATGGTAG CAGTAGGTGC TATTGGTGCT GGAATATATG GAATATTAAA TAGATTATTA ATACCAATTG GATTACACCA TGTAATGAAC ACAGTAATTT GGTTCCAATT AGGAAGTTTT ACAGATCCAG TTTCAGGTCA AATTGCAACT GGAGATATTG CAAGATTTTT AGCTGGAGAT CCAACAGCTG GGGTTTACAC AGCAGGCTTT TATCCAATAA TGATGTTTGG TTTACCAGCT GCATGTTTAG CAATGTATGT TTGTGCTAAG AAGAAAAACA AGGCAGTGGT TGGTGGAATG TTCTTATCAT TAGCATTAAC AGCTATAATA ACAGGTATTA CAGAACCAAT AGAATTTGCT TTTATGTTCT TATCACCAAT ACTTTATGTT ATACATGCAA TTTTAACAGG TATATCCTTA GCAGTGGCAT ATGCTCTTAA TGTACATCTA GCATTTAGTT TCTCAGGTGG ATTAATTGAC TATATTTTAT ATTTTGGAAA AGGCCAAAAT CAATTAATCA TATTATTAAT GGGGCTTGTT GCATTTGTAG TTTATTATTT CTTATTTATG TTCTTTATTA AGAAGTTTAA TCTTAAAACA CCTGGTAGAG AAGATGATTT TGATGATGAA AATGAAGACG TAGAAAATAA TTCAAAGACA GTACCAAAAT TAGAAGACAA TCCATCTAAG GGTGGTACTT TAGCTGAAAA GGCAGAAGTT GTTTTAGTAG CTCTTGGAGG AAAAGAAAAT ATTGAAGTTC TAGACAATTG TATAACAAGA TTAAGATTAA CTTTAAAAGA TGCTTCTAAA ATAGATGAAG TTACTTTAAA AAAGGCTGGA GCTAGTGGAA TAATGAAATT AGATGGAAAG AATGTTCAAG TAATTATGGG AACTTTAGCA GATCCTTTAG CTAGCCAAAT GAAAAAATTA CTTTAA
|
Protein sequence | MSKKGSKVLG FLQRIGKSLM VPIAVMPALG LLLRLGDKDL LNIPWISAAG GAAFGDNMAM LFAVGIGFGL SDENNGVGGL AGLLGFLVMK NVATSFDPSI NMGAFGGVVA GVVGGLLYNK FKDIKVPQFL GFFGGKRFVP IITSAVCLIL GVFFGYTWPT FQAGLDGFAN IMVAVGAIGA GIYGILNRLL IPIGLHHVMN TVIWFQLGSF TDPVSGQIAT GDIARFLAGD PTAGVYTAGF YPIMMFGLPA ACLAMYVCAK KKNKAVVGGM FLSLALTAII TGITEPIEFA FMFLSPILYV IHAILTGISL AVAYALNVHL AFSFSGGLID YILYFGKGQN QLIILLMGLV AFVVYYFLFM FFIKKFNLKT PGREDDFDDE NEDVENNSKT VPKLEDNPSK GGTLAEKAEV VLVALGGKEN IEVLDNCITR LRLTLKDASK IDEVTLKKAG ASGIMKLDGK NVQVIMGTLA DPLASQMKKL L
|
| |
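The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. Translation table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, so a standard codon table is enough here. The function name `translate` is a hypothetical helper. To keep the example short it is applied only to the first 30 bases and the last 15 bases of the gene sequence above, not to the full sequence.

```python
from itertools import product

# Standard codon table in TCAG order; elongation assignments are the
# same for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS fragment, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is
    ignored, and a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGAGTAAAA AAGGTTCAAA AGTTTTAGGC"))  # MSKKGSKVLG
# Last 15 bases of the gene sequence, ending in the TAA stop codon.
print(translate("AAAAAATTACTTTAA"))                    # KKLL
```

The two outputs match the first ten residues (MSKKGSKVLG) and the last four residues (KKLL) of the protein sequence listed above.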