Gene Information |
Locus tag | CPF_0071 |
Symbol | nagE |
ID | 4201497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 79744 |
End bp | 81189 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638080949 |
Product | PTS system, N-acetylglucosamine-specific IIBC component |
Protein accession | YP_694536 |
Protein GI | 110799623 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific; [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component; [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component |
| |
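The coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it reproduces the listed 1446 bp) and that the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(79744, 81189)
print(length)                     # 1446
print(protein_length_aa(length))  # 481
```

Both results agree with the Gene Length and Protein Length rows of the record.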
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA AAGGTTCAAA AGTTTTAGGC TTTCTTCAAA GAATAGGTAA ATCTTTAATG GTTCCTATAG CAGTAATGCC AGCTTTAGGA TTATTATTAA GACTTGGAGA TAAAGACTTA TTAAATATCC CTTGGATCAG CGCTGCTGGT GGAGCTGCCT TTGGAGATAA TATGGCAATG CTTTTTGCTG TAGGTATAGG ATTTGGACTT TCAGATGAAA ATAATGGAGT TGGAGGATTA GCAGGTCTTT TAGGATTTTT AGTTATGAAA AATGTTGCAA CTTCATTTGA TCCAAGTATA AATATGGGAG CTTTTGGTGG AGTTGTAGCC GGAGTTGTTG GAGGACTTTT ATATAATAAA TTTAAAGATA TTAAAGTTCC TCAATTTTTA GGATTCTTTG GTGGAAAAAG ATTTGTTCCA ATAATAACAT CAGCGGTATG TCTTATTTTA GGGGTATTCT TTGGATATAC TTGGCCAACA TTCCAAGCTG GATTAGACGG ATTTGCTAAT ATAATGGTAG CGGCAGGTGC TATTGGGGCT GGAATATATG GAATATTAAA TAGATTATTA ATACCAATTG GATTACACCA TGTAATGAAC ACAGTAATTT GGTTCCAATT AGGAAGCTTT ACAGATCCAG TTTCAGGTCA AATTGCAACT GGAGATATTG CAAGATTTTT AGCTGGAGAT CCAACAGCTG GGGTTTACAC AGCAGGCTTT TATCCAATAA TGATGTTTGG TTTACCAGCT GCATGTTTAG CAATGTATGT TTGTGCTAAG AAGAAAAACA AGGCAGTGGT TGGGGGAATG TTCTTATCAT TAGCATTAAC AGCTATAATA ACAGGTATTA CAGAACCAAT AGAATTTGCT TTTATGTTCT TATCACCAAT ACTTTATGTT ATACATGCAA TATTAACAGG TATATCATTA GCAGTGGCAT ATGCTCTTAA TGTACATCTA GCATTTAGTT TCTCAGGTGG ATTAATTGAC TATATTTTAT ACTTTGGAAA AGGTCAAAAT CAATTAATAA TATTATTAAT GGGACTTGTT GCATTTGTAG TTTATTATTT CTTATTTATG TTCTTTATTA AGAAGTTTAA TCTTAAAACA CCTGGTAGAG AAGATGATTT TGATGATGAA AATGAAGATA CAGAAACTAA TTCAAAAACA GTATCAAAAT CAGAAGACAA TCCATCTAAG GGTGGTACTT TAGCTGAAAA GGCAGAAGTT GTTTTAGAAG CTCTTGGAGG AAAAGAAAAT ATAGAAGTTC TAGACAACTG TATAACAAGA TTAAGATTAA CTTTAAAAGA TGCTTCTAAA ATAGATGAAG TTACTTTAAA AAAGGCTGGA GCTAGCGGAA TAATGAAATT AGATGGAAAG AATGTTCAAG TAATTATGGG AACTTTAGCA GATCCTTTAG CTAGCCAAAT GAAAAAATTA CTTTAA
|
Protein sequence | MSKKGSKVLG FLQRIGKSLM VPIAVMPALG LLLRLGDKDL LNIPWISAAG GAAFGDNMAM LFAVGIGFGL SDENNGVGGL AGLLGFLVMK NVATSFDPSI NMGAFGGVVA GVVGGLLYNK FKDIKVPQFL GFFGGKRFVP IITSAVCLIL GVFFGYTWPT FQAGLDGFAN IMVAAGAIGA GIYGILNRLL IPIGLHHVMN TVIWFQLGSF TDPVSGQIAT GDIARFLAGD PTAGVYTAGF YPIMMFGLPA ACLAMYVCAK KKNKAVVGGM FLSLALTAII TGITEPIEFA FMFLSPILYV IHAILTGISL AVAYALNVHL AFSFSGGLID YILYFGKGQN QLIILLMGLV AFVVYYFLFM FFIKKFNLKT PGREDDFDDE NEDTETNSKT VSKSEDNPSK GGTLAEKAEV VLEALGGKEN IEVLDNCITR LRLTLKDASK IDEVTLKKAG ASGIMKLDGK NVQVIMGTLA DPLASQMKKL L
|
| |
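The GC content and protein sequence in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch in plain Python, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons; since this gene begins with ATG, no special start-codon handling is included. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; the sense-codon assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between the 10-letter display groups."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display groups of the gene sequence in this record.
first_groups = "ATGAGTAAAA AAGGTTCAAA AGTTTTAGGC"
print(translate(first_groups))  # MSKKGSKVLG
```

The translated fragment matches the first display group of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` gives values to compare against the GC content and Protein sequence rows.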