Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0935 |
Symbol | nagE |
ID | 4204639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1068685 |
End bp | 1070127 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642565493 |
Product | PTS system, N-acetylglucosamine-specific IIBC component |
Protein accession | YP_698259 |
Protein GI | 110802564 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0848567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAT ATTTTCAGAA ATTAGGTAAA TCTTTAATGC TACCAGTTGC TGCGCTTCCA GTAGCAGGTA TATTAATGGG AATTGGATAT TGGTTAGACC CATCTGGATG GGGTGCTAAT AGTGTAGTAT CAGCATTCTT ATTAAAATCT GGTGGAGCTA TCATAGATAA CATGGCAATA TTATTTGCTA TTGGGGTAGC TATTGGTATG TCAGATGATA ATGATGGAGC AGCAGGTTTA GCAGGTCTAG TATCATGGCT TATGATAACT ACATTATTAT CTAGTGCAGT TGTTGCAATG ATGCAAGGAG TAGAAGTAGA AGCTGTAAAC CCAGCTTTTG GAAAGATACA AAACCAATTT ATAGGTATCG TAGCAGGTTT AATAGGAGCT GGTTGTTATA ATAGATTTAA AAACACTAAA TTACCAGATT TCTTAGGATT CTTTAGTGGA AGAAGATGTG TTGCTATAGT AACAGCATTT GCTTCAATAA TAGCCTCTTT AGTTTTATAC TTTGTATATC CTGTAATATA TAGCGCATTA GTTTCATTTG GGGAATCAAT AATAGCTATG GGACCAATAG GAGCTGGTGT TTATGGATTC TTCAATAGAT TATTAATACC TCTTGGTTTA CACCATGCTT TAAACTCAGT ATTCTGGTTT GACGTTGCTG GTATAAATGA TATAGGAAAC TTCTGGGCTG GAACAGGTAC TTTAGGACAA ACTGGTATGT ATATGTCAGG CTTCTTCCCA GTAATGATGT TTGGTTTACC TGCTGCTGCT TTTGCGATGT ATCGTTGTGC AAAAACTAAT AAAAAGAAGG CTGCTGCTGG TATATTATTA GCTGCTGCTT TATCATCATT CGTTACAGGA GTAACTGAAC CATTAGAATT TGCATTTATG TTCTTAGCAC CTGCTTTATA TGTAGCTCAT GCTTTATTAA CAGGAATATC TATGGCAGTT GTTGCTGCTT TACCAGTAAG AGCAGGATTT AACTTTAGTG CTGGTTTAGT AGATTGGATA TTAAGTTTTG CATCACCAAT GGCATTAAAC CCATTATACT TATTAGGTAT AGGTTTAGTA GTAGGAGTTA TATACTATGT AGTATTCGTA GTTATGATAA AGAAATTTGA CTTAAAGACT CCAGGTAGAG AAGAAGATGA TGATGATTCA AATGCAGTTT TAGCAAATAA TAACTACACT GAAGTTGCAA AAATAATCTT AGAAGGACTA GGTGGACCTT CTAACATTAC TTCAATTGAT AACTGTATAA CTAGATTAAG ATTACAAGTA AAAGATAATA CATTAGTTGA TGAGAAAAAA ATAAAATCAT CAGGTATTTC AGGAATTATA AGACCTGGAA AAACAAGTGT TCAAGTTATA GTTGGAACAC AAGTACAGTT TGTAGCTGAC GAATTTAAGA AGCTTTGTAA GTCTAAAAAT TAA
|
Protein sequence | MMKYFQKLGK SLMLPVAALP VAGILMGIGY WLDPSGWGAN SVVSAFLLKS GGAIIDNMAI LFAIGVAIGM SDDNDGAAGL AGLVSWLMIT TLLSSAVVAM MQGVEVEAVN PAFGKIQNQF IGIVAGLIGA GCYNRFKNTK LPDFLGFFSG RRCVAIVTAF ASIIASLVLY FVYPVIYSAL VSFGESIIAM GPIGAGVYGF FNRLLIPLGL HHALNSVFWF DVAGINDIGN FWAGTGTLGQ TGMYMSGFFP VMMFGLPAAA FAMYRCAKTN KKKAAAGILL AAALSSFVTG VTEPLEFAFM FLAPALYVAH ALLTGISMAV VAALPVRAGF NFSAGLVDWI LSFASPMALN PLYLLGIGLV VGVIYYVVFV VMIKKFDLKT PGREEDDDDS NAVLANNNYT EVAKIILEGL GGPSNITSID NCITRLRLQV KDNTLVDEKK IKSSGISGII RPGKTSVQVI VGTQVQFVAD EFKKLCKSKN
|
| |