Gene CPR_0935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0935 
SymbolnagE 
ID4204639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1068685 
End bp1070127 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content34% 
IMG OID642565493 
ProductPTS system, N-acetylglucosamine-specific IIBC component 
Protein accessionYP_698259 
Protein GI110802564 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0848567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAT ATTTTCAGAA ATTAGGTAAA TCTTTAATGC TACCAGTTGC TGCGCTTCCA 
GTAGCAGGTA TATTAATGGG AATTGGATAT TGGTTAGACC CATCTGGATG GGGTGCTAAT
AGTGTAGTAT CAGCATTCTT ATTAAAATCT GGTGGAGCTA TCATAGATAA CATGGCAATA
TTATTTGCTA TTGGGGTAGC TATTGGTATG TCAGATGATA ATGATGGAGC AGCAGGTTTA
GCAGGTCTAG TATCATGGCT TATGATAACT ACATTATTAT CTAGTGCAGT TGTTGCAATG
ATGCAAGGAG TAGAAGTAGA AGCTGTAAAC CCAGCTTTTG GAAAGATACA AAACCAATTT
ATAGGTATCG TAGCAGGTTT AATAGGAGCT GGTTGTTATA ATAGATTTAA AAACACTAAA
TTACCAGATT TCTTAGGATT CTTTAGTGGA AGAAGATGTG TTGCTATAGT AACAGCATTT
GCTTCAATAA TAGCCTCTTT AGTTTTATAC TTTGTATATC CTGTAATATA TAGCGCATTA
GTTTCATTTG GGGAATCAAT AATAGCTATG GGACCAATAG GAGCTGGTGT TTATGGATTC
TTCAATAGAT TATTAATACC TCTTGGTTTA CACCATGCTT TAAACTCAGT ATTCTGGTTT
GACGTTGCTG GTATAAATGA TATAGGAAAC TTCTGGGCTG GAACAGGTAC TTTAGGACAA
ACTGGTATGT ATATGTCAGG CTTCTTCCCA GTAATGATGT TTGGTTTACC TGCTGCTGCT
TTTGCGATGT ATCGTTGTGC AAAAACTAAT AAAAAGAAGG CTGCTGCTGG TATATTATTA
GCTGCTGCTT TATCATCATT CGTTACAGGA GTAACTGAAC CATTAGAATT TGCATTTATG
TTCTTAGCAC CTGCTTTATA TGTAGCTCAT GCTTTATTAA CAGGAATATC TATGGCAGTT
GTTGCTGCTT TACCAGTAAG AGCAGGATTT AACTTTAGTG CTGGTTTAGT AGATTGGATA
TTAAGTTTTG CATCACCAAT GGCATTAAAC CCATTATACT TATTAGGTAT AGGTTTAGTA
GTAGGAGTTA TATACTATGT AGTATTCGTA GTTATGATAA AGAAATTTGA CTTAAAGACT
CCAGGTAGAG AAGAAGATGA TGATGATTCA AATGCAGTTT TAGCAAATAA TAACTACACT
GAAGTTGCAA AAATAATCTT AGAAGGACTA GGTGGACCTT CTAACATTAC TTCAATTGAT
AACTGTATAA CTAGATTAAG ATTACAAGTA AAAGATAATA CATTAGTTGA TGAGAAAAAA
ATAAAATCAT CAGGTATTTC AGGAATTATA AGACCTGGAA AAACAAGTGT TCAAGTTATA
GTTGGAACAC AAGTACAGTT TGTAGCTGAC GAATTTAAGA AGCTTTGTAA GTCTAAAAAT
TAA
 
Protein sequence
MMKYFQKLGK SLMLPVAALP VAGILMGIGY WLDPSGWGAN SVVSAFLLKS GGAIIDNMAI 
LFAIGVAIGM SDDNDGAAGL AGLVSWLMIT TLLSSAVVAM MQGVEVEAVN PAFGKIQNQF
IGIVAGLIGA GCYNRFKNTK LPDFLGFFSG RRCVAIVTAF ASIIASLVLY FVYPVIYSAL
VSFGESIIAM GPIGAGVYGF FNRLLIPLGL HHALNSVFWF DVAGINDIGN FWAGTGTLGQ
TGMYMSGFFP VMMFGLPAAA FAMYRCAKTN KKKAAAGILL AAALSSFVTG VTEPLEFAFM
FLAPALYVAH ALLTGISMAV VAALPVRAGF NFSAGLVDWI LSFASPMALN PLYLLGIGLV
VGVIYYVVFV VMIKKFDLKT PGREEDDDDS NAVLANNNYT EVAKIILEGL GGPSNITSID
NCITRLRLQV KDNTLVDEKK IKSSGISGII RPGKTSVQVI VGTQVQFVAD EFKKLCKSKN