Gene CPF_0071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0071 
SymbolnagE 
ID4201497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp79744 
End bp81189 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content33% 
IMG OID638080949 
ProductPTS system, N-acetylglucosamine-specific IIBC component 
Protein accessionYP_694536 
Protein GI110799623 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA AAGGTTCAAA AGTTTTAGGC TTTCTTCAAA GAATAGGTAA ATCTTTAATG 
GTTCCTATAG CAGTAATGCC AGCTTTAGGA TTATTATTAA GACTTGGAGA TAAAGACTTA
TTAAATATCC CTTGGATCAG CGCTGCTGGT GGAGCTGCCT TTGGAGATAA TATGGCAATG
CTTTTTGCTG TAGGTATAGG ATTTGGACTT TCAGATGAAA ATAATGGAGT TGGAGGATTA
GCAGGTCTTT TAGGATTTTT AGTTATGAAA AATGTTGCAA CTTCATTTGA TCCAAGTATA
AATATGGGAG CTTTTGGTGG AGTTGTAGCC GGAGTTGTTG GAGGACTTTT ATATAATAAA
TTTAAAGATA TTAAAGTTCC TCAATTTTTA GGATTCTTTG GTGGAAAAAG ATTTGTTCCA
ATAATAACAT CAGCGGTATG TCTTATTTTA GGGGTATTCT TTGGATATAC TTGGCCAACA
TTCCAAGCTG GATTAGACGG ATTTGCTAAT ATAATGGTAG CGGCAGGTGC TATTGGGGCT
GGAATATATG GAATATTAAA TAGATTATTA ATACCAATTG GATTACACCA TGTAATGAAC
ACAGTAATTT GGTTCCAATT AGGAAGCTTT ACAGATCCAG TTTCAGGTCA AATTGCAACT
GGAGATATTG CAAGATTTTT AGCTGGAGAT CCAACAGCTG GGGTTTACAC AGCAGGCTTT
TATCCAATAA TGATGTTTGG TTTACCAGCT GCATGTTTAG CAATGTATGT TTGTGCTAAG
AAGAAAAACA AGGCAGTGGT TGGGGGAATG TTCTTATCAT TAGCATTAAC AGCTATAATA
ACAGGTATTA CAGAACCAAT AGAATTTGCT TTTATGTTCT TATCACCAAT ACTTTATGTT
ATACATGCAA TATTAACAGG TATATCATTA GCAGTGGCAT ATGCTCTTAA TGTACATCTA
GCATTTAGTT TCTCAGGTGG ATTAATTGAC TATATTTTAT ACTTTGGAAA AGGTCAAAAT
CAATTAATAA TATTATTAAT GGGACTTGTT GCATTTGTAG TTTATTATTT CTTATTTATG
TTCTTTATTA AGAAGTTTAA TCTTAAAACA CCTGGTAGAG AAGATGATTT TGATGATGAA
AATGAAGATA CAGAAACTAA TTCAAAAACA GTATCAAAAT CAGAAGACAA TCCATCTAAG
GGTGGTACTT TAGCTGAAAA GGCAGAAGTT GTTTTAGAAG CTCTTGGAGG AAAAGAAAAT
ATAGAAGTTC TAGACAACTG TATAACAAGA TTAAGATTAA CTTTAAAAGA TGCTTCTAAA
ATAGATGAAG TTACTTTAAA AAAGGCTGGA GCTAGCGGAA TAATGAAATT AGATGGAAAG
AATGTTCAAG TAATTATGGG AACTTTAGCA GATCCTTTAG CTAGCCAAAT GAAAAAATTA
CTTTAA
 
Protein sequence
MSKKGSKVLG FLQRIGKSLM VPIAVMPALG LLLRLGDKDL LNIPWISAAG GAAFGDNMAM 
LFAVGIGFGL SDENNGVGGL AGLLGFLVMK NVATSFDPSI NMGAFGGVVA GVVGGLLYNK
FKDIKVPQFL GFFGGKRFVP IITSAVCLIL GVFFGYTWPT FQAGLDGFAN IMVAAGAIGA
GIYGILNRLL IPIGLHHVMN TVIWFQLGSF TDPVSGQIAT GDIARFLAGD PTAGVYTAGF
YPIMMFGLPA ACLAMYVCAK KKNKAVVGGM FLSLALTAII TGITEPIEFA FMFLSPILYV
IHAILTGISL AVAYALNVHL AFSFSGGLID YILYFGKGQN QLIILLMGLV AFVVYYFLFM
FFIKKFNLKT PGREDDFDDE NEDTETNSKT VSKSEDNPSK GGTLAEKAEV VLEALGGKEN
IEVLDNCITR LRLTLKDASK IDEVTLKKAG ASGIMKLDGK NVQVIMGTLA DPLASQMKKL
L