Gene CPF_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2666 
SymbolptsI 
ID4202338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2938462 
End bp2940081 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content34% 
IMG OID638083532 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_697046 
Protein GI110799494 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG GTATTGCTGC TTCAAAAGGA TATGCAATAG GAACTGTATT TATACAAGAA 
CATGAGGAAA TAATAATATC TGATGCGAAG GTTTCAGATA TAGCAGCTGA AAAAGAAAAA
TTATCTAAAG CTTTAGCTCA ATCAAAAGAG CAATTAGAAG CAATAAAAGA AAAAACAGCT
AATGAAATTG GAGAACATGA AGCTCAAGTT TTTGAAGCTC ATTTAATGTT ATTAGATGAT
GTAGAGTTTA CTGGTCAAAT GGAAATGACT ATAGAAAATG ACCAATTAAA CGCTATGAAA
GCTGTTCAAA ATGTTACAGA TACTTTCGTT ATGATATTTG ATTCTATGGA TGACCCTTAC
ATGAGAGAAA GGGCAGCAGA TATAAAAGAC GTTTCTAAGA GAATAATAGC TAACCTAGCT
GGCAAGGGTG GAAACGGAAT GGAAAATGTA GGAGCTAACA CTGTTGTTGT AGCTCATGAC
TTAACACCTT CAGATACTGC TCAATTAGAT AGAAGTAAAG TTATTGGTTT CTTAACTAAT
ATCGGGGGAA GAACTTCTCA CTCAGCTATA ATGGCTAGAA CTTTAGAAAT ACCAGCTGTT
GTTGGATTAG GTAATATAAC AACTTCAGTT AAAAATGGGG ACACTGTAAT AGTTGATGGT
ATTGAGGGTG TAGCTATAAT CAACCCAGAT GAAGCTACTA TAAACGAATA TAAAGCTAGA
TTAGAAAAAT TTAAAGCAGA ACAAGAAGAA TTAAAGAAGT TAATAGATGT TAAAACAACT
ACTAAATCAG GTAGAAGAAT AGAGGTTTGC GGAAACATAG GTAAACCAGA AGATATAGAT
CAAGTTTTAG CAAATGGTGG AGACGGAGTT GGACTATTTA GAACTGAGTT CTTATACATG
GACAGAGATG AAGCTCCAAC TGAAGATGAA CAATTTGAAG CATACAAATA TGTTTTAGAA
AAAGCAGATG GTAAGCAAGT TGTTATCAGA ACATTAGATA TCGGTGGAGA TAAAACTCTT
CCATACTTAC CATTACCAGA AGAGATGAAT CCATTCTTAG GATACAGAGC TATAAGATTA
TGCTTAGACA GAAAAGATAT CTTTAGAGTT CAAATAAGAG CTTTATTAAG AGCTTCTGTT
TATGGAAATC TTGCAGTAAT GTTCCCAATG ATTTCAGGAT TAGAAGAATT CCAACAAGCT
AAAGCATTTG TTGAAGAATG CAAAGCTGAG TTAAAAGCAG AAGGTATAGC ATACTCAGAT
TCAATTCAAT GGGGTATCAT GGTTGAAATC CCAGCTGCAG CAGTTTATGC TGATGAATTA
GCTAAGCATG TTGATTTCTT CTCAATAGGA ACTAACGATT TAATACAATA TACATTAGCT
GCTGACAGAA TGAGTGAAAA GGTATCATAC CTTTACAATC CAATGCATCC AGCTGTATTA
AGATTAATCA AAATGACAAT AGATGGAGCT CACAAACATG GTAAGTGGGT AGGAATGTGT
GGAGAGATGG CAGGAGATGA AAGAGCTATA CCAACATTAG TTGAATATGG TTTAGATGAA
TTCTCAATGA GTGCTACATC AATCCTAACT GCTAAGAAAA TAATAATGGA ACAAGAATAG
 
Protein sequence
MKKGIAASKG YAIGTVFIQE HEEIIISDAK VSDIAAEKEK LSKALAQSKE QLEAIKEKTA 
NEIGEHEAQV FEAHLMLLDD VEFTGQMEMT IENDQLNAMK AVQNVTDTFV MIFDSMDDPY
MRERAADIKD VSKRIIANLA GKGGNGMENV GANTVVVAHD LTPSDTAQLD RSKVIGFLTN
IGGRTSHSAI MARTLEIPAV VGLGNITTSV KNGDTVIVDG IEGVAIINPD EATINEYKAR
LEKFKAEQEE LKKLIDVKTT TKSGRRIEVC GNIGKPEDID QVLANGGDGV GLFRTEFLYM
DRDEAPTEDE QFEAYKYVLE KADGKQVVIR TLDIGGDKTL PYLPLPEEMN PFLGYRAIRL
CLDRKDIFRV QIRALLRASV YGNLAVMFPM ISGLEEFQQA KAFVEECKAE LKAEGIAYSD
SIQWGIMVEI PAAAVYADEL AKHVDFFSIG TNDLIQYTLA ADRMSEKVSY LYNPMHPAVL
RLIKMTIDGA HKHGKWVGMC GEMAGDERAI PTLVEYGLDE FSMSATSILT AKKIIMEQE