Gene CPR_0804 details

Gene Information

Locus tag: CPR_0804
Symbol:
ID: 4205472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 929751
End bp: 930731
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 36%
IMG OID: 642565363
Product: PTS system mannose/fructose/sorbose family IIAB subunit
Protein accession: YP_698129
Protein GI: 110802187
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2893] Phosphotransferase system, mannose/fructose-specific component IIA
        [COG3444] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB
TIGRFAM ID: [TIGR00824] PTS system, mannose/fructose/sorbose family, IIA component
            [TIGR00854] PTS system, mannose/fructose/sorbose family, IIB component
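
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, but both agree with its numbers. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(929751, 930731)
print(length)                  # 981
print(protein_length(length))  # 326
```

Both values match the Gene Length and Protein Length fields above.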


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAGGAA TTATTCTTGC TAGTCACGGA GAATTTGCTA AAGGAATATT ACAATCTGGC 
TCAATGATTT TTGGAGAACA GGAAAATGTA AAAGCTGTTA CATTGATGCC TAGCGAAGGA
CCAGATGATA TTAAAGCAAA GATGAAAGAA GCAATTGCTT CCTTTGATAA TCAAGATGAG
GTTTTATTCT TAGTTGACCT ATGGGGTGGA ACACCATTTA ATCAAGCAAA CAGCCTAGTA
GAAGAACATG CAGATAAATG GGCAATCGTA GCTGGAATGA ATTTACCGAT GGTTATTGAA
GCTTATGCTT CACGTTTTTC AATGGAATCA GCACAAGAAA TTGCTGTTAA TATTCTAAAG
TCAGCTAGAG ATGGAGTTAA AGTTAAGCCA GAATCATTAG AACCAGAAGA GGATACTAAA
ACAAATACAG ATTCTGCACA ACAATCTAAT AATGTAGGTG CACCTGGCTC TTTTGAATAC
GTTTTGGCAC GTATTGATTC ACGTTTACTT CATGGTCAAG TAGCTACTGC TTGGACAAAA
ACTGTAAAAC CAACAAGAAT TATTGTTGTA TCAGACGATG TAGCTAAAGA TGAACTTCGT
AAGAAATTGA TTCAACAAGC AGCTCCTCCT GGAGTTAAAG CACATACTGT TCCAGTTAGC
CAAATGATTA AGCTTGCAAA AGATGACCAA CACTTTGGAG GACAACGTGC GTTACTTCTT
TTTGAAAATC CAGAAGATGT ACTAAGAGCA GTAGAGGGAG GAGTACCTAT AAAGACAGTT
AATGTTGGTT CTATGGCTCA CTCTCCTGGA AAGGTTCAAC CAAATAAAGT ACTTGCTTTC
GATCAAGAAG ATATTGATAC TTTTAAGAAG CTTAAAGAAG CTGGATTGGA TTTCGATGTT
CGTAAAGTTC CAAATGATAC AAAAGGAAAT ATGGACGAAA TTCTTAAAAA GGCACAAGAG
GAATTAAATA AATTAAAGTA A
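
The GC content reported for this gene can be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing and line breaks shown here; `gc_percent` is a hypothetical helper, not part of any database tooling.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between sequence blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full sequence to check the whole gene.
first_line = "ATGGTAGGAA TTATTCTTGC TAGTCACGGA GAATTTGCTA AAGGAATATT ACAATCTGGC"
print(round(gc_percent(first_line), 1))
```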
 
Protein sequence
MVGIILASHG EFAKGILQSG SMIFGEQENV KAVTLMPSEG PDDIKAKMKE AIASFDNQDE 
VLFLVDLWGG TPFNQANSLV EEHADKWAIV AGMNLPMVIE AYASRFSMES AQEIAVNILK
SARDGVKVKP ESLEPEEDTK TNTDSAQQSN NVGAPGSFEY VLARIDSRLL HGQVATAWTK
TVKPTRIIVV SDDVAKDELR KKLIQQAAPP GVKAHTVPVS QMIKLAKDDQ HFGGQRALLL
FENPEDVLRA VEGGVPIKTV NVGSMAHSPG KVQPNKVLAF DQEDIDTFKK LKEAGLDFDV
RKVPNDTKGN MDEILKKAQE ELNKLK
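
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the method used to build this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; since this gene begins with ATG, no special handling of alternative start codons is included. `translate` and the constant names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last stretches of the gene sequence shown above.
print(translate("ATGGTAGGAA TTATTCTTGC TAGTCACGGA GAATTTGCTA AAGGAATATT ACAATCTGGC"))
# MVGIILASHGEFAKGILQSG
print(translate("GAATTAAATA AATTAAAGTA A"))
# ELNKLK
```

The two outputs correspond to the first 20 and last 6 residues of the protein sequence listed above.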