Gene CPR_1513 details


Gene Information

Locus tag: CPR_1513
Symbol:
ID: 4204128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1693112
End bp: 1694464
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 31%
IMG OID: 642566066
Product: PTS system, sucrose-specific IIBC component
Protein accession: YP_698831
Protein GI: 110801553
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
            [TIGR01996] PTS system, sucrose-specific IIBC component
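
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1693112
end_bp = 1694464
gene_length_bp = 1353
protein_length_aa = 450

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, it divides evenly into codons, and the codon count minus the stop codon equals the listed protein length.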


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000189676
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAG AACAAATAGT TGCTCAAGAA ATATTAAAAA ATATTGGTGG AAAAGAAAAT 
ATAAAATCAA TGGAACACTG TGCAACTAGG TTAAGACTTA TAGTTAAAGA TAAAAATCTA
ATAAATGAAA AGGCAATTGA GAATATTGAT GGAGTAAGAG GACAATTTTT TGCTGCTGCT
CAATATCAAA TAATTTTAGG AACTGGGTTT GTAAATAAAG TCTTTGCAGC TATGAATGGT
GAAGGAGTAG AAACTGGGAA TGTAAAAGAA GATGCATATA GTGATATGAC TTTACCACAA
AAAATATCTC GTACTTTAGG AGATATTTTT GTTCCTATAA TTCCAGTATT AGTTGCAACA
GGTTTATTTA TGGGATTAAG AGGACTTTTA ACTAATTTAG GAGTTGAATT TAGTCCAACT
TTTAATACTT TATCAGAGGT TTTAACAGAT ACAGCATTTA TATTCTTACC AGCCTTAGTA
GCTTGGTCAA CAATGAAAAA ATTTGGTGGA ACACCTGTGG TTGGTATAGT ATTAGGGCTT
ATGCTTGTAG CCCCTCAGCT TCCAAATGCT TGGCAGGTAG CAGGAGGAGC AAACCCAATA
TATATTTCTT TATTAGGAAT AAGTATCCCT ATAGTAGGTT ACCAAGGATC AGTACTTCCA
GCTTTAGTAT TAGGTATTAT AGCAGCTAAA TTAGAAAAAT TTATTAGAAA ATTTATGCCA
GATGTACTAG ATTTAATATT TACTCCATTT TTAACTTTAT TAGTATCAAT GATTCTTGGA
CTTTTAGTAG TAGGTCCTAT AATGCATACA AGTGAAGTAT ATATATTAGA TTTATTTAAA
ATGTTTTTAA ACTTACCATT TGGAATAGGT GGAGCAATAA TTGGAGGAGT TCATCAAGTT
ATAGTTGTAA CTGGAGTTCA TCATATATTT AATGCTTTAG AAGTTGAATT AATTTCAAGT
ACAGGATTAA ATCCATTTAA TGCAGTAATA ACTGGGGCTA TAGTAGCTCA AGGAGCAGCA
GCTCTTGCAG TTGGATTTAA GACAAAAGAT AAGAAAAAAC GTTCACTATA TATTTCTTCA
GCAATACCAG CCTTTTTAGG AATTACAGAA GCAGCTATAT TTGGAGTAAA CTTAAGATTT
ATTAAACCAT TTATATTTGC ATGTATAGGA GGAGCTGCAT CAGGAATGTT TGCATCAATA
ATGAAATTAG CTGGAACTGG TATGGGAATA ACAGCTATAC CGGGAACACT TCTTTATATT
AATACAGGAT TAATTCAGTA CTTCATAACA GTTGCTATTG GATTTGCTAT TTCATTTGTA
TTAACATATA TATTCTTTAA ACCACAAGAA TAA
 
Protein sequence
MSKEQIVAQE ILKNIGGKEN IKSMEHCATR LRLIVKDKNL INEKAIENID GVRGQFFAAA 
QYQIILGTGF VNKVFAAMNG EGVETGNVKE DAYSDMTLPQ KISRTLGDIF VPIIPVLVAT
GLFMGLRGLL TNLGVEFSPT FNTLSEVLTD TAFIFLPALV AWSTMKKFGG TPVVGIVLGL
MLVAPQLPNA WQVAGGANPI YISLLGISIP IVGYQGSVLP ALVLGIIAAK LEKFIRKFMP
DVLDLIFTPF LTLLVSMILG LLVVGPIMHT SEVYILDLFK MFLNLPFGIG GAIIGGVHQV
IVVTGVHHIF NALEVELISS TGLNPFNAVI TGAIVAQGAA ALAVGFKTKD KKKRSLYISS
AIPAFLGITE AAIFGVNLRF IKPFIFACIG GAASGMFASI MKLAGTGMGI TAIPGTLLYI
NTGLIQYFIT VAIGFAISFV LTYIFFKPQE
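
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is only a sketch: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in which start codons it permits, not in how codons map to residues). Only the first and last 10-base groups of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases on the page."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three groups and final line of the gene sequence above.
first_block = "ATGAGTAAAG AACAAATAGT TGCTCAAGAA"
last_block = "TTAACATATA TATTCTTTAA ACCACAAGAA TAA"

print(translate(first_block))  # MSKEQIVAQE
print(translate(last_block))   # LTYIFFKPQE
```

The two printed strings match the first and last ten residues of the protein sequence. Passing the complete gene sequence to `gc_fraction` would give a value to compare with the GC content field.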