Gene CPR_2051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2051 
Symbol 
ID4206443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2272235 
End bp2273161 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content29% 
IMG OID642566601 
Productsugar transport system (permease) (binding protein dependent transporter) 
Protein accessionYP_699360 
Protein GI110802664 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4209] ABC-type polysaccharide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATA GAATAAAACG TTTTAGAAAC AATAAAGAAT TATTACTACT CACTATACCA 
GGGGCAATAT GGTTTTTAGT ATTTGCTTAC TTACCAATGT TTGGAGTAAT TGTAGCATTC
AAAAGATGGA GAATTCATGG AGGATTCTTT GAAAGTTTAA TGAATAGTAA ATGGGTTGGA
TTTGATAACT TTAAATTCTT ATTCAAAAGT AGTGATGCTT GGTTAATAAC TAAAAATACA
GTTTTATATA ATCTTGTATT TATAATTTTA GGAATAGTGC TTCCAGTAAC ATTAGCAATT
TTATTAAATG AATTATTAAA TAAAAAGTTA GCTAAGTTCT ATCAAAGTAG TATGTTCTTA
CCTTATTTCT TATCATGGGT TGTTGTAAGT TATTGTTTAT ATGCATTTTT AAGCCCAGAA
AAAGGATATG TTAATGGTAT TTTACAATCT ATGGGTGGAA AAGGAATTTC GTGGTATACA
GAGCCAAAAT ATTGGCCATT TATCATAATT TTTATGAGTC AATGGAAAGC AGTTGGATAC
GGAACCGTTG TTTATTTAGC TTCAATATGT GGTATAGATA AGAGTTATTA TGAAGCGGCA
ATGATAGATG GAGCAAGTAA GTTCCAACAA ATAAAATATA TAACAGTTCC ATTATTAAAA
CCTGTAATGA TCATAATGTT CATAACTTCA ATTGGTGGTA TGTTCAGAGG AGACTTAGGT
CTATTCTATC AATTACCAAA GGATTCAGGT GCTTTATATC CAGTAACAAA CGTAATAGAT
ACTTATGTAT ATAGAGGTCT TATGAACTTA GGAGATATTG GTATGAGTTC TGCAGCAAGT
TTATATCAAT CATTTGTAGG ATTAATACTT ATAGTAACTT CTAACGCTAT AGTAAGAAAA
GTAGATGAAG AAAACGCATT CTTCTAA
 
Protein sequence
MKDRIKRFRN NKELLLLTIP GAIWFLVFAY LPMFGVIVAF KRWRIHGGFF ESLMNSKWVG 
FDNFKFLFKS SDAWLITKNT VLYNLVFIIL GIVLPVTLAI LLNELLNKKL AKFYQSSMFL
PYFLSWVVVS YCLYAFLSPE KGYVNGILQS MGGKGISWYT EPKYWPFIII FMSQWKAVGY
GTVVYLASIC GIDKSYYEAA MIDGASKFQQ IKYITVPLLK PVMIIMFITS IGGMFRGDLG
LFYQLPKDSG ALYPVTNVID TYVYRGLMNL GDIGMSSAAS LYQSFVGLIL IVTSNAIVRK
VDEENAFF