Gene CPR_1341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1341 
Symbol 
ID4204632 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1512448 
End bp1513512 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content35% 
IMG OID642565895 
Productd-galactose-binding periplasmic protein precursor 
Protein accessionYP_698661 
Protein GI110802557 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.93689e-05 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AAGGTTTAGC ACTAATTTTA ATTTCAGCTT TAACTATGGG TACTTTAGTT 
GGTTGTGGTG GAGGTTCAAG CTCTACTGGA TCATCTGGTG ATACGCCAAA AGAAAACGAT
TCACCTAAGA TAGGAGCTAC AATTTATAGT TTTGAAGATA ACTTCATGTC ATACCAACGT
AGAAATATAG AAAAACTTTG TAATGGAAAA GCCGAGTTAT TAATGAATGA CTCTCAAAAT
AATCAATCTA AGCAAATAGA ACAAGTAGAC ACTATGATAG CTAAGGGTGT TGATATTCTT
GCAATAAACT TAGTTGACCC TAAATCTGCT CCTACTGTAA TAGATAAAGC TAAGGCAGAC
AACTTACCAG TGGTATTCTT TAATAAGGAA CCAGACGAAG CTGTTATGCA AAGCTATGAT
AAAGCTTGGT ATGTTGGTAC AACCTCTGAA GAATCTGGAA TAATCCAAGG GGAAGTAATG
GTAGAAGGTT GGAAAGCTAA TCCTGCTTGG GACAAAAATG GTGATGGAAA AATACAATAT
GTAATGCTTA AAGGAGAACC TGGTCACCCT GATGCAGAAG CTCGTACAAA ATATTCTGTT
GACACAATAA ATAAAGCTGG TATAGAAACT GAGGAGTTAG CAATGGATAC AGCTATGTGG
GACTCAACTA AAGCTACTGA AAAGATGGAT GCTTGGATTG CTAAAAATGG TGACAATATA
GAAATGGTAA TCTGTAATAA TGACGGAATG GCTTTAGGTG CTATTTCTTC TCTTGAAAAA
GCAGGATATT TAGATGGAAC TCCTGAAAAG TTTGTTCCAA TATATGGTGT TGATGCTATT
CCTGAAGCTT TAGATAAAAT CAAAGCTGGT AAAATGGCTG GTACAGTATT AAACGATGCC
AAGAATCAAG CTCAAGCTCT AGTAGATTCT TGTATGAATT TAGTAAATGG AAAAGAGATA
AATGAAGGAA CTAATTGGAA ACTTGATAAT AAAAAATCAA TCCGTGTTCC ATATGTAGGA
ATAACTAAAG ACAATATAAA CGTTGCTGAG GATTCATATA AATAA
 
Protein sequence
MKKKGLALIL ISALTMGTLV GCGGGSSSTG SSGDTPKEND SPKIGATIYS FEDNFMSYQR 
RNIEKLCNGK AELLMNDSQN NQSKQIEQVD TMIAKGVDIL AINLVDPKSA PTVIDKAKAD
NLPVVFFNKE PDEAVMQSYD KAWYVGTTSE ESGIIQGEVM VEGWKANPAW DKNGDGKIQY
VMLKGEPGHP DAEARTKYSV DTINKAGIET EELAMDTAMW DSTKATEKMD AWIAKNGDNI
EMVICNNDGM ALGAISSLEK AGYLDGTPEK FVPIYGVDAI PEALDKIKAG KMAGTVLNDA
KNQAQALVDS CMNLVNGKEI NEGTNWKLDN KKSIRVPYVG ITKDNINVAE DSYK