Gene Information |
Locus tag | CPR_1341 |
Symbol | |
ID | 4204632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1512448 |
End bp | 1513512 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642565895 |
Product | D-galactose-binding periplasmic protein precursor |
Protein accession | YP_698661 |
Protein GI | 110802557 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
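The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the record. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1512448, 1513512)
print(length, protein_length_aa(length))  # prints: 1065 354
```

Both results agree with the Gene Length (1065 bp) and Protein Length (354 aa) rows, which is consistent with the inclusive-coordinate assumption.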
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000193689 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAGGTTTAGC ACTAATTTTA ATTTCAGCTT TAACTATGGG TACTTTAGTT GGTTGTGGTG GAGGTTCAAG CTCTACTGGA TCATCTGGTG ATACGCCAAA AGAAAACGAT TCACCTAAGA TAGGAGCTAC AATTTATAGT TTTGAAGATA ACTTCATGTC ATACCAACGT AGAAATATAG AAAAACTTTG TAATGGAAAA GCCGAGTTAT TAATGAATGA CTCTCAAAAT AATCAATCTA AGCAAATAGA ACAAGTAGAC ACTATGATAG CTAAGGGTGT TGATATTCTT GCAATAAACT TAGTTGACCC TAAATCTGCT CCTACTGTAA TAGATAAAGC TAAGGCAGAC AACTTACCAG TGGTATTCTT TAATAAGGAA CCAGACGAAG CTGTTATGCA AAGCTATGAT AAAGCTTGGT ATGTTGGTAC AACCTCTGAA GAATCTGGAA TAATCCAAGG GGAAGTAATG GTAGAAGGTT GGAAAGCTAA TCCTGCTTGG GACAAAAATG GTGATGGAAA AATACAATAT GTAATGCTTA AAGGAGAACC TGGTCACCCT GATGCAGAAG CTCGTACAAA ATATTCTGTT GACACAATAA ATAAAGCTGG TATAGAAACT GAGGAGTTAG CAATGGATAC AGCTATGTGG GACTCAACTA AAGCTACTGA AAAGATGGAT GCTTGGATTG CTAAAAATGG TGACAATATA GAAATGGTAA TCTGTAATAA TGACGGAATG GCTTTAGGTG CTATTTCTTC TCTTGAAAAA GCAGGATATT TAGATGGAAC TCCTGAAAAG TTTGTTCCAA TATATGGTGT TGATGCTATT CCTGAAGCTT TAGATAAAAT CAAAGCTGGT AAAATGGCTG GTACAGTATT AAACGATGCC AAGAATCAAG CTCAAGCTCT AGTAGATTCT TGTATGAATT TAGTAAATGG AAAAGAGATA AATGAAGGAA CTAATTGGAA ACTTGATAAT AAAAAATCAA TCCGTGTTCC ATATGTAGGA ATAACTAAAG ACAATATAAA CGTTGCTGAG GATTCATATA AATAA
|
Protein sequence | MKKKGLALIL ISALTMGTLV GCGGGSSSTG SSGDTPKEND SPKIGATIYS FEDNFMSYQR RNIEKLCNGK AELLMNDSQN NQSKQIEQVD TMIAKGVDIL AINLVDPKSA PTVIDKAKAD NLPVVFFNKE PDEAVMQSYD KAWYVGTTSE ESGIIQGEVM VEGWKANPAW DKNGDGKIQY VMLKGEPGHP DAEARTKYSV DTINKAGIET EELAMDTAMW DSTKATEKMD AWIAKNGDNI EMVICNNDGM ALGAISSLEK AGYLDGTPEK FVPIYGVDAI PEALDKIKAG KMAGTVLNDA KNQAQALVDS CMNLVNGKEI NEGTNWKLDN KKSIRVPYVG ITKDNINVAE DSYK
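Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be joined, translated, and checked against the Protein sequence and GC content rows. It assumes plain A/C/G/T input and uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which codons may act as starts; that is not handled here, and this gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first position varies slowest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def join_blocks(text: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = join_blocks(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = join_blocks(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the Gene sequence row above.
print(translate("ATGAAAAAGA AAGGTTTAGC ACTAATTTTA"))  # prints: MKKKGLALIL
```

Passing the full Gene sequence value to `translate` and `gc_percent` allows a comparison with the Protein sequence and GC content rows; the first ten residues already match the opening block of the protein.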