Gene Information |
Locus tag | CPR_0931 |
Symbol | |
ID | 4204324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1064992 |
End bp | 1066617 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 21% |
IMG OID | 642565489 |
Product | solute-binding family 3 protein |
Protein accession | YP_698255 |
Protein GI | 110801475 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain; [COG2200] FOG: EAL domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000314501 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA ATAAAAGATT TATAATTCTT ATATTATTTT TTTTAATATT CATAATACTA TTTTTGCTTC ATGGAGTCAT CATAAATAAA GAAAAGAGAG AGCAAGTAGT TAAAATTGGA TTTTACGACG ATTATCCTCA TTTTTATATA GATCATAGAG GAAATGTTTG TGGATATTAT AAAGACATAA CTGAAAAATT AGCTGAAAAA CTTAATTTTA AAATTGAATA TGTAAACGGA AAAGTTCCAG AACTTTTAAA AGAGCTTAAA AATGGAGAAA TAGATTTGGT TTTTGGAATA AATAAATTGC CAGAGAGGGA GGAACTCTTT GACTTCACAA ATAATCCTAT AAATAATGAG ATGAAGCTTA TATATACTAA TAATGACATA AAATATGGTG ATTTAGAAGG CTTAAATGGT ATGAAAATGG GGTATATAGA AGGTGAGTTA GATAATGAAT GGATATTAGA TTATTTAAGC AAAAGAAATA TAAATGTTGA ATTAGTTAAT GGATCTTCTT ATAAAGCAAT AAAAACATTG TTAGTTGAAA ATAAAGTAGA CTTTATTGTA GATAATCCAG ATAGTGATAT AAAAAATAAG GGTAAAAATA TTAAAGAGAT CTTTCAATTT TCATCTGGAC CAAAATACAT TGTAGCAAAT AAAAATAATA ATGAGTTAAT AAAAAAAATT GATAAATCAC TTACTACAAT TAATCTTAAT TCATATCTTG ATAATGACGA TTATATAAAA AAAACTCATA ATTTTATTAT AGCCATTGTC AATAAAAATA TTATTCATTT AATTGTCTTT ATTTTAATTA TAATTTTATT TAATAGAGGT AAAAAAGGAA TAGCTAAAAT ATTAAATAAG AAAAAAATAT ATAATGATTT AAAAAAAGAC AACTATATTT TATATTATCA ACCAATAGTT GATTTTAGGA ATAACACAAT AAGAAGTGTT GAGGCCTTAT TAAGATTAAA AAAAGATGGT CAATTATTAA CTCCCAATTA TTTTATGAAT ATTATAGAAG AAGCTAATAT GATGAAAGAA ATCACATTAT GGGTATTAAA TAAGGTAATT AAAGATTATA ATATTATAAG TTGTTACAAT AATATAAATG AAGAAAATTT TTATATTTCT ATAAATGTAT CTTTTAATGA GATAAAAGAT ATAGAGTTTT TAAAGAAAAT AGTCAAAATA GTCAATGATA ATAAAATAAT AAAAAATAGT ATTTGTCTAG AAATTATAGA AAAATTTGGG GTAGAAGAGG TAGAAAAAAT ACAAAAAAAC ATTAAGTTTT TACGAGAAAA TGGAATTCTA ATTGCAATAG ATGATTTTGG TGTGGAGTAT TCAAACTTAG ATTTATTAAA AAAAATAGAT TCTAATATCA TTAAATTAGA TAAGTTTTTT GCAGATGGAA TTAATGATTC AGAAATAAGT ATTAAAGTAA TAGGTTTTAT ATTAGATATA TGCAGATTAT CAAATAAATC TATAGTTATA GAAGGAATAG AGAAAAAAGA ACAAGTAGAT ATAATAAAAA CATTTCTTTA TGAAAAAATT TATATTCAAG GATACTATTT TTCAAAACCA TTAGATATTA ATAGTTTAAA AGATTATACT TTTTAA
|
Protein sequence | MKKNKRFIIL ILFFLIFIIL FLLHGVIINK EKREQVVKIG FYDDYPHFYI DHRGNVCGYY KDITEKLAEK LNFKIEYVNG KVPELLKELK NGEIDLVFGI NKLPEREELF DFTNNPINNE MKLIYTNNDI KYGDLEGLNG MKMGYIEGEL DNEWILDYLS KRNINVELVN GSSYKAIKTL LVENKVDFIV DNPDSDIKNK GKNIKEIFQF SSGPKYIVAN KNNNELIKKI DKSLTTINLN SYLDNDDYIK KTHNFIIAIV NKNIIHLIVF ILIIILFNRG KKGIAKILNK KKIYNDLKKD NYILYYQPIV DFRNNTIRSV EALLRLKKDG QLLTPNYFMN IIEEANMMKE ITLWVLNKVI KDYNIISCYN NINEENFYIS INVSFNEIKD IEFLKKIVKI VNDNKIIKNS ICLEIIEKFG VEEVEKIQKN IKFLRENGIL IAIDDFGVEY SNLDLLKKID SNIIKLDKFF ADGINDSEIS IKVIGFILDI CRLSNKSIVI EGIEKKEQVD IIKTFLYEKI YIQGYYFSKP LDINSLKDYT F
|
| |
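The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is only an illustration and is not part of the database record: it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon, and the helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag CPR_0931.
start_bp, end_bp = 1064992, 1066617

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1626, matches "Gene Length"
print(protein_length(length_bp))  # 541, matches "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length (1626 bp) and Protein Length (541 aa) fields listed in the record.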