Gene Information |
Locus tag | CPR_1104 |
Symbol | |
ID | 4204881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1248372 |
End bp | 1249775 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 642565660 |
Product | hypothetical protein |
Protein accession | YP_698426 |
Protein GI | 110801930 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
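The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no amino acid. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1248372
end_bp = 1249775

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1404 0 467
```

Under these assumptions the computed values agree with the reported Gene Length (1404 bp) and Protein Length (467 aa).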
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.229225 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACAT CTGTAATGCT TACAGGATGT GGATCAAGTG ACAAAAAAGC AGAGGGAGGA GCAGCAAGTG AGGAACCAGT TAACTTAGTA TGGTATGTTA TAGGTAAACC TCAGACTGAT GGAGAATTAG TAGAAGAAGA AGTAAATAAA TATATAAAAG ATAAAATAAA TGCTACTGTA GACATAAAAC ATATTGACTT TGGTGATTAC AGCCAAAAAA TGAATGTAAT AGCTAACTCA GGAGAAGAAT ATGATTTAGC ATTTACATGT TCATGGGCTT TCCCATACTT AGAAAATGCT AGAAAAGGTG CTTTCCTTGA ATTAAATGAT TTATTAGATA AAGAGGGAGC AGACCTTAAA GGGGTTATAG ATGAAAGACT TTGGAAAGGT GCTGAGGTTG ATGGAAAAAT ATATGGAGTT CCAAACCAAA AAGAAATAGC AGGAGCACCT ATGTGGGTAT TTGATAAGGA GCTTGTTGAA AAATATGATA TTCCATATCA AGATTTACAC TCAGTAGAAG ATTTAGAACC ATGGTTACAA ATCATAAAAG AAAAGGAACC AGATTTTGTA CCATTCTATA CTCAAGGGGA TTCAATTCCA TTAGAATTTG ATGAAATAAT GAGACCTTTA GGAGTATTCT TTAATGATGA TACTTTAACA GTACAAAATA TGTATGAGAC AGAAGAAATG AAGGCTATGA TGACTAAATT AAGAGAATAC TATGAAAAAG GATATATAAA TCAAGATGCA GCAGTTAATA ATATGAAAAA TGAAGTTAAG AGATTTATGT GGAAAGCTGA TGGACAACCA TATGCAGAAA ATGGATGGGG ACAAGCTTTA GGTAGAGAAG TTGTAACATC ATCAATAATC CCTCCATATG TTACAAATAA TTCAACAACT GGAGCTATGA CTGCTATATC AGCAACATCT AAGCATCCTG AAAAAGCTAT GGAGCTTATA AACTTAGTAA ATACTGACTC TACATTAAGA AACCTATTAA TGTTTGGAAT AGAGGGAACT CACTATGAAA AGGTTAGTGA CAATCAAATA AAGAGAGATC CAAATGGACC ATATAGTGTT ACAAGTTGGG CTTACGGAAA CTTATTTGAT ACTTACGTTT TAGATAGTGA TCCAGCAGAT AAATGGGATG CTTTTGAAGA ATTTAACCAA GGTGCTAAGA CTTCACCAAT CTTAGGATTT AAGTTCAATA CAGAGCCAGT TACAACTCAA ATATCAGCAA TTAATAACGT ATTACAAGAG TTTGAAAGAA CTTTATACTC AGGTTCAGTA GATCCAGTAA AAGGATTAGA TGACTTAAAT AAAAAGTTAG CTGCATCTGG ATTAGATGAC ATAAAAGCTG AAATGCAAAA ACAATTAGAT GAATGGAAAG CTTCTAATAA ATAA
|
Protein sequence | MTTSVMLTGC GSSDKKAEGG AASEEPVNLV WYVIGKPQTD GELVEEEVNK YIKDKINATV DIKHIDFGDY SQKMNVIANS GEEYDLAFTC SWAFPYLENA RKGAFLELND LLDKEGADLK GVIDERLWKG AEVDGKIYGV PNQKEIAGAP MWVFDKELVE KYDIPYQDLH SVEDLEPWLQ IIKEKEPDFV PFYTQGDSIP LEFDEIMRPL GVFFNDDTLT VQNMYETEEM KAMMTKLREY YEKGYINQDA AVNNMKNEVK RFMWKADGQP YAENGWGQAL GREVVTSSII PPYVTNNSTT GAMTAISATS KHPEKAMELI NLVNTDSTLR NLLMFGIEGT HYEKVSDNQI KRDPNGPYSV TSWAYGNLFD TYVLDSDPAD KWDAFEEFNQ GAKTSPILGF KFNTEPVTTQ ISAINNVLQE FERTLYSGSV DPVKGLDDLN KKLAASGLDD IKAEMQKQLD EWKASNK
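As a rough consistency check, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal illustration, not the database's own method. It assumes the sequence is pasted as 10-base blocks separated by spaces, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons). The names `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1 until the first stop codon or the end of input."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGACTACAT CTGTAATGCT TACAGGATGT"
print(translate(first_blocks))  # MTTSVMLTGC
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.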