Gene Information |
Locus tag | CPR_0904 |
Symbol | |
ID | 4204386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 1036071 |
End bp | 1037390 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642565462 |
Product | ABC transporter, periplasmic substrate-binding protein, putative |
Protein accession | YP_698228 |
Protein GI | 110801770 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
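The length fields in the record above are consistent with its coordinates, and the arithmetic can be sketched as follows. This is a minimal sketch, not the database's own method: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical, introduced only for illustration.

```python
# Values copied from the Gene Information record.
START_BP = 1036071
END_BP = 1037390


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp)                  # 1320
    print(protein_length(n_bp))  # 439
```

Under these assumptions the results match the Gene Length (1320 bp) and Protein Length (439 aa) fields.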
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00221898 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAAAT TTTATGGTTT TGTTTTACTA ATATTTTGTT TTAATCTTTT TGGATGTTCT TTTAGAAATA CAGAAGTATC AAATAAAAAT ATTCAAGGAT ATGACAAAGG CGAGGAGCTT ATAACAATGT GGGTTCATGT TATAGAAGAA ACTTCAGAAG GACAAGCTTA TAAAAATTCA GTAGAAAGGT TTAATAAAGA ATACAATGGT AAGTATTGTT TAAGTGTTGA ATTTGTACCT CGTAATGAAA GTGGAGGAGG ATATACTGAT AAAATAAATT CATCAGTAAT TTCTGGAGGA CTTCCAGACA TAATAACTGT TGATGGGCCC AATGTATCAG CTTATGTTGC AAACAACATA ATTCAGCCTT TAGTAGGTAT AACTGATGAT GAAAAGGCTA AATATTTACC GTCAGTAATA GAGCAAGGAA CAATAAACAA TAAATTATAT GCATTAGGTC TAATGGAATC TAGTACGTTA TTTTATTATA ACAAAGATAT ATTAAACGAA GTAGGAATAC AAGTACCATC ATTTGATAAT CCATGGACTT GGGACGAATT AAATAAGGTC TGTGAAAAAG TTAAGAACTA TTTAGATAAA AAAAATGGAT ATCCAATAGA TATGTCATTC CCAGCAGGGG AAACAACTAT TTATTTTTAT GCACCATTTA TATGGTCAAA TGGTGGAGAT TTTGTAAGCT CTGATGGTTT AAAGGTTAAT GGAGTATTTA ATTCTGAAAA GAATGTAGAA ACTATTAGTT ATTTTAAAGA AATTACAGAC AAAGGATATA TACCTAAATA TACAATAAGT GATTTATTTG AAAAGGGAAG GGCTGCATTT AAATTTGATG GAGCATGGGC TATTACAAAT ATAAGAAATA ACTATCCAGC TTTTAATTTA GGAATAGCAC CATATCCAGT GGGAAATGAT TGGAATGGAG AAAAGTATAC ACCAACAGGA GGATGGGCTT TTGCAACAAC TACAACTTGT AAAAACCCTG AGGCTGCAAA AGAAGCAATC AAGTTTTTAA CTAATGCAGA AAGTGGCATA GATATGTATA ACTTAACAGG TAATTTACCA TCTACATTTG AAGCTTATGA AAATATTGAT GCATTTAAAA CTGATGAATT ATTTAAAACA GCATATTATC AGCTTGTTAA CTATGGTCAT CCAAGACCAA AATCACCAGC TTATCCTCAG ATAAGTACAT CATATCAGCA GGCTATTGAA GGTGTACTAT TAAATGATGA AACACCAGAA GAATCGTTAT ATAAAACAAT GAGAAGAATA GAAGATAAGT TAATACGTTA TCAAGATTAA
|
Protein sequence | MKKFYGFVLL IFCFNLFGCS FRNTEVSNKN IQGYDKGEEL ITMWVHVIEE TSEGQAYKNS VERFNKEYNG KYCLSVEFVP RNESGGGYTD KINSSVISGG LPDIITVDGP NVSAYVANNI IQPLVGITDD EKAKYLPSVI EQGTINNKLY ALGLMESSTL FYYNKDILNE VGIQVPSFDN PWTWDELNKV CEKVKNYLDK KNGYPIDMSF PAGETTIYFY APFIWSNGGD FVSSDGLKVN GVFNSEKNVE TISYFKEITD KGYIPKYTIS DLFEKGRAAF KFDGAWAITN IRNNYPAFNL GIAPYPVGND WNGEKYTPTG GWAFATTTTC KNPEAAKEAI KFLTNAESGI DMYNLTGNLP STFEAYENID AFKTDELFKT AYYQLVNYGH PRPKSPAYPQ ISTSYQQAIE GVLLNDETPE ESLYKTMRRI EDKLIRYQD
|
| |
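The protein sequence above begins with M although the gene sequence begins with GTG, which is expected under translation table 11: an alternative start codon is read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene sequence and also shows one way to compute a GC percentage. It is a sketch under stated assumptions, not the pipeline that produced this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and only ATG, GTG and TTG are treated as start codons here, which is a simplification of table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 uses the same
# codon assignments as the standard code.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplifying assumption: a subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading a start codon in first position as M."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in the record.
    print(translate("GTGAAGAAAT TTTATGGTTT TGTTTTACTA"))  # MKKFYGFVLL
```

Passing the full gene sequence to `translate` and `gc_content` would be the natural way to compare against the Protein sequence and GC content fields.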