Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_2505 |
Symbol | |
ID | 4206227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 2722262 |
End bp | 2723488 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642567055 |
Product | NupC family protein |
Protein accession | YP_699752 |
Protein GI | 110803704 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1972] Nucleoside permease |
TIGRFAM ID | [TIGR00804] nucleoside transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.542894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATAGGT TTATTGGTGT AATCGGTCTT ATTTGTATTA TTGGTATAGC TGTTCTTTTT TCTGAAAATA GAAAGAAGAT CAACTGGCGC TTGGTTGGAA CAGGTCTTTT ATTACAAATT ATCTTCGCAT TACTAATTCT AAAAGTTCCT GCTGGTAGAG CAGTTTTTGA ATGGATTAGT AATGGAATAA CTAAATTATT AGATTTTACT AAAGAAGGTA GTTCATTCTT ATTTGGCCCA TTACTTGATA CAGACAAATT CGGTATGATA TTTGCTCTAC AAGTTTTACC AACAATTATC TTCTTCTCAT CATTAATGAG TGTACTTTAT CATTTAGGTA TAGTTCAAGT TGTTGTAAAA GTTATTGCTA AAGGGGTTGC TAAAGCATTA GGAACAAGTG GTGCTGAAAC TTTCAGTGCA GTTGGTAATA TCTTCTTAGG CCAAACAGAA GCTCCTCTAC TAGTTAAACC ATACATAAAG AACATGACTA AATCAGAAAT ATGCGCAATT ATGGTAGGTG GTATGGCTAC TGTTGCTGGT GGTGTTATGG CTGGTTATGT AGCTATGGGT GTTAACGCTG GTAACTTATT AGCAGCATCA ATCATGGCAG CTCCTGCCGG ATTAATATTA GCTAAAATAC TAGTTCCAGA AACTGAAGTT CCTGAAACTA AAGGTGGCGG AACTTTAGAC CTTAAAGTTG AAAGTGAAAA TGTTATTGAA GCTGCTGCAA ATGGTGCTTC AGAAGGTTTA GGATTAGCTT TAAATGTTGG TGCTATGCTT CTTGCATTCG TTGCTCTTAT AGCTATGATT AATGCTTTAT TCGGAGCAAT AGGTGGAATA TTTGGTGCAC CTTGGTTAAG CTTAAACTGG ATTCTTGGTA GATTATTCTC TCCATTAGCA TTTATAATGG GAGTTCCAAC TAAAGATGTT TTCGTAGCTG GAGACTTACT AGGAATTAAA TTAGCAGTTA ATGAATTCTT AGCTTACTCA CAATTATCAA ACTATATAGC AAATGGTACT TTAGAACCTA AAACTATAAT GATATTAACT TATGCTCTTT GTGGATTTGC TAACTTAAGT TCAGTTGCTA TACAATTAGG TGGTATTGGT GGATTAGCTC CAGAAAAGAA ACCAACTATA GCTAAGTTAG GATTTAAAGC ACTTTTAGGT GGTGTATTAG CTACTTGTAT GACAGCTACT ATAGCAGGTA TCTTATTTAG TGCTTAA
|
Protein sequence | MDRFIGVIGL ICIIGIAVLF SENRKKINWR LVGTGLLLQI IFALLILKVP AGRAVFEWIS NGITKLLDFT KEGSSFLFGP LLDTDKFGMI FALQVLPTII FFSSLMSVLY HLGIVQVVVK VIAKGVAKAL GTSGAETFSA VGNIFLGQTE APLLVKPYIK NMTKSEICAI MVGGMATVAG GVMAGYVAMG VNAGNLLAAS IMAAPAGLIL AKILVPETEV PETKGGGTLD LKVESENVIE AAANGASEGL GLALNVGAML LAFVALIAMI NALFGAIGGI FGAPWLSLNW ILGRLFSPLA FIMGVPTKDV FVAGDLLGIK LAVNEFLAYS QLSNYIANGT LEPKTIMILT YALCGFANLS SVAIQLGGIG GLAPEKKPTI AKLGFKALLG GVLATCMTAT IAGILFSA
|
| |