Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1971 |
Symbol | |
ID | 4204140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2177130 |
End bp | 2178110 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642566521 |
Product | IUNH family nucleoside hydrolase |
Protein accession | YP_699280 |
Protein GI | 110802670 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.260258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAGA GAAAGGTAAT TATTGATTGT GATCCAGGGA TAGATGATGC TTTAGCCATT ATTCTTGCAT TAAAGTCAAA AGAGATTGAG GTTGTTGGAA TAACCACCGT ATCAGGAAAT GTTGAAAGCG TGCAGGGAGC TAAAAATGCC TTAAAGGCAC TTAAGCTTTT AGGGAGATTG GATATTCCTG TTTACTTAGG AGAAAGTAAG CCAGTTAAAA GAGAGCTTGT AACAGCACAG GATACTCATG GGGAAGATGG CTTAGGAGAA ACTTTTTTAG AAGAGGTATC TAGTGAGTAT ATTAGAGAAA ATGGTGTTGA TTTTATTTTA AATACTTTAA AAAATCAGGA GAATGTTAGT ATTATTGCTC TTGGACCCCT AACAAACCTA TATAGAGCTA TAGAAAAGGA TTCAGAAACT TTTCATAGAG TTAAGGAAAT AGTTTCTATG GGGGGAGCTT ATAAAAGTCA TGGAAATTGT TCACCAGTAG CTGAATTTAA TTACTGGGTA GATCCTCATG GAGCAAGGGA GTTTCTAAAA AAGTTTAATG GTGAATTTAC CATGGTTGGT TTAGATGTTA CAAGAAAGAT AGTTTTAACA CCAAATTTAA GAGAAATGAT ACATCAATTT AATGATGAAA TTGGTAATTT TATATATGAT ATTACTAGAT TTTATGTTGA TTTCCATTGG GAACAAGAGA GAACCCTTGG ATGTGTTATA AATGATCCCT TAGCAGTAGA ATTTTTTATA AATAGAGATA TTTGTGAAGG TTTTAAAGCT TATGTGGACA TAGCTTGCGA AGATATATCA ATGGGGCAAA GCGTTGTTGA TGTTGCAGAT TTTTATAAGA GAAGAAAAAA TGTATTTGTT TTAGATAAAG TTAATAGCAA AAAATTTATG GTAAGCTTTC TTAATAAAAT ATTCCCAAGT TATAAAGAGG ATATTGAAAA TATACTTAAT AATCCAAAGT ATGGTATTTA A
|
Protein sequence | MDKRKVIIDC DPGIDDALAI ILALKSKEIE VVGITTVSGN VESVQGAKNA LKALKLLGRL DIPVYLGESK PVKRELVTAQ DTHGEDGLGE TFLEEVSSEY IRENGVDFIL NTLKNQENVS IIALGPLTNL YRAIEKDSET FHRVKEIVSM GGAYKSHGNC SPVAEFNYWV DPHGAREFLK KFNGEFTMVG LDVTRKIVLT PNLREMIHQF NDEIGNFIYD ITRFYVDFHW EQERTLGCVI NDPLAVEFFI NRDICEGFKA YVDIACEDIS MGQSVVDVAD FYKRRKNVFV LDKVNSKKFM VSFLNKIFPS YKEDIENILN NPKYGI
|
| |