Gene Information |
Locus tag | Msed_1006 |
Symbol | |
ID | 5105605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 926006 |
End bp | 926944 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640506905 |
Product | inosine/uridine-preferring nucleoside hydrolase |
Protein accession | YP_001191098 |
Protein GI | 146303782 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0522631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0258769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACATT TCATCATTGA TTGCGACACG GCCGAGGATG ATATATTTAG TCTATTCCTT CTCCTGCACA AGGGAATGAA GGTACACGGT ATCACCGTGG TCGAGGGCAA CGTCTCATTC CCTGTGGAGG TGAGGAATGC GCTATGGGCC TCTGATTTTG CCCGTAGATA TTTCAAGGTG GACTTGAAGG TATACCCAGG GATGGAGAGA CCCCTAATCA AGGGGTTCAG GACAGTCGAG AACGTTCATG GAAAGGGGGG TATTGGTGAC TCTGCCCTGG AGACCAACGC CAAACCTGAG CCCAAGCACG CCGTCGACTT CATCCTGGAG ACCGCGGATC GCTATCCGGG AGAGCTCGAG TTCCTAGCGA TCTCTCCGCT CACGAACCTA GCCATGGCGT ACCTCAAGGA CAAAAGTCTC CCTGAAAAAA TAGGTAAGGT CTGGGTCATG GGTGGTACCA TCAACGGTCA CGGCAACATA ACGCCCGCCG CAGAGTACAA TATCTGGGTT GATCCCGACG CCGCGAAGCT AGTGTTTAAC GCTGGATTCG ACATTACCAT GGTTGCGTGG GACCTCATAA CACAGTACAC CGTGAATGAG GAGTGGGAGG AGATTAAGAG GATGAACACC GAGATGAGTC AGCTCTACAT CAACTTCTAT ACCCACTACA GGAACTTCGC CATGACCAGG CAGAAGATGA GGGGGAACCC ACACCCCGAC CTCATCACAA CTGCTGTTGC GTTGGATCAG AGTGTGGCGA CCCGCGTGGA GAGGCAGTTC GTCGACGTGG AGAACTGCGA TTGCCTTACC AGGGGTGCCA CGGTGATTGA CTATCTAGGA GTCCTGGGCA AGGAGCCCAA CGTGAACGTG GTCTACGAAA TAGATAGGGG CAAGTTCATT GCCATGCTCC TAGATCTTCT GGGAGGTCAA CGGGTCTGA
|
Protein sequence | MRHFIIDCDT AEDDIFSLFL LLHKGMKVHG ITVVEGNVSF PVEVRNALWA SDFARRYFKV DLKVYPGMER PLIKGFRTVE NVHGKGGIGD SALETNAKPE PKHAVDFILE TADRYPGELE FLAISPLTNL AMAYLKDKSL PEKIGKVWVM GGTINGHGNI TPAAEYNIWV DPDAAKLVFN AGFDITMVAW DLITQYTVNE EWEEIKRMNT EMSQLYINFY THYRNFAMTR QKMRGNPHPD LITTAVALDQ SVATRVERQF VDVENCDCLT RGATVIDYLG VLGKEPNVNV VYEIDRGKFI AMLLDLLGGQ RV
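The coordinates and lengths listed under Gene Information can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are inclusive, and that the CDS ends in a single untranslated stop codon (the gene sequence above ends in TGA). The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 926006
end_bp = 926944

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")        # listed as 939 bp
print(f"Protein length: {protein_length} aa")  # listed as 312 aa
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields in the record.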