Gene Information |
Locus tag | Msil_0785 |
Symbol | |
ID | 7092643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 866219 |
End bp | 867175 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643464122 |
Product | protein of unknown function DUF58 |
Protein accession | YP_002361117 |
Protein GI | 217976970 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.159111 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCTGG TCCGCCTCCT GACGGGCGAC GAAAGCCGTC CGGGCGCAGG GCGCCAGGAA GAAGCGCTGA CGCTGTCGCA GCGCTTTCCC GGCCTCGTCG TCTCCGCGCG GGATGTGGCG GCGAGCGTTT TGCAAGGGGT ACACGGACGC CGCCGCGCCG GCTCCGGCGA GACGTTCTGG CAGTTTCGCC CTTTTGTCGC GGGCGAATCG CGCGGCCGCA TCGACTGGCG CCGTTCGGCG CGCGACGATC GGCTCTATGT GCGTGAACGG GAATGGGAGG CCGCCCATAC GGTGATGCTC TGGATCGACC GTTCCGCTTC GATGCGCTTT GTCTCAAAGC TCGCCTTGCA GGCGAAGATC GACCGCGCTC TCGTTCTGGG CCTTGCCGCC GCCGATCTTC TCGTCCAGGG CGGCGAGCGC GTCGGCTTGC TCGGCCTGAC CCGCCCGCTG GCGGCGCGCA ATATCGTCGA AAGGTTCGGC GAAGTTCTTC TCAACGAATT CCGGCTGCGG GAAAAAAACG GCAAAGCCAG CGAAGCCGAG GAGCTGCCCC CGCCCGAAGT CCTGCCGCGC AATTCGCAGG CGGTGCTGAT CGGCGATTTC CTCAGCGCCC CGCAAGATAT TGCGGCGACG ATCGAAGCGC TCGGCGCCCT TGGCGCGCGC GGCCACCTCG TGATGATCGC GGACCCCGTG GAAGAAACCT TTCCCTTTGC CGGCAACACG GAATTCATCG ACGTCGATTC GCCGGCGCGG CTCCGCATCG GGCAGGCGGA ATCTTTCCGC GCCGATTATA TCCGCAGGCT CACCGCCCAT CGCGAGGCGA TCCGCGCGGC GGCGCGGGCG CGCGGCTGGA CCTTGATGCT GCACAGGACA GATCGCCCGG CGACCGAGGC GCTGCTGGGT TTGAGAATGC AGCTTGAGGC CAATCTGTTT AACGCCGCCG CCGGCCACGC GCTTTGA
|
Protein sequence | MPLVRLLTGD ESRPGAGRQE EALTLSQRFP GLVVSARDVA ASVLQGVHGR RRAGSGETFW QFRPFVAGES RGRIDWRRSA RDDRLYVRER EWEAAHTVML WIDRSASMRF VSKLALQAKI DRALVLGLAA ADLLVQGGER VGLLGLTRPL AARNIVERFG EVLLNEFRLR EKNGKASEAE ELPPPEVLPR NSQAVLIGDF LSAPQDIAAT IEALGALGAR GHLVMIADPV EETFPFAGNT EFIDVDSPAR LRIGQAESFR ADYIRRLTAH REAIRAAARA RGWTLMLHRT DRPATEALLG LRMQLEANLF NAAAGHAL
|
| |
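The coordinate, length, and sequence fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand). All function names are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons).

```python
# Illustrative helpers for checking the record's fields; names are hypothetical.

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon.

    Whitespace is ignored so the space-separated blocks of 10 bases
    used in the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(gene_length(866219, 867175))           # 957, matching Gene Length
print(expected_protein_length(957))          # 318, matching Protein Length
print(translate("ATGCCGCTGG TCCGCCTCCT GACGGGCGAC"))  # MPLVRLLTGD
```

Applying `translate` to the full gene sequence and comparing the result with the protein sequence field (after removing the spaces) would extend the same check to the whole record.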