Gene Information |
Locus tag | Smed_0800 |
Symbol | |
ID | 5321637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 860428 |
End bp | 861828 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640789737 |
Product | multicopper oxidase type 3 |
Protein accession | YP_001326491 |
Protein GI | 150396024 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2132] Putative multicopper oxidases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
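The coordinate, gene-length, and protein-length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(860428, 861828)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1401 466
```

Under these assumptions the computed values agree with the Gene Length (1401 bp) and Protein Length (466 aa) fields.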
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.581336 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATGC TCGACCGCAG AATGTTTCTC CAGGCTTCAG CCGCATTTGG CGGCGCTTTC GCGCTCGGCG CGGGTCTTGC CGCGAAGGCC GGTGGGGCGC CCGATCCGCA GATCCTGACG GCGCGCTTTA CCGAGGCGCG GATCGCAACC GGCGGCACCA CGCCCCGTCT CATGACTTAT GATCTTAGCG GAACGGCAGG CTCCGGCGTG CCGCCGGTCC TCAGGATGCG CAAGGGCGAG CCTTATGCGG CGCGACTGAT CAATCGTCTC GATGAGCCGA CGACGGTCCA CTGGCATGGT TTGAGGATCG TCAATGCGAT GGACGGCGTA CCGGAAATGA CGCAGCCCTA TGTCTATCCC GGCGAGGGCT TCGACTACCT CTTCACGCCG CCCGATGCCG GCACCTTCTG GTATCACCCG CATTGCAACA CGCTGATGCA GATGGGGAGC GGCCTGACCG GCGTGATCGT CGTCGAGAAC CCGAAGGATC CGGCCTTCGA TGCCGAGATC GTCCTCAATC TCAGGGACTG GCGGCTAAAC GCAAGCGGCG CATTCATCGC CCCTTTCAAA CCGCGCGATG CGGCGCGCGG CGGCACCTAT GGCACGGTAA GAACCGCGAA CTGGCAGCGG GAACCGGTTT ACGACGCTCC GGCCGGCGGG CTCGTCCGGG TGAGAATCGC CGCCACCGAC GTCACGCGCA TCTATAGCAT CGGCCTCGAA GGCGCGGCGG CCAAGGTGAT CGCGCTCGAC GGCAATCCGG TCGAAATGCC CTTCGCGCTG GACCGGCTGG ATATCGGCCC CGGGCAGCGT GTCGACCTTG CGCTTCGCAT GCCGGAAAAT GAGGAGAGCC GGGCGACTCT CGACAATTTC CGCGGCTCCA GCCCCTGGAC CATTGCGACT TTCCGAGCAG TCGGCGCCTC GCTGAAGCGT GATCTCAGGG ACATCAACCC CCTGCCCCCT AACCCAGTCG CCGAGGCCGA CCTTTCCACG GCCAGGCGTA TCCCGATCGA TCTGACCGCA ACGGCGGAGC AGGGTGTGTC GACTTCAATC TGCGGCACGC TCGGCTATAC GTTCTGGGCG ATCAACAAGG TCCCGTGGCC GGGCGACACG CCCGATCCGG TCGCGCCGAT CGAGGAACTC AAACTTGGGA AAAGCTATGT GCTCGAGATC GCGAACCGCA CTCCGCACGC CCATCCCATC CATCTGCACG GCCTGAGCTT CCGCGTCCTC GCCGCGAACA AGCGCACCGT GCTGCCGCCG CCGACCGACA CCATCCTGCT CCTGCCGGAC GAACAGGCGC AGCTGGCACT GGTCGCGGAC AATCCCGGCG ACTGGCTTTT CCATTGCCAT ATCATCGAAC ATCAAAAGAC CGGCATGGCG GCTTATTTCC GGGTCGCCTG A
|
Protein sequence | MPMLDRRMFL QASAAFGGAF ALGAGLAAKA GGAPDPQILT ARFTEARIAT GGTTPRLMTY DLSGTAGSGV PPVLRMRKGE PYAARLINRL DEPTTVHWHG LRIVNAMDGV PEMTQPYVYP GEGFDYLFTP PDAGTFWYHP HCNTLMQMGS GLTGVIVVEN PKDPAFDAEI VLNLRDWRLN ASGAFIAPFK PRDAARGGTY GTVRTANWQR EPVYDAPAGG LVRVRIAATD VTRIYSIGLE GAAAKVIALD GNPVEMPFAL DRLDIGPGQR VDLALRMPEN EESRATLDNF RGSSPWTIAT FRAVGASLKR DLRDINPLPP NPVAEADLST ARRIPIDLTA TAEQGVSTSI CGTLGYTFWA INKVPWPGDT PDPVAPIEEL KLGKSYVLEI ANRTPHAHPI HLHGLSFRVL AANKRTVLPP PTDTILLLPD EQAQLALVAD NPGDWLFHCH IIEHQKTGMA AYFRVA
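As a consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below is illustrative only. It builds the codon table from the compact 64-letter amino-acid string of the standard code, which translation table 11 shares for all codon assignments (the two tables differ in permitted start codons). `translate` is a hypothetical helper, not a database function, and the gene sequence is assumed to be listed in coding orientation, as its leading ATG suggests, so no reverse complement is applied despite the minus strand.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (third base varies fastest); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence shown above.
first_30 = "ATGCCGATGC TCGACCGCAG AATGTTTCTC"
print(translate(first_30))  # prints: MPMLDRRMFL
```

The result matches the first ten residues of the protein sequence. Passing the full gene sequence to the same helper would allow the whole 466-residue product to be compared.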