Gene Smed_0800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0800 
Symbol 
ID5321637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp860428 
End bp861828 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content65% 
IMG OID640789737 
Productmulticopper oxidase type 3 
Protein accessionYP_001326491 
Protein GI150396024 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.581336 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATGC TCGACCGCAG AATGTTTCTC CAGGCTTCAG CCGCATTTGG CGGCGCTTTC 
GCGCTCGGCG CGGGTCTTGC CGCGAAGGCC GGTGGGGCGC CCGATCCGCA GATCCTGACG
GCGCGCTTTA CCGAGGCGCG GATCGCAACC GGCGGCACCA CGCCCCGTCT CATGACTTAT
GATCTTAGCG GAACGGCAGG CTCCGGCGTG CCGCCGGTCC TCAGGATGCG CAAGGGCGAG
CCTTATGCGG CGCGACTGAT CAATCGTCTC GATGAGCCGA CGACGGTCCA CTGGCATGGT
TTGAGGATCG TCAATGCGAT GGACGGCGTA CCGGAAATGA CGCAGCCCTA TGTCTATCCC
GGCGAGGGCT TCGACTACCT CTTCACGCCG CCCGATGCCG GCACCTTCTG GTATCACCCG
CATTGCAACA CGCTGATGCA GATGGGGAGC GGCCTGACCG GCGTGATCGT CGTCGAGAAC
CCGAAGGATC CGGCCTTCGA TGCCGAGATC GTCCTCAATC TCAGGGACTG GCGGCTAAAC
GCAAGCGGCG CATTCATCGC CCCTTTCAAA CCGCGCGATG CGGCGCGCGG CGGCACCTAT
GGCACGGTAA GAACCGCGAA CTGGCAGCGG GAACCGGTTT ACGACGCTCC GGCCGGCGGG
CTCGTCCGGG TGAGAATCGC CGCCACCGAC GTCACGCGCA TCTATAGCAT CGGCCTCGAA
GGCGCGGCGG CCAAGGTGAT CGCGCTCGAC GGCAATCCGG TCGAAATGCC CTTCGCGCTG
GACCGGCTGG ATATCGGCCC CGGGCAGCGT GTCGACCTTG CGCTTCGCAT GCCGGAAAAT
GAGGAGAGCC GGGCGACTCT CGACAATTTC CGCGGCTCCA GCCCCTGGAC CATTGCGACT
TTCCGAGCAG TCGGCGCCTC GCTGAAGCGT GATCTCAGGG ACATCAACCC CCTGCCCCCT
AACCCAGTCG CCGAGGCCGA CCTTTCCACG GCCAGGCGTA TCCCGATCGA TCTGACCGCA
ACGGCGGAGC AGGGTGTGTC GACTTCAATC TGCGGCACGC TCGGCTATAC GTTCTGGGCG
ATCAACAAGG TCCCGTGGCC GGGCGACACG CCCGATCCGG TCGCGCCGAT CGAGGAACTC
AAACTTGGGA AAAGCTATGT GCTCGAGATC GCGAACCGCA CTCCGCACGC CCATCCCATC
CATCTGCACG GCCTGAGCTT CCGCGTCCTC GCCGCGAACA AGCGCACCGT GCTGCCGCCG
CCGACCGACA CCATCCTGCT CCTGCCGGAC GAACAGGCGC AGCTGGCACT GGTCGCGGAC
AATCCCGGCG ACTGGCTTTT CCATTGCCAT ATCATCGAAC ATCAAAAGAC CGGCATGGCG
GCTTATTTCC GGGTCGCCTG A
 
Protein sequence
MPMLDRRMFL QASAAFGGAF ALGAGLAAKA GGAPDPQILT ARFTEARIAT GGTTPRLMTY 
DLSGTAGSGV PPVLRMRKGE PYAARLINRL DEPTTVHWHG LRIVNAMDGV PEMTQPYVYP
GEGFDYLFTP PDAGTFWYHP HCNTLMQMGS GLTGVIVVEN PKDPAFDAEI VLNLRDWRLN
ASGAFIAPFK PRDAARGGTY GTVRTANWQR EPVYDAPAGG LVRVRIAATD VTRIYSIGLE
GAAAKVIALD GNPVEMPFAL DRLDIGPGQR VDLALRMPEN EESRATLDNF RGSSPWTIAT
FRAVGASLKR DLRDINPLPP NPVAEADLST ARRIPIDLTA TAEQGVSTSI CGTLGYTFWA
INKVPWPGDT PDPVAPIEEL KLGKSYVLEI ANRTPHAHPI HLHGLSFRVL AANKRTVLPP
PTDTILLLPD EQAQLALVAD NPGDWLFHCH IIEHQKTGMA AYFRVA