Gene Smed_1481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1481 
Symbol 
ID5322339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1567144 
End bp1568760 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content65% 
IMG OID640790429 
Product4-diphosphocytidyl-2C-methyl-D-erythritol synthase 
Protein accessionYP_001327161 
Protein GI150396694 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0303] Molybdopterin biosynthesis enzyme
[COG2068] Uncharacterized MobA-related protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.475919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00908696 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGTTCG GACCTGTGCG GCCTGCGGAT GCCGAAGGCG CTCTTCTCGG CCATTCGGTA 
AAGATCGGCA CAGTGAAGCT ATCGAAGGGA CATCGCCTGA ACCCCGACGA TGTCGCGGCG
CTTGCAGAAG CGGATGTAAC GGAGGTCATT GTCGCGGTGC TCGAGCCCGA CGACCTGCCT
GAGGACGAAG CCGCGTCACG CATAGCGGCC GCCATCGCCC CGGATCATCT GCGATTCTCG
CGGGCGGCGA CCGGCCGGGT CAACATCTAT GCCACCGCTG CCGGTCTTTT CGTGGCCAAT
CGCGCCGTTG TCGATCTGCT GAACCGCATA GATCCGGCGA TCACGCTCGC CTGCCTTGCC
GATCATGTTG CCGTCGAGGC CGGCGACATG GTGGCGACGA TCAAGATCAT CCCTCTTGCG
GTGCAGCAAG TGCTCGTGGA GCAAGCGCAG GCACTGCTTC GGGCGGACCG GGCCTTCGAG
GTCAAGCCGT TCACGCCTCG GCGCGCGGCC CTGGTGGCAA CCGAGCTGCC TTCCCTGAAG
AGCCAGGTAA TGGACAAGAC GCGCGCCGTT CTCACCACGC GGCTGCTGCA ATCGGACAGC
CGGCTCGATA GCGAGCGACG CGTGGCGCAT ACGACCGAAG CGGTCGCGGC AGCAATCGTC
GAGGCCGCCT CGACGCATGA CCTGATCATC GTCTTCGGCG CCTCGGCCGT GGCCGATCCC
GACGATGTTA TCCCGGCTGC CATCCGGCGC GCCGGCGGTG TCGTCGAACA TGTCGGAATG
CCCGTCGATC CAGGCAATCT GCTTGTGCTC GGTAGACTTG GCGAAATTCC TGTCCTGGGT
GCTCCCGGTT GTGCTCGAAG CCCGCGGGAA AACGGCTTTG ACTGGATCCT CGATCGTTTG
CTCGCGGGTG AGTGGCCGAG CAGCGAAGAC ATTACAGGGT TCGGCGTCGG CGGGTTGCTC
AAGGAAATAC CGACTCGTCC GCAGCCACGG GAAGGCATCG CCGACAAGCG CGCCGGCCCT
GTTGAGCTCG TCGTTCTGGC AGCCGGTAGG GCGAGCCGGA TGGGGCCGGA GGGCCGGCAC
AAGCTCCTGG CGGAGTTCGA AGGGATGCCG CTCGTGCGGC GGTCAGTGGA GGCGGCCATC
GGCGCGGCTC CGGGCAGAGT GACGGTGGTG ACCGGGCACA GGGAGGCCGA GATTCAGGCG
GCACTCGCAG GCTTGTCCGC CAACTTCGTC TCAAATCCCG ACTACGCGGG CGGGATGGCA
TCTTCGTTGA TTGCGGGCCT CTCCGCTCTC GACACCGGTG CTGGCGGCAT GCTCGTCATG
CTTGCCGATA TGCCGGGGAT CACATCGGAA CACCTGGCAG AATTGATCGC CGCCTTCGAA
GTCGAGTCCG GCCGAGCAGT GGTGCGTGCC GTCGCCGGTG GGCAGCGCGG CAATCCGGTC
ATTCTGCCAA AGGAGACATT CCACGCCGTC CGGCAGCTCG TCGGCGACGT CGGGGCCAGG
CACATCGTCG AGCGGTGCGG CTTGCCGGTG ATCGATGTCG AACTCGGCCC TGCTGCGCGT
CTCGATCTCG ACACACCGGA GGCGATCCTG GCTGCGGGTG GGATTTTGAA GGATTGA
 
Protein sequence
MRFGPVRPAD AEGALLGHSV KIGTVKLSKG HRLNPDDVAA LAEADVTEVI VAVLEPDDLP 
EDEAASRIAA AIAPDHLRFS RAATGRVNIY ATAAGLFVAN RAVVDLLNRI DPAITLACLA
DHVAVEAGDM VATIKIIPLA VQQVLVEQAQ ALLRADRAFE VKPFTPRRAA LVATELPSLK
SQVMDKTRAV LTTRLLQSDS RLDSERRVAH TTEAVAAAIV EAASTHDLII VFGASAVADP
DDVIPAAIRR AGGVVEHVGM PVDPGNLLVL GRLGEIPVLG APGCARSPRE NGFDWILDRL
LAGEWPSSED ITGFGVGGLL KEIPTRPQPR EGIADKRAGP VELVVLAAGR ASRMGPEGRH
KLLAEFEGMP LVRRSVEAAI GAAPGRVTVV TGHREAEIQA ALAGLSANFV SNPDYAGGMA
SSLIAGLSAL DTGAGGMLVM LADMPGITSE HLAELIAAFE VESGRAVVRA VAGGQRGNPV
ILPKETFHAV RQLVGDVGAR HIVERCGLPV IDVELGPAAR LDLDTPEAIL AAGGILKD