Gene Smed_5491 details

Gene Information

Locus tag: Smed_5491
Symbol:
ID: 5319793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 457164
End bp: 460085
Gene Length: 2922 bp
Protein Length: 973 aa
Translation table: 11
GC content: 62%
IMG OID: 640777246
Product: molybdopterin oxidoreductase
Protein accession: YP_001314178
Protein GI: 150377583
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
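
The Start bp, End bp, Gene Length and Protein Length values in this record can be cross-checked with simple arithmetic. The Python sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 457164
end_bp = 460085
protein_length_aa = 973

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 2922
print(expected_protein_length)  # 973
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.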


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCACGA AGCGCAAGGA TCGGCAGGCA AGCCGCACGC ATCTGCAAAC CCTGACCGGC 
GGGATGGCAG CGGCCCCGTT GGACCGACGG ACATTCCTCC GCCGCTCCGG GCTCGTCGCC
GGCGGTCTGG CGGCACTCGG TTCCTTCCAA CTCGGCACAG TCAAAAAGGC CGCCGCAATC
AGCCCCCCGC AACCGGGCGT GCCGATCGAA CTCAAGAAGA GCATCTGTAC CCACTGTTCG
GTCGGATGCA CCGTAACGGC AGAGGTCCAG AACGGCGTGT GGACGGGCCA GGAACCATCC
TGGGACAGCC CCTTCAATCG CGGCTCGCAT TGCGCCAAAG GCGCCAGCGT ACGTGAACTG
GTGCACAGCG ACCGCCGTCT GAGATACCCG ATGAAGCTCG TCGACGGCCG ATGGGAGCGT
ACCACTTGGG AGGAGGCGAT CAACGGCATC GGCGACAAGC TTCTCGAGAT CCGCGAGAAG
TCCGGGCCGG AATCCACCTA TTGGATGGGG TCGGCGAAAT TCTCCAACGA AGGCACCTAT
CTCTTCCGCA AGCTCGCCGC TCTGTGGGGA ACCAACAACC AGGATCACCA GGCACGAATC
TGTCATTCGA CGACGGTTGC CGGCGTAGCC AACACCTGGG GCTACGGCGC GATGACCAAC
TCGTACAACG ACATCAGAAA TTCAAAGACG ATGCTCGTTC TTGGCGGAAA TCCGGCCGAG
GCGCATCCGG TCGCGATGCA GCATCTGCTC GAGGGCAAGG AACTCAACAA CGCCAATTTC
ATCGTCATGG ACCCGCGCTT CACGCGCACG GCCGCCCATG CGACCGAATA CGTCCGCTTC
AGGTCAGGCA CCGACATCGC GCTGATCTGG GGCATGCTTT GGCATATTTT CGAGAATGGC
TGGGAGGACA AGGAATTCAT CACGCAACGC GTCTACGGGA TGGATGAGGT TCGAAAGGAG
GTCGCCAAGT ACACTCCGGA CGAGGTGGAA ATGATCACCG GCGTGCCCGG CGAGCAGTTG
AAGCGCGTCG CCGAGACGTT CGCGACCCAG AAGCCATCGA CGATCATCTG GTGCATGGGC
CAGACGCACC ACACTGTTGG AACCGCCAAT ACGCGGGCGA GCTGCATCCT CTGTCTCGCT
ACCGGCAACA TCGGCAAGCC GGGCACCGGC GCCAATATCT TCCGCGGCCA CGACAATGTT
CAGGGCGCGA CTGACATCGG GCTCGACGTA GTCACTCTGC CCTTCTACTA CGGTCTGACG
GAAGGGGCCT GGAAGCACTG GGGGCGGGTG TGGGAAATCC CCTATGAGGA CCTGGTGGGC
CAATTCTCTT CCAAGGAACT GATGGAAACA CCCGGCATTC CGCTGACCCG ATGGTTCGAT
TCGGTTTCAC TGCCAAAGGA AGACGTCGGG CAACCCGACG TCGTGCGGTC GATGTTCGTA
CAGGGCCACG CCAGCAACAG CATCACGCGC ATACCCGAGT CGATCAAGGG GCTTGCCGGA
CTTGAACTGC TTGTCGTGGC CGATCCCCAC CCCACGACCT GGGCCTCGCT CGCGGTGCAG
GCGGGGCGCA AGGACAACAC CTATCTCCTG CCGGTCTGCA CGCAGTTCGA AACCTCGGGA
TCGCGGGTCG CCTCCAACCG CTCGATCCAG TGGGGCGAGC AAATCGTCCC TCCAAGCTTT
GAGCAGAAGG ACGATTATCA GGTCCTGTAC CTGCTCTCGC AGAAGCTTGG CCTTGCCGAA
TGGATGTTCA AGAACATCGC AGTCGAAGGT GATAGGCCGG TACCCGAGGA CGTGCTGCGT
GAAATGAACC GCGGCGGCTG GTCGACCGGC TATTGTGGCC AGTCGCCCGA GCGGCTCAAG
GCGCATATGC GCAACCAGCA CAAATTCGAC CTGCTCACCC TGCGCGCGCC GAAGGACGAC
CCGGAGGTCG GCGGAGATTA TTACGGTCTC CCTTGGCCAT GCTGGGGGAA GCCGGAACTC
CGTCACCCGG GAACTGCAAA CCTCTACAGC ACCGGCCTGA ACGTCATGGA CGGGGGCAGC
CCATTCCGCG CCCGCTTCGG CGTGGAACGC AACGGCGAAA CGCTTCTCGC CGAGGGTTCT
TACACGGAAG GCTCCGAACT CACGGACGGT TATCCGGAGT TCACCATGGC CGTGCTCCAG
AAACTCGGCT GGGATCGGGA TCTCACCCCG GAGGAGCTAG GCATCATCCA GGCGATCGGC
GATGATATGG GCGACATCGG CAAGGTGTCG TGGTCTACCG ACCTGTCTGG CGGCATCCAG
CGCGTGGCGC TCAGCCATGG CTGTCACCCC TACGGCAATG GCAAGGCCAG GGCACTTGCC
TGGAATTTGC CGGATCCGGT GCCGATCCAC CGCGAGCCGA TCTACACGCC GAGGGTGGAC
CTTGTAGCCA AGTATCCGAC CTATCCGGAC GCCAAGCAGT TCCGCCTGCC CAATATCGGG
TTCAGTGTGC AGAAGGCGGC TGTGGACAAT GGCACCGCGA AATCGTTCCC GATTATTCTG
ACGACCGGAC GCCTCGTCGA ATACGAGGGT GGCGGCGAGG AAACCCGCTC GAACCGGTGG
CTTGCCGAGT TGCAGCAGGA CATGTTCGTC GAGATCAACG TCGACGATGC AGCCGAGCGC
GGCATCAGTG ACGGAGGTTG GGTCTGGGTC AACGGAGCGG AGAACGACGT CAAGGCAAAG
GTAAAGGCCC TGGTCACCGA GCGTGTCGGC AAGGGTGTCG CCTTCATGCC ATTTCACTTC
GGCGGCTGGT TCCGGGGCGA GGACTTGCGA GCCAATTACC CCGAAGGGAC CGACCCTTAC
GTTCTGGGCG AAAGCGCCAA CAGTATCACC ACCTACGGCT ACGATCCTGT AACGGGGATG
CAGGAACCGA AGGTTACGCT TTGCCAGATC GCGGCGGTGT AG
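
The gene sequence above is laid out in blocks of 10 bases, 6 blocks per line, so it has to be stripped of whitespace before any calculation. The following Python sketch shows one way to do that and to compute a GC fraction comparable to the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and the sample is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short sample.
sample = "ATGCTCACGA AGCGCAAGGA TCGGCAGGCA AGCCGCACGC ATCTGCAAAC CCTGACCGGC"
seq = clean_sequence(sample)

print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of the sample line only
```

Pasting the full sequence into `sample` would give the value for the whole gene, which can then be compared against the reported GC content.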
 
Protein sequence
MLTKRKDRQA SRTHLQTLTG GMAAAPLDRR TFLRRSGLVA GGLAALGSFQ LGTVKKAAAI 
SPPQPGVPIE LKKSICTHCS VGCTVTAEVQ NGVWTGQEPS WDSPFNRGSH CAKGASVREL
VHSDRRLRYP MKLVDGRWER TTWEEAINGI GDKLLEIREK SGPESTYWMG SAKFSNEGTY
LFRKLAALWG TNNQDHQARI CHSTTVAGVA NTWGYGAMTN SYNDIRNSKT MLVLGGNPAE
AHPVAMQHLL EGKELNNANF IVMDPRFTRT AAHATEYVRF RSGTDIALIW GMLWHIFENG
WEDKEFITQR VYGMDEVRKE VAKYTPDEVE MITGVPGEQL KRVAETFATQ KPSTIIWCMG
QTHHTVGTAN TRASCILCLA TGNIGKPGTG ANIFRGHDNV QGATDIGLDV VTLPFYYGLT
EGAWKHWGRV WEIPYEDLVG QFSSKELMET PGIPLTRWFD SVSLPKEDVG QPDVVRSMFV
QGHASNSITR IPESIKGLAG LELLVVADPH PTTWASLAVQ AGRKDNTYLL PVCTQFETSG
SRVASNRSIQ WGEQIVPPSF EQKDDYQVLY LLSQKLGLAE WMFKNIAVEG DRPVPEDVLR
EMNRGGWSTG YCGQSPERLK AHMRNQHKFD LLTLRAPKDD PEVGGDYYGL PWPCWGKPEL
RHPGTANLYS TGLNVMDGGS PFRARFGVER NGETLLAEGS YTEGSELTDG YPEFTMAVLQ
KLGWDRDLTP EELGIIQAIG DDMGDIGKVS WSTDLSGGIQ RVALSHGCHP YGNGKARALA
WNLPDPVPIH REPIYTPRVD LVAKYPTYPD AKQFRLPNIG FSVQKAAVDN GTAKSFPIIL
TTGRLVEYEG GGEETRSNRW LAELQQDMFV EINVDDAAER GISDGGWVWV NGAENDVKAK
VKALVTERVG KGVAFMPFHF GGWFRGEDLR ANYPEGTDPY VLGESANSIT TYGYDPVTGM
QEPKVTLCQI AAV
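
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this on the first and last stretches of the gene. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits. The sketch does not handle alternative start codons or ambiguous bases such as N, and the `translate` function and `CODON_TABLE` name are hypothetical.

```python
from itertools import product

# Codon ordering used in NCBI genetic code listings: bases in the order T, C, A, G.
BASES = "TCAG"
# Amino acids for the 64 codons in that order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a blocked DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCTCACGA AGCGCAAGGA TCGGCAGGCA"))  # MLTKRKDRQA
# Last line of the gene sequence (starts in frame at base 2881).
print(translate("CAGGAACCGA AGGTTACGCT TTGCCAGATC GCGGCGGTGT AG"))  # QEPKVTLCQIAAV
```

Both outputs match the corresponding ends of the protein sequence listed above.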