Gene Smed_4003 details


Gene Information

Locus tag: Smed_4003
Symbol:
ID: 5318283
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 457288
End bp: 459606
Gene Length: 2319 bp
Protein Length: 772 aa
Translation table: 11
GC content: 64%
IMG OID: 640775811
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
Protein accession: YP_001312744
Protein GI: 150376148
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
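
As a quick consistency check on the fields above, the following Python sketch recomputes the gene and protein lengths from the start and end coordinates. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function and variable names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 457288
END_BP = 459606


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2319
print(protein_length(length_bp))  # 772
```

Both results agree with the Gene Length (2319 bp) and Protein Length (772 aa) fields listed in the record.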


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.567509
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAACACGC ATGAGAGAAT CGACAGAGAT TTGTCCGAAC GGGATTTCAC CGTCGTGGGC 
AAAAGCGTCA AGCGTTCCGA TACGCTGGAA AAGGTGACGG GGGCGGCGAG ATATGCTGGA
GACGTCGCTT TGCCCGGCAT GCTCTATGCC AAGATGAAGC GCAGCAACAT CGCGCATGCG
CGCATCAAAA GCATCGATAC GTCCAAGGCG CTGGCGCTCC AAGGAGTGAA GGCGGTGCTG
ACCCACCAGG ACGTTCCGCG CGTGCTGCAT GCCGGCTCGC CGCATCCACG CTCGGCATCG
GTTACAAAGG ACCAGTACAT CCTGGACGAG AGGGTACGCT ACTGGGGCGA GGGCGTTGCC
GCCGTCGCTG CCGTCAGTGA GGAAATCGCC GAGCGGGCGG TCGCACTCAT CGAGGTCGAA
TACGACCCTC TGCCCGGCCT CTTCACCATC GAAGCGGCAT CGGATCCCGC GGCACCGCCG
ATTCATGAGA ACGGCCTCGG TCAGAATTAC GTGCTGCCGC CGGTCTTCGT CACGCGGGGC
GACGTCGACA AGGGCTTTGG CGAAGCGGAT CTGGTTATCG AGCGCGAATA TGATCTCGGC
CGCCCGACTC CGGCCTATAT GGAGCCGAAC GTCTGTGTCA GCCAGTGGGA CGGCAACGGC
AAGCTGACCA TGTGGACCTC GACGCAAAGT GCCTTCATGG TCCGCGGCAC CCTCGCCGAA
GTGCTCGGCG TGCCGCTGAA CAAGGTGCGC GTGATCGTCG ACCACATGGG CGGAGGCTTC
GGCGCAAAGC AGGATCTGTT TCAGAACGAA TTCCTCTGCG CGCTGCTTGC ACGCCGGACC
GGGCGCCCCG TCAAGATGGA GTTCAGCCGG AAGGAAACAT TCGTGGGCGG CCGCTCGCGT
CACCCCGGCA AGATCTGGCT CAAGCAGGGC TTCACCAGGG ATGGCCGCAT CGTCGCCCGC
GAAGCCAGGG TCACGTTCAA TTCCGGGGCC TACGGATCGC ACGGTCCAGG CGTAACCAAT
GTCGGCACTG CTGCACTGAC CTCGCTCTAT CGCTGCGAAA ATGTGCGTCT CGAAGGCCGC
TGCATCTACA CGAACTCGCC GATCGCCGGG GCCTTTCGCG GCTATGGCGT CGTGCAGACA
TATTACGCAC TCGACCTGAT GATGGACGAG GCGGCCGAGA AGCTCGGTTT CGATCCGGCC
GAGTTCAAGC TGATGAATGC GGTGCGCGAG GGCGACATGG CGCCGTCGGG GCATCCGATC
GTCGGCCATG GGCTGGTGGA TTGTATCCGC CGTGTCATGG AAGAGACGAA CTGGCACGAA
CTGCGCCGGC GCGAGAAGCC GGAAACCGTC AAGCGGCGCG GGATTGGGAT CGGGTGCGAA
ATGCACGGCT CCAGCGCCTA TCCCGGCATC AAGGAGCAGG GCAACGCGAT CGTCAAAATG
AACGAGGACG GCACGGTGAC CCTGATCACC GGTACAGCTG GCCTTGGCAC CGGCGCGCAT
ACGGCGCTGT CGCAGATCGT CGCCGAAGAG CTCGGAGTGC CGTTCGAGGC CGTCTCCGTC
GTTCAGGGCG ATACCGACAT GGTGCCCTGG GACATCGGCG CCTTTGCCAG CCACACGACT
TATCTGGGCG GCCGGGCGGC GCAGCTCGCG GCGGCCGACG TGCGCAGGCA GGTGCTCGAG
CACGCCGCCC CCCTGCTCAA GGCGGAGCCC GGCGACCTCG CGATCCGCGA TGGATTCGTG
GTCGTCGCCA ACGGCTCGAA CCAGAGCCTC CGGCTTTCCG AAGCCGTGGG GCCGCAGCGG
GGCATGCCGG CGGTGCAACT GGTCGGCGTC GGTACCTATA TGCCGACGAA GTCCTACTCC
TTCGCCGCGC ATTTCGCCGA AGTCGAGGTG GATACGGAGA CGGGCGAGGT CGCCGTTCTG
GAGGTCGTGC CGGTACACGA GATCGGCAGG GTCATCCATC CGATCGCCGC GGCAGGACAG
ATCGAGGGCG GCATTCAGCA GGGCATCGGC CACACGCTCA GCGAGGACTA CGTCATCGAC
CTTACCGACG GGCGCTCGCT CAATCCGAGC TTCGTCGACT ACAAGATGCC GCTGTCGATG
GACATGCCGT CCATCCGCAC CATCATCATC GAGACGGCGC CGGATCCCGG CGGCCCATAC
GGCGCCAAAG GCGTCGGCGA GGATCCGATC ATCGCGATCG GGCCGGCCAT CGCCAACGCG
ATCTACGACG CCATCGGCGT CCGCTTCCAT CATTACCCGA TAACGCCCGA GCAAGTGTTG
AACGCGCTCA AGACCAAAGC CAACGAAACG AGGCAGTGA
 
Protein sequence
MNTHERIDRD LSERDFTVVG KSVKRSDTLE KVTGAARYAG DVALPGMLYA KMKRSNIAHA 
RIKSIDTSKA LALQGVKAVL THQDVPRVLH AGSPHPRSAS VTKDQYILDE RVRYWGEGVA
AVAAVSEEIA ERAVALIEVE YDPLPGLFTI EAASDPAAPP IHENGLGQNY VLPPVFVTRG
DVDKGFGEAD LVIEREYDLG RPTPAYMEPN VCVSQWDGNG KLTMWTSTQS AFMVRGTLAE
VLGVPLNKVR VIVDHMGGGF GAKQDLFQNE FLCALLARRT GRPVKMEFSR KETFVGGRSR
HPGKIWLKQG FTRDGRIVAR EARVTFNSGA YGSHGPGVTN VGTAALTSLY RCENVRLEGR
CIYTNSPIAG AFRGYGVVQT YYALDLMMDE AAEKLGFDPA EFKLMNAVRE GDMAPSGHPI
VGHGLVDCIR RVMEETNWHE LRRREKPETV KRRGIGIGCE MHGSSAYPGI KEQGNAIVKM
NEDGTVTLIT GTAGLGTGAH TALSQIVAEE LGVPFEAVSV VQGDTDMVPW DIGAFASHTT
YLGGRAAQLA AADVRRQVLE HAAPLLKAEP GDLAIRDGFV VVANGSNQSL RLSEAVGPQR
GMPAVQLVGV GTYMPTKSYS FAAHFAEVEV DTETGEVAVL EVVPVHEIGR VIHPIAAAGQ
IEGGIQQGIG HTLSEDYVID LTDGRSLNPS FVDYKMPLSM DMPSIRTIII ETAPDPGGPY
GAKGVGEDPI IAIGPAIANA IYDAIGVRFH HYPITPEQVL NALKTKANET RQ
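
The relationship between the gene sequence and the protein sequence above can be sketched in Python as follows. This is a minimal, dependency-free illustration, not the database's own pipeline. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAACACGC ATGAGAGAAT CGACAGAGAT"
print(translate(first_blocks))  # MNTHERIDRD
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.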