Gene Smed_5507 details


Gene Information

Locus tag: Smed_5507
Symbol:
ID: 5319809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 472946
End bp: 474895
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 63%
IMG OID: 640777262
Product: putative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein
Protein accession: YP_001314194
Protein GI: 150377599
COG category: [H] Coenzyme transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme; [COG1910] Periplasmic molybdate-binding protein/domain
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
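
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the record; names are illustrative only.
start_bp = 472946
end_bp = 474895


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1950 649"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.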


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAAAA ATCCTTCTTC AATCCCGCGA AACACCGACC AGAAGCAATT TCTGACGATA 
TTGTCACGTG AGGAGGCCCT GGCGCGTTTC GAAGCAGCGC TGTTTCCCTG GCCGGTACCG
ACCGAAACAT TGAGGTTAGC CGAGGCGCTC GGGCTGGCTC TTGCCGAAGA CGTCGTGGCG
CAGGTCGACG TTCCACCCTT CGACCGTTCC AATGTCGACG GGTTTGCGGT GCGAGCAGCA
GATCTGGTTG CCGCCTCTGA CCTACAGCCT GTTCGATTGA CGCTGAACAT GGAGACGATT
GCCTGCGGCA GCCTCCCGCA ACTCGCGGTA TTGCCCGGGA CCGCCACAGC CATCGCCACG
GGTGGCCCTC TGCCGCGCGG TGCGGACGCG ATCGTCATGG TGGAATATAC TCAGCCAGCA
GAAGGTGCGG CCATAGAAGT GTGGCGTGCT GTCTCGCCGG GGCAGTTCGT CTCCAGCGCC
GGCTCGGACA TGGCGCGAGG AGAAGTCGTG TTGCGGGCTG GGTCGGTGAT CGGTGCGCGT
GAGATCGGGA TACTCGCAGC CTGCGGAACC GCGCAGGTAA CGGTGGCACG CAAACTGCGC
GTTGCAGTTC TGTCCACCGG TGATGAGCTC GTGCAACCGG GCGAACCGCT GCAGCCGGCC
GGCATTTATG ACGCCAATGG GCCGATCGTA AGTGCGGCGG TCACCGAAAA CGGGGGCGAG
GCCTATTTTT TGGGTGCGTT TCCAGACGAC GAAGCCAGGC TCGAGACCGC GATGCGCGAG
GCGCTCGACT CACATGACGT CCTGATTCTG TCCGGAGGCA CTTCCAAGGG TGCGGGAGAT
GTCAGCTACC GTATTATCGG CCGTCTCGGA CAACCGGGCA TCATCGCTCA CGGTGTTGCG
CTCAAGCCCG GAAAGCCGCT CTGTCTGGCG GTATGTGACG GCAAGCCGGT TATCATCTTG
CCAGGGTTCC CGACCTCGGC CATGTTCACC TTCCACGATA TGGTGGTGCC CATCCTGCGG
CGCATGGCGG GGCTGCCGCC GCGCGTCGAT GCGCAAACGA GTGCGCAGCT TCCGTTCCGT
GTTCCGTCCG AACTGGGGCG CACCGAGTTT GTCATGGTCT CGCTGGTGCA AGGGCGGGAT
GGACTGATGG CCTATGCGAC CGGCAAGGGC TCGGGAGCGA TTACCGCCTT CGCCCAGGCT
GACGGTTTTA TTCGCATTGA CGCCTTCGTC GACCATCTGC CAGCGGGCGC ACAACTGCCG
GTGACGCTGT TCACGCCGCA GGTCAAGGTG CCCGACCTCG TTGTGATCGG CAGTCACTGC
ACGGGCCTTG ATCTCGTCGT GGGCAAAATC GCGCGCCAAG GGATTTCGGT GCGATCGTTG
GCCGTCGGAA GCCTGGGCGG TCTTGCCGCC GCCAAACGCT GTGAGTGCGA CCTTGCACCG
ATCCATCTTT TCGATCCTCA GACACAGGTC TACAACACGC CATTCCTGGG CGAGGGTATG
GAGCTTGTGC CGGGCTGGCG GCGCATACAA GGCATCGTGT TTCGCCGCGG CGACGCGCGC
TTCGAAGATC ATGCTGCGCC GGAGGCGGTG GAAGCGGCTC TGGCTGATCC CGAATGCATG
ATGGTCAATC GAAACCAGGG TGCCGGTACG CGCATTCTGA TCGACCAATT GCTCGGCCAG
AAGCGCCCTG ATGGCTACTG GAACCAGCCT CGTTCACACA ATGCAGTCGC CGCCGCTGTC
GTACAGAAAC GCGCCGATTG GGGCGTCACG ATCGGGCCGG TCGCGCGCGC GGCTGGCCTG
GGATTCATAC CGCTGACGCA AGAGCACTTT GATTTCGCGC TGGTTGCGGA CCGGAAGGAG
AGGGTGGCAG TTCAGGCATT CCTTGCCGCC TTGCTGTCGC CTGACATGCA GGAGGCGCTG
GAGCGGGCGG GATTCAGTCG AGCTTGCTAA
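
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. As a rough sketch of how a figure such as the GC content field could be recomputed, the helper names below are hypothetical, and the example only processes the first line of the sequence rather than the full gene.

```python
def clean_sequence(block):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTGAAAA ATCCTTCTTC AATCCCGCGA AACACCGACC AGAAGCAATT TCTGACGATA"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.1f}%")
```

Passing the whole gene sequence to `clean_sequence` and then to `gc_fraction` would give the gene-wide value for comparison with the record.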
 
Protein sequence
MVKNPSSIPR NTDQKQFLTI LSREEALARF EAALFPWPVP TETLRLAEAL GLALAEDVVA 
QVDVPPFDRS NVDGFAVRAA DLVAASDLQP VRLTLNMETI ACGSLPQLAV LPGTATAIAT
GGPLPRGADA IVMVEYTQPA EGAAIEVWRA VSPGQFVSSA GSDMARGEVV LRAGSVIGAR
EIGILAACGT AQVTVARKLR VAVLSTGDEL VQPGEPLQPA GIYDANGPIV SAAVTENGGE
AYFLGAFPDD EARLETAMRE ALDSHDVLIL SGGTSKGAGD VSYRIIGRLG QPGIIAHGVA
LKPGKPLCLA VCDGKPVIIL PGFPTSAMFT FHDMVVPILR RMAGLPPRVD AQTSAQLPFR
VPSELGRTEF VMVSLVQGRD GLMAYATGKG SGAITAFAQA DGFIRIDAFV DHLPAGAQLP
VTLFTPQVKV PDLVVIGSHC TGLDLVVGKI ARQGISVRSL AVGSLGGLAA AKRCECDLAP
IHLFDPQTQV YNTPFLGEGM ELVPGWRRIQ GIVFRRGDAR FEDHAAPEAV EAALADPECM
MVNRNQGAGT RILIDQLLGQ KRPDGYWNQP RSHNAVAAAV VQKRADWGVT IGPVARAAGL
GFIPLTQEHF DFALVADRKE RVAVQAFLAA LLSPDMQEAL ERAGFSRAC
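
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below assumes the codon assignments of translation table 11, which match the standard genetic code; it does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG. All names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block):
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGGTGAAAA ATCCTTCTTC AATCCCGCGA"))  # prints "MVKNPSSIPR"
```

The result matches the first block of the protein sequence. Translating the full gene sequence the same way would allow a comparison against all 649 residues.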