Gene Smed_0161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0161 
Symbol 
ID5320991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp177449 
End bp178537 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content66% 
IMG OID640789094 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001325855 
Protein GI150395388 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.46747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.733228 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGCG GGCTGGAGCA CCTCGCGCGG CGCGGCCTCT TCCTCTTCGA CCCGGAAGCG 
GCTCATGGCC TTTCGATCAC AGCGCTGAAA ACCGGCCTCG TACCGAGCTG CGCCGCCCCG
GCCGACCCAC GCCTCCAGCA GAGCGTTGCG GGCCTCGCCT TTCCCAATCC GGTCGGCATG
GCGGCCGGCT ACGACAAGAA TGCCGAGGTG CCGGAAGCCT TGCTGAAGAT CGGTTTCGGT
TTTACCGAAA TCGGCACAGT GACGCCGAGA CCGCAGCCCG GCAACGACAA GCCGCGGCTT
TTCCGGCTCA TCGAGGACGA GGCGGTGATC AACCGCCTCG GCTTCAACAA TGAGGGACAT
GGTGCGGCGC TGGCGCGGCT CAAGGCCTGC TCGCGCGAGG CGCTGATCGG CGTCAATATC
GGCGCCAACA AGGACAGTGC CGACCGCATT GCCGATTACG TGACCGGCAT CCGGACCTTC
TACGCGGTCG CCCGCTACTT CACCGCCAAC ATCTCCTCGC CCAACACGCC GGGCCTGCGC
GACCTGCAGG CGCGCGAGAG CCTCGCGACG CTGCTTTCGG CGGTGCTTGC CGCCCGCGAA
GATGAAGCGG GAAAGTGCGG GCGGCGGGTT CCGGTCTTCC TCAAGATCGC TCCGGACCTG
ACCGAGGAGG GCATGGACGA CATCGCGGCG GAAGTGCTGG CGCAGGGTCT CGACGGCTTG
ATCGTATCCA ACACGACGCT CGCGCGCGCG CGTCTCCGGG ACAGGAAACA GGCCAGTGAG
GTCGGCGGGC TCTCTGGAAA GCCGCTATTC GAGAAGTCGA CCGCGGTTCT TGCCCGGATG
CGCAGGCGCG TGGGCCCCGA CTTGCCGATC ATCGGCGTCG GCGGCGTGAG CTCGGCGGAA
ACCGCTGCGG AAAAGATCCG CGCCGGAGCC GATCTGGTAC AGCTTTACTC CTGCATGGTC
TATGAGGGGC CGAGCTTGCC GGGCCGTATC GTGCGCGGCC TCTCCGCGCT TTGCGACCGC
GAGAAGCTTG CCTCGATCCG GGAGATCCGC GACAGCCGGG TCGATTACTG GACTGGCATG
AACGTCTGA
 
Protein sequence
MIGGLEHLAR RGLFLFDPEA AHGLSITALK TGLVPSCAAP ADPRLQQSVA GLAFPNPVGM 
AAGYDKNAEV PEALLKIGFG FTEIGTVTPR PQPGNDKPRL FRLIEDEAVI NRLGFNNEGH
GAALARLKAC SREALIGVNI GANKDSADRI ADYVTGIRTF YAVARYFTAN ISSPNTPGLR
DLQARESLAT LLSAVLAARE DEAGKCGRRV PVFLKIAPDL TEEGMDDIAA EVLAQGLDGL
IVSNTTLARA RLRDRKQASE VGGLSGKPLF EKSTAVLARM RRRVGPDLPI IGVGGVSSAE
TAAEKIRAGA DLVQLYSCMV YEGPSLPGRI VRGLSALCDR EKLASIREIR DSRVDYWTGM
NV