Gene Rleg_0222 details


Gene Information

Locus tag: Rleg_0222
Symbol:
ID: 8011447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 232042
End bp: 233130
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 65%
IMG OID: 644822815
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_002974072
Protein GI: 241202976
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
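
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 232042
end_bp = 233130

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1089
print(protein_length)  # 362
```

Under these assumptions the computed values agree with the listed gene length (1089 bp) and protein length (362 aa).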


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGATC CCTTCAAGCG TCTCGCCCGC AAGGGTCTCT TCCTGTTCGA TCCGGAAACG 
GCGCACGGCA TGTCGATCGC CGCCTTGAAA TCCGGTCTCG TGCCCGCCTG CCAGATCACC
CCCGATCCGC GCCTGCGCCA GACCGTTGCC GGCCTTACCT TCGAAAATCC GCTCGGCATG
GCCGCCGGCT ACGACAAGAA TGCCGAGGTG CCGGAGGCGC TGCTGAAGCT CGGTTTCGGC
TTTACCGAGA TCGGTACGGT GACGCCGAAG CCGCAATCCG GCAATCCGCG CCCGCGCATC
TTCCGCCTGG TCGAGGATGA AGCCGTCATC AACCGCCTCG GCTTCAACAA TGAAGGCCAT
GATGCCGCCT TCGGGCACCT CGCCGCGCTG AGGGGCGGGG GCATGATCGG CGTCAATATC
GGCGCCAACA AGGATAGCGA GGACCGCATC GCCGATTATG TCGCCGGCAT CCGCCGCTTT
TATTCCGTCG CGCGCTATTT CACCGCCAAC ATCTCCTCGC CGAACACCCC CGGCCTGCGC
GACCTGCAGG GGCGCGAAAG CCTTGCGGTG CTGTTATCAG CCGTGCTTGC GGCGCGTGAC
GAAATGGCAG CGGCATCCGG CCGGACGATC CCGGTCTTTC TGAAGATCGC GCCTGATCTG
ACCGAGGAAG GCATGGACGA TATCGCAGCC GAGGCGCTTT CGCATGGGCT CGACGGGCTG
ATCGTCTCCA ACACCACGCT GTCGCGCGAC GGCCTCAAGG ATCAGCGCCA GGCGAAGGAG
GCGGGTGGAC TTTCCGGCGT GCCGCTTTTC GAAAAGTCGA CGGCGGTGCT CGCCAGGATG
CGCAAGCGCG TCGGCCCTGA TCTGCCGATC ATCGGCGTCG GTGGCGTCTC CTCGGCCGAG
ACCGCGCTGG AGAAGATCAG GGCGGGCGCC GATCTCGTCC AGCTCTATTC CTGCATGGTC
TATGAAGGCC CCGGTCTGGC CGGCGATATC GTCCGCGGCC TGTCGAAACT CCTGGACCGC
GAAAAGGCCG CCTCGATCCG CGACCTGCGT GATGTCAGGC TGGATTATTG GGCGGCGCGG
AAGGTCTGA
 
Protein sequence
MIDPFKRLAR KGLFLFDPET AHGMSIAALK SGLVPACQIT PDPRLRQTVA GLTFENPLGM 
AAGYDKNAEV PEALLKLGFG FTEIGTVTPK PQSGNPRPRI FRLVEDEAVI NRLGFNNEGH
DAAFGHLAAL RGGGMIGVNI GANKDSEDRI ADYVAGIRRF YSVARYFTAN ISSPNTPGLR
DLQGRESLAV LLSAVLAARD EMAAASGRTI PVFLKIAPDL TEEGMDDIAA EALSHGLDGL
IVSNTTLSRD GLKDQRQAKE AGGLSGVPLF EKSTAVLARM RKRVGPDLPI IGVGGVSSAE
TALEKIRAGA DLVQLYSCMV YEGPGLAGDI VRGLSKLLDR EKAASIRDLR DVRLDYWAAR
KV