Gene Rleg_1390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1390 
Symbol 
ID8012483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1379420 
End bp1380709 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content64% 
IMG OID644823975 
Productdihydroorotase 
Protein accessionYP_002975221 
Protein GI241204125 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.234318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0725685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACC CGATCGTCCT CAAGAACGTC CGCATCATCG ACCCGTCGCG CAATCTCGAC 
GAGGTGGGGA CGATCATTGC CGAAAACGGC GTGATTCTCG CCGCCGGCGG CAAGGCGCAG
AACCAGGGCG CGCCTGATGG AGCCGTCATC CGCGACTGCA CGGGCCTTGT CGCGACGCCC
GGCCTCGTCG ATGCGCGCGT CCATGTCGGC GAACCCGGCG GCGAACACCG TGAGACGATC
GCCTCGGTGA GCCGGGCGGC CGCCGCCGGC GGCGTCACCT CGATCATCAT GATGCCGGAC
ACCGATCCCA TCATCGACGA CATCGCACTC GTCGAATTCG TCAAGAAGAC GGCGCGGGAT
ACGGCCGCCG TCAACGTCTA TCCGGCAGCC GCCATCACCA AGGGCCTTGC CGGCGAGGAG
ATGACGGAGA TCGGCCTGTT GATGCAGGCA GGCGCCGTCG CCTTTACCGA TGCCCATTCC
AGCGTCCACG ACACACAGGT GCTGCGCCGG ATCATGACCT ATGCGCGCGA ATTCGGCGCC
GTCATCTGCT GCGAAACACG CGACAAATAT CTCGGCGCCA ACGGCGTCAT GCATGAGGGG
CTTTTCGCCA GCTGGCTCGG GCTTTCCGGC ATTCCAAAGG AAGCCGAGCT CATCCCGCTC
GAACGCGATC TGAGGATCGC GCAGCTGACG CGCGGCCGTT ATCACGCCGC GATGATCTCG
GTGCCGGAAT CGGTCGAGGC GATCGAGCGC GCCCGCAGCC GCGGCGCCAA GGTGACCTGC
GGCATCTCGA TCAACAATCT GGCGCTCAAC GAAAACGACA TCGGCGAATA CCGCACCTTC
TTCAAGCTCT ATCCGCCGCT GCGCCCGGAA GACGACCGGG TGGCGATGGC CGACGCCCTT
GCGAGCGGCG CGATCGATAT CATCGTCTCC TCGCACGACC CGCAGGATGT CGATACGAAG
CGCCTGCCCT TCGGCGAGGC GGAGGACGGC GCGATCGGCC TCGAAACCAT GCTAGCGGCA
GCCCTCAGGC TTCATCATGG CGGCCAGGTG AGCCTGATGC GTCTGATCGA CGCCATGTCG
ACCCGTCCCG CTCAGATTTT CGGCCTGAAT GCCGGCACGC TGAAGCCGGG CGCTGCGGCT
GATATCGCGT TGATCGATCT CGATGAGCCT TGGCTTGTCG CCAAAGACAT GCTTCTCTCC
CGCTCGAAGA ACACTCCGTT CGAGGATGCG CGCTTCAGTG GGCGGGCGGT TGCGACATAC
GTCTCGGGAA AGCTTGTCCA CGCAATTTAG
 
Protein sequence
MSNPIVLKNV RIIDPSRNLD EVGTIIAENG VILAAGGKAQ NQGAPDGAVI RDCTGLVATP 
GLVDARVHVG EPGGEHRETI ASVSRAAAAG GVTSIIMMPD TDPIIDDIAL VEFVKKTARD
TAAVNVYPAA AITKGLAGEE MTEIGLLMQA GAVAFTDAHS SVHDTQVLRR IMTYAREFGA
VICCETRDKY LGANGVMHEG LFASWLGLSG IPKEAELIPL ERDLRIAQLT RGRYHAAMIS
VPESVEAIER ARSRGAKVTC GISINNLALN ENDIGEYRTF FKLYPPLRPE DDRVAMADAL
ASGAIDIIVS SHDPQDVDTK RLPFGEAEDG AIGLETMLAA ALRLHHGGQV SLMRLIDAMS
TRPAQIFGLN AGTLKPGAAA DIALIDLDEP WLVAKDMLLS RSKNTPFEDA RFSGRAVATY
VSGKLVHAI