Gene Rleg2_1298 details


Gene Information

Locus tag: Rleg2_1298
Symbol:
ID: 6980022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1321032
End bp: 1322336
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 65%
IMG OID: 643396015
Product: dihydroorotase
Protein accession: YP_002280818
Protein GI: 209548901
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
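
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the gene record above.
start_bp = 1321032
end_bp = 1322336
gene_length_bp = 1305
protein_length_aa = 434

# Assumption: start and end coordinates are both inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = span_bp // 3
coding_codons = codon_count - 1

print("span (bp):", span_bp)
print("codons excluding stop:", coding_codons)
print("consistent with record:",
      span_bp == gene_length_bp and coding_codons == protein_length_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.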


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0828078
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00606747
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCAACC CGATCGTCCT CCAGAACGTC CGCATCGTCG ATCCCTCGCG CAATCTCGAC 
GAGGTGGGCA CGATCATCGC CGCAAACGGC GTGATCCTTG CTGCCGGAAG CGCCGCGCAG
AACCAGGGCG CACCCGAAGG TGCCGATATC CGCGACTGCA CCGGCCTTGT CGCCACACCG
GGCCTCGTCG ATGCGCGCGT CCATATCGGC GAACCCGGCG GCGAGCACCG CGAGACGATC
GCCTCGGCAA GCCGGGCAGC GGCCGCCGGC GGCGTCACCT CGATCATCAT GATGCCGGAC
ACAGATCCCG TGATCGACGA CATCGCCCTC GTCGAATTCG TCAAGAAGAC GGCGCGCGAT
ACCGCCGCCG TCAACGTCTA TCCGGCCGCC GCCCTCACCA AGGGACTTTC CGGCGAGGAA
ATGACCGAGA TCGGCCTGCT GATGCAGGCA GGCGCCGTCG CCTTCACCGA TGCCCATTCC
AGCGTCCACG ATACGCAGGT GCTGCGCCGG ATCATGACCT ATGCGCGCGA ATTCGGCGCC
GTCATCTGCT GCGAAACCCG CGACAAATAT CTCGGCGCCA ACGGCGTCAT GCATGAGGGG
CTTTTCGCCA GCTGGCTCGG GCTCTCCGGC ATTCCAAGAG AAGCCGAGCT CATCCCGCTC
GAACGCGATC TCAGGATCGC AGAGCTGACA CGCGGGCGTT ATCATGCCGC GATGATCTCG
GTGCCGCAAT CGGTCGAGGC GATCGAACGC GCCCGCAGCC GTGGCGCCAA AGTGACCTCG
GGCATCTCGA TCAACAATCT GGCGCTCAAC GAAAACGACA TCGGCGAATA TCGCACCTTC
TTCAAGCTCT ATCCGCCGCT GCGCCCGGAA GACGACCGGG TCGCCATGGT CGAGGCGCTG
GCAAGCGGCG CGATCGATAT CATCGTCTCC TCGCACGACC CGCAGGACGT CGATACGAAG
CGCCTGCCCT TCGGCGAGGC TGAAGACGGC GCGGTCGGCC TCGAAACCAT GCTGGCGGCG
GCTCTCAGGC TTCACCATGG CGGCCAGGTG AGCCTGATGC GCCTGATCGA CGCCATGTCC
ACCCGCCCGG CCGAGATTTT CGGCCTCCCC GCCGGCACGC TGAAGCCGGG GGCTGCGGCC
GATATCGCGC TGATCGATCT CGATGAGCCT TGGCTTGTCG CCAAAGACAT GCTTCTCTCC
CGCTCGAAGA ACACGCCGTT CGAAGATGCG CGCTTCAGCG GGCGGGCAAT CGCGACATAC
GTCTCGGGAA GCTTGTCCAT GCACTCTAGG ACGCGGCTGG ACTGA
 
Protein sequence
MSNPIVLQNV RIVDPSRNLD EVGTIIAANG VILAAGSAAQ NQGAPEGADI RDCTGLVATP 
GLVDARVHIG EPGGEHRETI ASASRAAAAG GVTSIIMMPD TDPVIDDIAL VEFVKKTARD
TAAVNVYPAA ALTKGLSGEE MTEIGLLMQA GAVAFTDAHS SVHDTQVLRR IMTYAREFGA
VICCETRDKY LGANGVMHEG LFASWLGLSG IPREAELIPL ERDLRIAELT RGRYHAAMIS
VPQSVEAIER ARSRGAKVTS GISINNLALN ENDIGEYRTF FKLYPPLRPE DDRVAMVEAL
ASGAIDIIVS SHDPQDVDTK RLPFGEAEDG AVGLETMLAA ALRLHHGGQV SLMRLIDAMS
TRPAEIFGLP AGTLKPGAAA DIALIDLDEP WLVAKDMLLS RSKNTPFEDA RFSGRAIATY
VSGSLSMHSR TRLD
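
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The following is a minimal sketch, not the database's own procedure. The function names are hypothetical, the codon table is written out by hand in the conventional T, C, A, G ordering, and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). Only the first row of the gene sequence is used as input.

```python
from itertools import product

# Amino acids for all 64 codons in T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string in frame, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied from the record above.
first_row = ("ATGAGCAACC CGATCGTCCT CCAGAACGTC "
             "CGCATCGTCG ATCCCTCGCG CAATCTCGAC")

print(translate(first_row))
print(round(gc_fraction(first_row), 2))
```

Passing the complete gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the listed protein sequence and GC content.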