Gene Rleg2_0129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0129 
Symbol 
ID6978839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp121254 
End bp122294 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content64% 
IMG OID643394840 
Productdihydroorotase 
Protein accessionYP_002279657 
Protein GI209547740 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0418] Dihydroorotase 
TIGRFAM ID[TIGR00856] dihydroorotase, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.969204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCGA TCACCATCCG CCGCCCTGAC GACTGGCACC TGCATCTGCG CGATGGCGCG 
ATGCTCGAAG GCGTGATCGC CGATACGAGC CGCACCTTCG CCCGCGCCAT CATCATGCCC
AATCTCGTGC CGCCCGTCGT CACCACGTCG GATGCGACGG CCTATCGCGA GCGCATTCTG
AAGGCGCTGC CGGCCGGCCA TCGTTTCCAG CCGCTGATGA CGCTTTATCT CACCGAGCAT
ACCAGCCCCG ACGATGTCGA AGCCGGTGCC AGGAGCGGCC TCATCACTGC CGTCAAGCTT
TATCCGGCCG GCGCCACCAC CAATTCGCAT GGCGGCGTCC GCGACATGGA AAAGGCGATG
CCGGTGCTGG AGCGCATGGC TGCGATTGGC CTGCCGCTCT GCGTCCATGG CGAAGTGACG
ACGCCTGAGG TCGATATCTT CGATCGCGAA GCCGTCTTCA TCGATACCGT GCTCGATCCG
CTGCGCCGGC GCCTGCCGGA GCTGAAGGTG ACGATGGAGC ATGTGACGAC ATCGGACGGT
ATCGACTACA TCAAGGCGGC CAAGGCCAAT CTTGCCGGCT CGATCACCAG CCATCATCTC
ATCATCAACC GCAACGCCAT CCTCGTCGGC GGTATCCGCC CGCATTATTA TTGCCTGCCG
GTCGCCAAGC GCGAGAACCA TCGGTTGGCG CTGCGCGCCG CCGCCGTGAG CGGTGACGCC
CGCTTCTTCC TCGGCACCGA TTCCGCCCCG CATGTCGACC CGCTGAAGGA ATGCGCCTGC
GGCTGCGCCG GCATCTACAC CTCGATCAAC ACGATGAGCT GCCTTGCGCA TGTCTTCGAG
GAGGAGGACG CGCTGGACAG GCTCGAAGCC TTCACCTCGC TGAACGGACC GGCCTGGTAC
GGGCTTCAGC CGAACGAGGA GCGCATCACC CTGTCCAGGC AGGCCGAGCC GGTCGTTTTT
CCCGCCAAGA TAGAAACCGG CGCCGGTTCG GTGACGGTGT TCGATCCGAT GTATCCGCTG
CATTGGCACG TTGTGGCGTA G
 
Protein sequence
MQSITIRRPD DWHLHLRDGA MLEGVIADTS RTFARAIIMP NLVPPVVTTS DATAYRERIL 
KALPAGHRFQ PLMTLYLTEH TSPDDVEAGA RSGLITAVKL YPAGATTNSH GGVRDMEKAM
PVLERMAAIG LPLCVHGEVT TPEVDIFDRE AVFIDTVLDP LRRRLPELKV TMEHVTTSDG
IDYIKAAKAN LAGSITSHHL IINRNAILVG GIRPHYYCLP VAKRENHRLA LRAAAVSGDA
RFFLGTDSAP HVDPLKECAC GCAGIYTSIN TMSCLAHVFE EEDALDRLEA FTSLNGPAWY
GLQPNEERIT LSRQAEPVVF PAKIETGAGS VTVFDPMYPL HWHVVA