Gene Rleg2_5803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5803 
Symbol 
ID6977192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp212289 
End bp213503 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content62% 
IMG OID643393258 
Productdihydroorotase 
Protein accessionYP_002278076 
Protein GI209546186 
COG category[R] General function prediction only 
COG ID[COG3964] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.306417 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.050491 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGT CCGGCAACCA GGCGAAGAAG CCGCTCCTCC TCACCAATGT GAAACCGATG 
GCTTTCGGTG CGGGGACGCC GGAAGGGCCC GTCGACATTC TCGTCGATGG CGACGGCAGG
ATCGCAAGGA TCGGTCCGGC GCTTGCCGTT TCTGAGGATG TGACCCGCAT CGACGGCAAG
GGCGCCTTCG TCTCGCCGGG CTGGATCGAC CTGCATGTGC ATATCTGGCA TGGGGGCACC
GACATTTCCA TTCGTCCCTC CGAATGCGGT CTCGAGCGCG GCGTCACCAC GCTGGTCGAT
GCCGGTTCGG CCGGCGAGGC GAATTTCCAC GGCTTCCGCG AATATATCAT CGAGCCCTCA
CGCGAGCGTA TCAAGGCCTT CCTGAACCTC GGCTCGATCG GCCTCGTCGC CTGCAACCGT
GTCGCCGAAC TCAGGGATAT CAGAGATATC GATCTCGACC GCATCCTCGA AGTCTATGCC
GAAAACAGCG AGCACATCGT CGGCATCAAG GTGCGCGCCA GCCATGTGAT CACCGGCTCC
TGGGGTGTGA CCCCCGTCAA GCTCGGCAAG AAGATCGCCA AGATCCTGAA AGTGCCGATG
ATGGTGCATG TCGGCGAACC GCCGGCGCTC TATGACGAAG TGCTGGAGAT TCTCGGCCCC
GGCGATGTCG TCACCCACTG CTTCAACGGC AAAGCGGGGT CGAGCATCAT GGAGGACGAG
GATCTTTTTA ATCTCGCCGA GCGCTGCGCC TCCGAGGGCA TTCGTCTCGA CATCGGCCAT
GGCGGCGCCT CCTTCTCTTT CAAGGTGGCG GAAGCGGCAA TTGCGCGCGG GCTGCTGCCG
TTCTCGATCT CGACCGACCT GCACGGCCAT TCGATGAACT TCCCGGTCTG GGACCTGGCG
ACGACGATGT CGAAGCTGCT CAGCGTCGGC ATGCCCTTCG ACAAGGTGGT CGAGGCCGTC
ACCCATGCTC CGGCATCGGT CATCAAGCTG TCGATGGAGA ACCGGCTTGC CGTCGGCGCG
CAAGCCGAAT TCACGATTTT CGACCTCGTC GATTCCGACC TTGAGGCGAC GGATTCCAAC
GGCGACGTCT CGGTCTTGAA CAAACTGTTC GAGCCGCGTT ACGCGGTGAT AGGTACCGAT
GCCGTTACCG CCAGCCGCTA TGTGCCGCGG GCACGCAAGC TGGTGCGCCA CAGCCACGGT
TATTCCTACC GGTAG
 
Protein sequence
MSMSGNQAKK PLLLTNVKPM AFGAGTPEGP VDILVDGDGR IARIGPALAV SEDVTRIDGK 
GAFVSPGWID LHVHIWHGGT DISIRPSECG LERGVTTLVD AGSAGEANFH GFREYIIEPS
RERIKAFLNL GSIGLVACNR VAELRDIRDI DLDRILEVYA ENSEHIVGIK VRASHVITGS
WGVTPVKLGK KIAKILKVPM MVHVGEPPAL YDEVLEILGP GDVVTHCFNG KAGSSIMEDE
DLFNLAERCA SEGIRLDIGH GGASFSFKVA EAAIARGLLP FSISTDLHGH SMNFPVWDLA
TTMSKLLSVG MPFDKVVEAV THAPASVIKL SMENRLAVGA QAEFTIFDLV DSDLEATDSN
GDVSVLNKLF EPRYAVIGTD AVTASRYVPR ARKLVRHSHG YSYR