Gene Rleg_6703 details


Gene Information

Locus tag: Rleg_6703
Symbol:
ID: 8022613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012858
Strand:
Start bp: 131638
End bp: 132846
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 62%
IMG OID: 644833570
Product: dihydroorotase
Protein accession: YP_002984704
Protein GI: 241666620
COG category: [R] General function prediction only
COG ID: [COG3964] Predicted amidohydrolase
TIGRFAM ID:
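
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon, neither of which is stated in the record. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(131638, 132846)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed 1209 bp and 402 aa.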


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.235788
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGGCG ACCAGGCGAA GAAGCCGCTT CTCCTCATCA ATGTCAAACC GATGGCTTTC 
GGTTCTGGCC CATCCGAGGG GGCGACCGAC ATCCTCGTCA ATGCCGACGG CAAGATCGCC
GAGATCGGTC CGTCGCTTAC CGTCTCGCAG GATGTGACGC GCATCGACGG CAAAGGCGCC
TTCATTTCGC CGGGCTGGGT CGATCTGCAC GTGCATATCT GGCACGGCGG CACAGATATT
TCCATCCGCC CGTCCGAATG CGGCCTCGAG CGCGGCGTCA CTACGCTCGT CGATGCCGGT
TCGGCTGGTG AGGCGAATTT CCATGGCTTC CGCGAATATA TCATCGAGCC CTCGCGCGAA
CGCATCAAGG CCTTCCTCAA TCTCGGCTCC ATAGGCCTCG TCGCCTGCAA CCGCGTCGCG
GAATTGCGGG ATATCCGCGA TATCGATCTC GACCGCATCC TCGAAGTCTA TGCCGCAAAC
AGCGAGCACA TCGTCGGCAT CAAGGTGCGC GCCAGCCATG TCATCACCGG ATCCTGGGGC
GTCACCCCGG TCAAGCTCGG CAAGAAGATC GCCAAGATCT TGAAAGTGCC GATGATGGTG
CATGTCGGCG AGCCGCCGGC GCTCTATGAC GAGGTGCTGG AGATCCTCGG CCCCGGCGAC
GTCGTCACCC ACTGCTTCAA CGGCAAGGCC GGCTCGAGCA TCATGGAAGA CGAGGACCTC
TTTAATCTCG CCGAGCGCTG CGCCTCCGAA GGCATCCGTC TCGACATCGG CCATGGTGGC
GCCTCCTTCT CCTTCAAGGT CGCCGAGGCG GCGATCGCGC GCGGGCTTCT GCCGTTCTCG
ATCTCTACCG ACCTGCACGG CCATTCGATG AACTTCCCGG TCTGGGACCT GGCGACGACG
ATGTCGAAGC TGCTCAGTGT CGGCATGCCT TTCGACAAAG TGGTGGAAGC CGTCACCCAT
GCGCCGGCAT CCGTCATCAA GCTGTCGATG GAGAACCGGC TCGCGGTCGG TTCGCAAGCC
GAATTTACCA TTTTCGACCT GGTCGATTCC GATCTCGAGG CGACGGATTC CAACGGCGAC
GTTTCGGTCC TCAATAAGCT GTTCGAGCCG CGCTATGCGG TGATGGGAGC CGATGCCTTT
GCCGCCAGCC GCTACGTGCC GCGGGCGCGC AAGCTGGTGC GCCATAGCCA CGGCTATTCC
TACAGGTAG
 
Protein sequence
MPGDQAKKPL LLINVKPMAF GSGPSEGATD ILVNADGKIA EIGPSLTVSQ DVTRIDGKGA 
FISPGWVDLH VHIWHGGTDI SIRPSECGLE RGVTTLVDAG SAGEANFHGF REYIIEPSRE
RIKAFLNLGS IGLVACNRVA ELRDIRDIDL DRILEVYAAN SEHIVGIKVR ASHVITGSWG
VTPVKLGKKI AKILKVPMMV HVGEPPALYD EVLEILGPGD VVTHCFNGKA GSSIMEDEDL
FNLAERCASE GIRLDIGHGG ASFSFKVAEA AIARGLLPFS ISTDLHGHSM NFPVWDLATT
MSKLLSVGMP FDKVVEAVTH APASVIKLSM ENRLAVGSQA EFTIFDLVDS DLEATDSNGD
VSVLNKLFEP RYAVMGADAF AASRYVPRAR KLVRHSHGYS YR
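
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows, using only the Python standard library. The names `CODON_TABLE`, `clean` and `translate` are hypothetical helpers introduced here for illustration. The amino acid assignments in table 11 are the same as in the standard code; table 11 differs in the start codons it permits, which this sketch does not model (the gene above starts with ATG, so that does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest),
# as used by NCBI translation table 11; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCCCGGCG ACCAGGCGAA GAAGCCGCTT"))
print(translate("TCCTACAGGTAG"))
```

Pasting the full gene sequence into `translate` and comparing the result with `clean` applied to the protein sequence is one way to check the two blocks against each other.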