Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1390 |
Symbol | |
ID | 8012483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1379420 |
End bp | 1380709 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644823975 |
Product | dihydroorotase |
Protein accession | YP_002975221 |
Protein GI | 241204125 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.234318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0725685 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACC CGATCGTCCT CAAGAACGTC CGCATCATCG ACCCGTCGCG CAATCTCGAC GAGGTGGGGA CGATCATTGC CGAAAACGGC GTGATTCTCG CCGCCGGCGG CAAGGCGCAG AACCAGGGCG CGCCTGATGG AGCCGTCATC CGCGACTGCA CGGGCCTTGT CGCGACGCCC GGCCTCGTCG ATGCGCGCGT CCATGTCGGC GAACCCGGCG GCGAACACCG TGAGACGATC GCCTCGGTGA GCCGGGCGGC CGCCGCCGGC GGCGTCACCT CGATCATCAT GATGCCGGAC ACCGATCCCA TCATCGACGA CATCGCACTC GTCGAATTCG TCAAGAAGAC GGCGCGGGAT ACGGCCGCCG TCAACGTCTA TCCGGCAGCC GCCATCACCA AGGGCCTTGC CGGCGAGGAG ATGACGGAGA TCGGCCTGTT GATGCAGGCA GGCGCCGTCG CCTTTACCGA TGCCCATTCC AGCGTCCACG ACACACAGGT GCTGCGCCGG ATCATGACCT ATGCGCGCGA ATTCGGCGCC GTCATCTGCT GCGAAACACG CGACAAATAT CTCGGCGCCA ACGGCGTCAT GCATGAGGGG CTTTTCGCCA GCTGGCTCGG GCTTTCCGGC ATTCCAAAGG AAGCCGAGCT CATCCCGCTC GAACGCGATC TGAGGATCGC GCAGCTGACG CGCGGCCGTT ATCACGCCGC GATGATCTCG GTGCCGGAAT CGGTCGAGGC GATCGAGCGC GCCCGCAGCC GCGGCGCCAA GGTGACCTGC GGCATCTCGA TCAACAATCT GGCGCTCAAC GAAAACGACA TCGGCGAATA CCGCACCTTC TTCAAGCTCT ATCCGCCGCT GCGCCCGGAA GACGACCGGG TGGCGATGGC CGACGCCCTT GCGAGCGGCG CGATCGATAT CATCGTCTCC TCGCACGACC CGCAGGATGT CGATACGAAG CGCCTGCCCT TCGGCGAGGC GGAGGACGGC GCGATCGGCC TCGAAACCAT GCTAGCGGCA GCCCTCAGGC TTCATCATGG CGGCCAGGTG AGCCTGATGC GTCTGATCGA CGCCATGTCG ACCCGTCCCG CTCAGATTTT CGGCCTGAAT GCCGGCACGC TGAAGCCGGG CGCTGCGGCT GATATCGCGT TGATCGATCT CGATGAGCCT TGGCTTGTCG CCAAAGACAT GCTTCTCTCC CGCTCGAAGA ACACTCCGTT CGAGGATGCG CGCTTCAGTG GGCGGGCGGT TGCGACATAC GTCTCGGGAA AGCTTGTCCA CGCAATTTAG
|
Protein sequence | MSNPIVLKNV RIIDPSRNLD EVGTIIAENG VILAAGGKAQ NQGAPDGAVI RDCTGLVATP GLVDARVHVG EPGGEHRETI ASVSRAAAAG GVTSIIMMPD TDPIIDDIAL VEFVKKTARD TAAVNVYPAA AITKGLAGEE MTEIGLLMQA GAVAFTDAHS SVHDTQVLRR IMTYAREFGA VICCETRDKY LGANGVMHEG LFASWLGLSG IPKEAELIPL ERDLRIAQLT RGRYHAAMIS VPESVEAIER ARSRGAKVTC GISINNLALN ENDIGEYRTF FKLYPPLRPE DDRVAMADAL ASGAIDIIVS SHDPQDVDTK RLPFGEAEDG AIGLETMLAA ALRLHHGGQV SLMRLIDAMS TRPAQIFGLN AGTLKPGAAA DIALIDLDEP WLVAKDMLLS RSKNTPFEDA RFSGRAVATY VSGKLVHAI
|
| |