Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5803 |
Symbol | |
ID | 6977192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 212289 |
End bp | 213503 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393258 |
Product | dihydroorotase |
Protein accession | YP_002278076 |
Protein GI | 209546186 |
COG category | [R] General function prediction only |
COG ID | [COG3964] Predicted amidohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.306417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.050491 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATGT CCGGCAACCA GGCGAAGAAG CCGCTCCTCC TCACCAATGT GAAACCGATG GCTTTCGGTG CGGGGACGCC GGAAGGGCCC GTCGACATTC TCGTCGATGG CGACGGCAGG ATCGCAAGGA TCGGTCCGGC GCTTGCCGTT TCTGAGGATG TGACCCGCAT CGACGGCAAG GGCGCCTTCG TCTCGCCGGG CTGGATCGAC CTGCATGTGC ATATCTGGCA TGGGGGCACC GACATTTCCA TTCGTCCCTC CGAATGCGGT CTCGAGCGCG GCGTCACCAC GCTGGTCGAT GCCGGTTCGG CCGGCGAGGC GAATTTCCAC GGCTTCCGCG AATATATCAT CGAGCCCTCA CGCGAGCGTA TCAAGGCCTT CCTGAACCTC GGCTCGATCG GCCTCGTCGC CTGCAACCGT GTCGCCGAAC TCAGGGATAT CAGAGATATC GATCTCGACC GCATCCTCGA AGTCTATGCC GAAAACAGCG AGCACATCGT CGGCATCAAG GTGCGCGCCA GCCATGTGAT CACCGGCTCC TGGGGTGTGA CCCCCGTCAA GCTCGGCAAG AAGATCGCCA AGATCCTGAA AGTGCCGATG ATGGTGCATG TCGGCGAACC GCCGGCGCTC TATGACGAAG TGCTGGAGAT TCTCGGCCCC GGCGATGTCG TCACCCACTG CTTCAACGGC AAAGCGGGGT CGAGCATCAT GGAGGACGAG GATCTTTTTA ATCTCGCCGA GCGCTGCGCC TCCGAGGGCA TTCGTCTCGA CATCGGCCAT GGCGGCGCCT CCTTCTCTTT CAAGGTGGCG GAAGCGGCAA TTGCGCGCGG GCTGCTGCCG TTCTCGATCT CGACCGACCT GCACGGCCAT TCGATGAACT TCCCGGTCTG GGACCTGGCG ACGACGATGT CGAAGCTGCT CAGCGTCGGC ATGCCCTTCG ACAAGGTGGT CGAGGCCGTC ACCCATGCTC CGGCATCGGT CATCAAGCTG TCGATGGAGA ACCGGCTTGC CGTCGGCGCG CAAGCCGAAT TCACGATTTT CGACCTCGTC GATTCCGACC TTGAGGCGAC GGATTCCAAC GGCGACGTCT CGGTCTTGAA CAAACTGTTC GAGCCGCGTT ACGCGGTGAT AGGTACCGAT GCCGTTACCG CCAGCCGCTA TGTGCCGCGG GCACGCAAGC TGGTGCGCCA CAGCCACGGT TATTCCTACC GGTAG
|
Protein sequence | MSMSGNQAKK PLLLTNVKPM AFGAGTPEGP VDILVDGDGR IARIGPALAV SEDVTRIDGK GAFVSPGWID LHVHIWHGGT DISIRPSECG LERGVTTLVD AGSAGEANFH GFREYIIEPS RERIKAFLNL GSIGLVACNR VAELRDIRDI DLDRILEVYA ENSEHIVGIK VRASHVITGS WGVTPVKLGK KIAKILKVPM MVHVGEPPAL YDEVLEILGP GDVVTHCFNG KAGSSIMEDE DLFNLAERCA SEGIRLDIGH GGASFSFKVA EAAIARGLLP FSISTDLHGH SMNFPVWDLA TTMSKLLSVG MPFDKVVEAV THAPASVIKL SMENRLAVGA QAEFTIFDLV DSDLEATDSN GDVSVLNKLF EPRYAVIGTD AVTASRYVPR ARKLVRHSHG YSYR
|
| |