Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3671 |
Symbol | |
ID | 6982433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3800097 |
End bp | 3801263 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643398393 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002283160 |
Protein GI | 209551243 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA CCAGCGTTCG GCCGTGGCTG ATCAAGTCCG ATGCTTCTTA TTGGGGAGAG TTTCTGTTCG TCGAAGTGAC AACCGATGAG GGGGTGAGCG GCTGGGGAGA AATCACCACC ACGACAAGGC TCGCCAATCG CGCCCTCTGC ACGATCCTGC GGCAGATCGG TGCTGCTGTG ACAGGCGAGG ACCCGGCGCG TATCGAGTAT CTGTGGCACA AGATATTCCG CAGCTTCACC TACATGGGCA GTCGCGGCGC CGCAGTCGAA TGCGTCAGCG CCATCGATAT CGCCCTTTGG GATATCCGGG GCAAAGTTCT TGGCAAGCCG ATTTACGAGC TGTTGGGCGG ACCGGTACGC GATGAAATTG CGCTCTACAC CCACCCCAAC CAGGCCAAAT TTACCAGCAA TGAGGCGGTG ATCCGGGAAA TTCGAGACAT CGTCGAGTCC GGCCACACGG CGCTCAAGTT CGATCCTTTC CCCCACCAAG GCCGCACCGC CGATGGACAG GCTCGCGAAC AGCGGGACGG CTACCTCGAT GGCAGCATGA CCCGCAAGGA TGAGCGCGAG GCCGCCGAAC TGACCGCCCT CATCCGCGAA ACCGCGGGCC CTGATGTCGA CATCCTCATC GATGCGCATG GCCGGTTCGA CGTGCCCACC GCCATTCGCC TTTGTCGGAG CCTTGAGGAA GCTGGTCAGA TCGACTGGTT CGAGGAGCCT TGTCCACCGG AGAGCCTCAA CGCGCTCAGA CAGGTCCGTG AAAAGGTCAG CGCCGCTATC TCGTGGGGCG AGCGCGGCCA CACGAAGTGG GATTTCGTGC CGGTGCTCGA AAACAGGCTC GCTGACTACA TCATGCCTGA CGTCACCTGG ACCGGCGGCA TTACTGAACT GAAGAAGATT TCCGCCCTAT GCGAGGCCTA CTACATCCCG GTCTCGCCGC ATGACGCCGC AGGGCCGATC AACGTGGTTG CGGGCGCGCA AGTGATGATG ACCGTTCCCA ACTTCTACAA ACTCGAAACG TCGGAGTGGA ACCTGGGCAA ATACGATCAC CTCATCGACA GACCGCTCGA TGTTTCGAAC GGCAGCCTCA AGCTTACGCC CAAGCCTGGT CTCGGCGTCG AAATGAACCG CGACTACCTG CAAAACCACG AGATCGAGCT GGACTAG
|
Protein sequence | MKITSVRPWL IKSDASYWGE FLFVEVTTDE GVSGWGEITT TTRLANRALC TILRQIGAAV TGEDPARIEY LWHKIFRSFT YMGSRGAAVE CVSAIDIALW DIRGKVLGKP IYELLGGPVR DEIALYTHPN QAKFTSNEAV IREIRDIVES GHTALKFDPF PHQGRTADGQ AREQRDGYLD GSMTRKDERE AAELTALIRE TAGPDVDILI DAHGRFDVPT AIRLCRSLEE AGQIDWFEEP CPPESLNALR QVREKVSAAI SWGERGHTKW DFVPVLENRL ADYIMPDVTW TGGITELKKI SALCEAYYIP VSPHDAAGPI NVVAGAQVMM TVPNFYKLET SEWNLGKYDH LIDRPLDVSN GSLKLTPKPG LGVEMNRDYL QNHEIELD
|
| |