Gene Information |
Locus tag | Rleg2_3986 |
Symbol | |
ID | 6982750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4137907 |
End bp | 4140078 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643398709 |
Product | malate synthase G |
Protein accession | YP_002283474 |
Protein GI | 209551557 |
COG category | [C] Energy production and conversion |
COG ID | [COG2225] Malate synthase |
TIGRFAM ID | [TIGR01345] malate synthase G |
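The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the record uses 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Minimal consistency check for the coordinate fields in the record.
# Assumption: Start bp / End bp are 1-based and inclusive.

START_BP = 4137907
END_BP = 4140078


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2172, matching "Gene Length"
print(protein_length(length_bp))  # 723, matching "Protein Length"
```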
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCGCA TTGATAAGAA CGGTCTTGCC ATCGAAGCCG TCCTTCATGA TTTCCTCGTC CAGGAGGTTC TGCCGGGTCT GGCGATCGAT GCGGACAAGT TCTTTGCCGA TTTTTCGGCG ATCGTCCACG ATCTCGCCCC GAAGAATCGC GCCCTGCTGG CAAGACGCGA CGAGCTGGAG GTCAAGATCG ACGACTGGTA TCGCCAGCAC GGCGCGCCGG CGGATATGGA CGACTACCAA TCCTTCCTGC GCGGGATCGG TTATCTCCTG CCGGAAGGAT CGGACTTCCA GGTTTCGACC CAAAATGTCG ACCCCGAGAT CGCCTCGATC GCCGGCCCGC AGCTCGTCGT TCCCGTCATG AATGCCCGTT ATGCACTCAA CGCCGCCAAC GCCCGCTGGG GCTCGCTCTA TGATGCGCTC TATGGCACGG ACGCCATTCC CGAGACCGAC GGCGCGGAAA AGGGCAGGGG CTACAATCCG AAGCGCGGCG AGAAGGTTAT CGCCTGGGTA CGCGATTTCC TCGATATGTC GGTACCTTTG CAAGACAGCA GCTGGAAGAA TGTCGGCAAC TTCACCGTCA AGGATGGGAT ACTTGTCATC AGATCGGTCG ATGGCGAGCA GGCCATGCTT GTCGATGGCG GGCATTTTGC CGGCTATCGC GGCGATGCTG CCGCCCCGAC GCATATTCTC CTGAAGAACA ACGGCATCCA TATCAAGATC GTCATCGATG CGACGACGGC GATCGGCAAG ACCGATCCGG CCCATATTTC CGATGTCTGG CTGGAATCGG CGATCACCAC GATCATGGAC TGTGAGGATT CGATTGCCGC CGTCGATGCC GAGGACAAGA CTGTCGTTTA CCGCAACTGG CTCGGCCTGA TGAAGGGCGA CCTGCAGGAA GAGGTGGTAA AGGGCGGAAC GAGCTTCGTC CGTAAGCTCA ACCCGGATCT GGACTATACC GGTCCGGATG GCACCGTCTT CGAACTGCAT CGCCGTTCGC TGATGCTGGT GCGCAATGTC GGCCACCTGA TGACCAACCC GGCGATCCTC GACCGCGACG GTGCCGAAGT GCCGGAGGGC ATCCTGGATG CCGTCATCAC CGGCCTGATC GCGCTTTACG ATATCGGCCC GGCCGGCCGG AGGAAGAATT CCCGCACCGG TTCCATGTAT GTGGTCAAGC CGAAGATGCA TGGACCGGAA GAGGTGGCCT TCGCCGTCGA GATCTTTTCG CGGGTCGAAG ATGCGCTTGG CATGGCGCGC AACACCATCA AGATGGGCAT CATGGATGAG GAGCGCCGCA CGACGGTCAA TCTCAAGGAA TGCATCCGTG CTGCCCGCGA GCGGGTCGTC TTCATCAATA CCGGCTTCCT CGACCGCACC GGCGACGAGA TCCACACCTC GATGGAGGCC GGCCCGATGA TTCGCAAGGG CGACATGCGC CAGGCAGCCT GGATATCGGC CTATGAGAAC TGGAACGTCG ACATCGGCCT GGAATGCGGC CTCGCCGGCC ACGCCCAGAT CGGCAAGGGC ATGTGGGCAA TGCCGGATCT GATGGCGGCG ATGCTGGAGC AGAAGATCGC CCATCCGAAG GCCGGTGCCA ACACCGCCTG GGTCCCGTCG CCGACCGCCG CGACCCTGCA CGCCACCCAT TATCACCGGG TCAATGTCGC CAGGGTTCAG CAGGGGCTGA AGGACCGCGC CCGCGCCAAG CTCTCCGACA TTCTCTCCGT GCCGGTCGCG GTACGGCCGA ACTGGACGCC GGAGGAAATC CAGCGCGAGC TTGATAACAA TGCCCAGGGC 
ATCCTCGGCT ACGTCGTACG CTGGGTCGAT CAGGGCGTCG GCTGCTCGAA GGTGCCCGAT ATCAACAATG TCGGCCTGAT GGAGGATCGC GCCACATTGC GCATCTCGGC CCAGCACATG GCGAACTGGC TGCATCACCA GGTCGTCACC GAGGCTCAGA TCGTCGAAAC CATGAAGCGC ATGGCCGCCG TCGTCGACCG GCAGAACGAG GCGGATCCTC TCTACCAGCC GATGGCCGGT AATTTCGACG GATCGATCGC CTTCCAGGCC GCCCTCGACC TCGTGCTGAA GGGCAGGGAG CAGCCGAACG GCTATACCGA GCCAGTGCTT CATCGCCGCC GCCTCGAGCT GAAGGCGAAG CAGGCCGGTT GA
Protein sequence | MSRIDKNGLA IEAVLHDFLV QEVLPGLAID ADKFFADFSA IVHDLAPKNR ALLARRDELE VKIDDWYRQH GAPADMDDYQ SFLRGIGYLL PEGSDFQVST QNVDPEIASI AGPQLVVPVM NARYALNAAN ARWGSLYDAL YGTDAIPETD GAEKGRGYNP KRGEKVIAWV RDFLDMSVPL QDSSWKNVGN FTVKDGILVI RSVDGEQAML VDGGHFAGYR GDAAAPTHIL LKNNGIHIKI VIDATTAIGK TDPAHISDVW LESAITTIMD CEDSIAAVDA EDKTVVYRNW LGLMKGDLQE EVVKGGTSFV RKLNPDLDYT GPDGTVFELH RRSLMLVRNV GHLMTNPAIL DRDGAEVPEG ILDAVITGLI ALYDIGPAGR RKNSRTGSMY VVKPKMHGPE EVAFAVEIFS RVEDALGMAR NTIKMGIMDE ERRTTVNLKE CIRAARERVV FINTGFLDRT GDEIHTSMEA GPMIRKGDMR QAAWISAYEN WNVDIGLECG LAGHAQIGKG MWAMPDLMAA MLEQKIAHPK AGANTAWVPS PTAATLHATH YHRVNVARVQ QGLKDRARAK LSDILSVPVA VRPNWTPEEI QRELDNNAQG ILGYVVRWVD QGVGCSKVPD INNVGLMEDR ATLRISAQHM ANWLHHQVVT EAQIVETMKR MAAVVDRQNE ADPLYQPMAG NFDGSIAFQA ALDLVLKGRE QPNGYTEPVL HRRRLELKAK QAG
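The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a small sketch. It assumes the sequences are pasted as printed here, in whitespace-separated blocks of ten characters. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1 (table 11 differs in the set of permitted start codons, which does not matter for this gene because it begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are illustrative only.

```python
# Sketch: strip block spacing, compute GC fraction, and translate a CDS.
# Assumption: input contains only A, C, G, T plus whitespace.

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# also used by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGAGCCGCA TTGATAAGAA CGGTCTTGCC"))  # MSRIDKNGLA
```

Applied to the full gene sequence, `translate` should give the protein sequence listed in the record, and `gc_fraction` should round to the reported GC content, provided both lines of the gene sequence are joined first.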