Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3974 |
Symbol | |
ID | 8015874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4050914 |
End bp | 4051876 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826543 |
Product | malate dehydrogenase |
Protein accession | YP_002977754 |
Protein GI | 241206658 |
COG category | [C] Energy production and conversion |
COG ID | [COG0039] Malate/lactate dehydrogenases |
TIGRFAM ID | [TIGR01763] malate dehydrogenase, NAD-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.342256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTA ACAAGATCGC ACTCATTGGT TCTGGCATGA TTGGTGGCAC GCTGGCGCAC CTCGCCGGCC TGAAGGAACT GGGCGACATC GTTCTCTTCG ACATCGCGGA CGGCATTCCC CAGGGCAAAG GTCTCGATAT CTCCCAGTCG TCGCCGGTCG AAGGCTTCGA CGTCAATCTG ACGGGTGCCA GCGACTATTC CGCGATCGAA GGCGCTGACG TCTGCATCGT CACGGCAGGC GTCGCCCGCA AGCCCGGCAT GAGCCGCGAT GACCTTCTCG GCATCAACCT CAAGGTCATG GAGCAGGTCG GCGCCGGCAT CAAGAAATAT GCCCCGAACG CCTTCGTGAT CTGCATCACC AACCCGCTCG ACGCCATGGT CTGGGCGCTG CAGAAGTTTT CCGGTCTTCC GGCCAACAAG GTCGTCGGCA TGGCTGGCGT TCTCGACTCC TCGCGCTTCC GTCTTTTCCT CGCCAAGGAA TTCAACGTGT CCGTCCAGGA TGTCACGGCC TTCGTTCTCG GCGGCCACGG CGACACGATG GTGCCGCTCG CCCGCTACTC GACGGTCGGC GGCATTCCGC TCACCGATCT CGTCACCATG GGCTGGGTCA CCAAGGAGCG CCTCGAAGAG ATCATCCAGC GCACCCGTGA CGGCGGTGCC GAAATCGTCG GCCTGCTGAA GACCGGCTCG GCCTATTACG CGCCGGCCGC CTCGGCAATC GAGATGGCCG AATCCTACCT CAAGGACAAG AAGCGTGTTC TGCCTTGTGC TGCCCACCTC TCCGGCCAGT ACGGCGTCAA GGACATGTAT GTCGGCGTTC CTACCGTCAT CGGCGCCGGC GGCGTCGAGC GCATCATCGA GATTGATCTC AACAAGACCG AGAAGGAAGC CTTCGACAAG TCCGTCGGCG CAGTCGCCGG TCTCTGCGAA GCCTGCATCA ACATCGCGCC TGCCCTGAAG TGA
|
Protein sequence | MARNKIALIG SGMIGGTLAH LAGLKELGDI VLFDIADGIP QGKGLDISQS SPVEGFDVNL TGASDYSAIE GADVCIVTAG VARKPGMSRD DLLGINLKVM EQVGAGIKKY APNAFVICIT NPLDAMVWAL QKFSGLPANK VVGMAGVLDS SRFRLFLAKE FNVSVQDVTA FVLGGHGDTM VPLARYSTVG GIPLTDLVTM GWVTKERLEE IIQRTRDGGA EIVGLLKTGS AYYAPAASAI EMAESYLKDK KRVLPCAAHL SGQYGVKDMY VGVPTVIGAG GVERIIEIDL NKTEKEAFDK SVGAVAGLCE ACINIAPALK
|
| |