Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1354 |
Symbol | |
ID | 8012451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1336577 |
End bp | 1337767 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644823939 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_002975185 |
Protein GI | 241204089 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.937388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGAAC ATAACGTCCG CAACTTCAAT ATCAATTTCG GACCGCAGCA TCCGGCGGCG CACGGCGTTC TTCGTCTTGT CCTGGAGCTT GACGGCGAAA TTGTGGAGCG TGTCGATCCG CATATCGGCC TTTTGCATCG CGGCACCGAG AAGCTGATCG AGACCAAGAC CTATCTTCAG GCCGTGCCCT ATTTCGATCG CCTCGACTAC GTCGCACCGA TGAACCAGGA ACATGCCTAT GCGATGGCCG TCGAAAAGCT GCTCGGCATC GAGATCCCGA TTCGCGGCCA GCTGATCCGT GTTCTCTATT CGGAAATCGG CCGCATCCTG TCGCACCTTC TGAACGTCAC GACGCAGGCC ATGGACGTCG GCGCGCTGAC GCCGCCGCTC TGGGGCTTCG AAGAGCGTGA AAAGCTGATG GTGTTCTACG AGCGCGCCAG CGGCTCGCGC ATGCATGCCG CTTATATCCG TCCGGGCGGC GTCCACCAGG ATCTGCCCGA ACAGCTCGTG CAGGATATCG GCGCCTGGTG CGATCCGTTC CTGAAGGCGC TCGATGACAT CGACAACCTG TTGACCGGCA ACCGCATCTT CAAGCAGCGC AACGTCGATA TCGGCGTCGT CTCGCTGGAG GAATGCTGGG CTTGGGGCTT CTCCGGCGTC ATGGTGCGCG GTTCGGGTGC GGCCTGGGAT CTGCGTCGCG CCCAGCCCTA TGAATGTTAT TCCGATCTCG AATTCGATAT CCCGATCGGC AAGAATGGCG ACAACTACGA CCGTTATCTG ATCCGCATGA TCGAGATGCG CGAATCGGTC CGCATCATGA AGCAATGCGT CAACCGCCTG CTCTCGGATG CCAGGACCGG TCCTTTCTCG TCGATCGACG GCAAGGTCGT GCCGCCGAAG CGCGGCGAGA TGAAGCGCTC GATGGAAGCG CTGATCCACC ACTTCAAGCT CTATACCGAA GGCTACCATG TGCCGGCCGG CGAGGTTTAC GCCGCCGTCG AGGCGCCGAA GGGCGAATTC GGTGTCTATC TCGTCTCCGA CGGCACCAAC AAGCCCTATC GCTGCAAGAT CCGCGCGCCC GGTTATGCCC ATCTGCAGGC GATGGACTTC ATGTGCCGCG GCCACCAGCT TGCCGACGTC GCGGCCGTCC TCGGCTCGCT CGACATCGTC TTCGGCGAGG TGGATCGCTG A
|
Protein sequence | MTEHNVRNFN INFGPQHPAA HGVLRLVLEL DGEIVERVDP HIGLLHRGTE KLIETKTYLQ AVPYFDRLDY VAPMNQEHAY AMAVEKLLGI EIPIRGQLIR VLYSEIGRIL SHLLNVTTQA MDVGALTPPL WGFEEREKLM VFYERASGSR MHAAYIRPGG VHQDLPEQLV QDIGAWCDPF LKALDDIDNL LTGNRIFKQR NVDIGVVSLE ECWAWGFSGV MVRGSGAAWD LRRAQPYECY SDLEFDIPIG KNGDNYDRYL IRMIEMRESV RIMKQCVNRL LSDARTGPFS SIDGKVVPPK RGEMKRSMEA LIHHFKLYTE GYHVPAGEVY AAVEAPKGEF GVYLVSDGTN KPYRCKIRAP GYAHLQAMDF MCRGHQLADV AAVLGSLDIV FGEVDR
|
| |