Gene Information |
Locus tag | Rleg_5799 |
Symbol | |
ID | 8016597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | + |
Start bp | 375584 |
End bp | 376774 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644827936 |
Product | NADH dehydrogenase I, D subunit |
Protein accession | YP_002979136 |
Protein GI | 241518508 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
|
|
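The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table for Rleg_5799.
START_BP = 375584
END_BP = 376774
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the span from 375584 to 376774 covers 1191 bp, and 1191 bp corresponds to 397 codons, of which 396 encode amino acids.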
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.143218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGAGC ACGATGTACG AAACTTCCAC CTCAATTTCG GCCCGCAGCA TCCCGCCGCA CACGGCGTAT TGCGGCTGGT ACTGGAACTC AACGGGGAGA TCGTGGAGCG CATCGACCCC CATATTGGTC TTCTACATCG CGGCACCGAG AAGCTCATCG AGAAGAAGAC ATATCTTCAG GCGCTGCCCT ATTTTGACAG GCTTGATTAT GTCGCGCCGA TGAGCCAGGA GCACGCATAT TGCTTAGCGA TCGAAAAGAT GCTCGGCCTT GAGGTGCCTT ATCGCGCGCA GCTAATCCGC GTGCTCTACG CCGAGATCAG CCGCATTCTC TCTCATCTGC TGAATGTCAC CACTCAAGCG ATGGACGTTG GGGCGTTGAC GCCGCCCCTT TGGGGTTTCG AAGAGCGTGA GAAGTTGATG GTCTTTTATG AGCGCGCCTC CGGCTCTCGC ATGCATGCCG CCTATTTCAG GCCGGGCGGG GTTCATCAGG ATCTCCCGCC GAAGTTGGTT GCAGACATTG GCGAATGGTG CCGGAAATTT CCCCAGGTTA TTGAAGATAT CGGCGGCCTT CTCACCGACA ACCGGATCTT CAAGCAGCGC AACGTGGACG TCGGCCTCAT TTCACTTGAA GATGCCTGGG CCTGGGGCTT TTCCGGTGTG CTGATCCGGG GTTCGGGGGC TGCGTGGGAT CTTCGACGAG CCAATCCATA CGAGTGCTAT TCCGACCTCG AATTCGATAT TCCGATCGGC AAGAACGGAG ATTGCTACGA CCGTTATCTC ATCCGTATGC AGGAAATGCG GGAATCGGTG CGGATCATGA GCCAGTGCGC AGACCTGCTG CTCGGAAGCG CGTCCACCGG CCCCGTGAAC TCGAACGACG GCAAGGTCGT GCCTCCTAAG CGCGGCGAGA TGAAGCGGTC GATGGAGGCG CTCATCCACC ACTTCAAGCT CTACACCGAA GGTTTCCGAG TGCCGAAGGG GGAGGCCTAT GCGGCCGTGG AGGCGCCGAA AGGTGAATTC GGCGTCTATC TTGTCGCCGA TGGCACGAAT GCGCCATATC GCTGCAAGAT CAGGGCACCC GGTTTTACCC ATCTGCAGGC AATGGATTTC ATGTGCCGCG GCCATCAGCT TGCCGACGTC TCCGCCGTTC TCGGCTCCCT CGACATCGTC TTCGGCGAAG TCGATCGCTG A
|
Protein sequence | MAEHDVRNFH LNFGPQHPAA HGVLRLVLEL NGEIVERIDP HIGLLHRGTE KLIEKKTYLQ ALPYFDRLDY VAPMSQEHAY CLAIEKMLGL EVPYRAQLIR VLYAEISRIL SHLLNVTTQA MDVGALTPPL WGFEEREKLM VFYERASGSR MHAAYFRPGG VHQDLPPKLV ADIGEWCRKF PQVIEDIGGL LTDNRIFKQR NVDVGLISLE DAWAWGFSGV LIRGSGAAWD LRRANPYECY SDLEFDIPIG KNGDCYDRYL IRMQEMRESV RIMSQCADLL LGSASTGPVN SNDGKVVPPK RGEMKRSMEA LIHHFKLYTE GFRVPKGEAY AAVEAPKGEF GVYLVADGTN APYRCKIRAP GFTHLQAMDF MCRGHQLADV SAVLGSLDIV FGEVDR
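
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, assuming translation table 11 (the bacterial, archaeal and plant plastid code), in which the codon-to-amino-acid assignments match the standard code and an alternative initiation codon such as GTG is read as methionine at the first position. The helper names `translate_cds` and `gc_percent` are hypothetical, and the routine does not check for internal stop codons.

```python
import itertools

# Standard codon order: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spacing used in the 10-base display groups and uppercase."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; the first codon is read as Met if it is a start codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    print(translate_cds("GTGGCAGAGC ACGATGTACG AAACTTCCAC"))
```

Applied to the first 30 bases of the gene sequence, the routine returns MAEHDVRNFH, which agrees with the first ten residues of the protein sequence; the leading GTG codon is read as M because it is in the initiation position.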