Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1264 |
Symbol | |
ID | 6979988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1279954 |
End bp | 1281120 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643395981 |
Product | NADH dehydrogenase subunit E |
Protein accession | YP_002280784 |
Protein GI | 209548867 |
COG category | [C] Energy production and conversion [S] Function unknown |
COG ID | [COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit [COG3743] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01958] NADH-quinone oxidoreductase, E subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTTC GTCGATTAGC CGAAGATCAA TTTCAGCCTG CCGCATTCGC TTTCAGCGAT GAAAATGCGG TCTGGGCGGA CAAGACGATC CAGAAATACC CCGCCGGCCG CCAGCAGTCG GCGGTCATCC CGCTGTTGAT GCGGGCGCAG GAGCAGGACG GTTGGGTCAC GCGCGCGGCG ATCGAAAAGA TCGCCGACAT GCTCGATATG GCCTATATCC GGGTGCTTGA GGTCGCGACC TTCTATACGC AGTTCCAGCT GCATCCTGTC GGCACCCGCG CCCATGTCCA GGTCTGCGGC ACGACGCCCT GCATGCTGCG CGGCTCGGAA GCGCTGATGT CGGTCTGCAA GAGCAAGATC CACGCCCATG CCTTCGAGCG CAATGCCGAG GGCACGCTGT CCTGGGAAGA GGTCGAATGT CTTGGCGCCT GCGTCAACGC CCCGATGGTG ATGATCGGCA AGGACACCTA TGAAGACCTG ACGCCGGCGC GTCTCGAAGA AATCATCGAT ACTTTTGCTG CCGGCAATGG CGCGAGTATC AAGCCCGGCA CCCAGATCGA CCGGATTTTC TCCGCCCCTG AAGGCGGCCC GACTTCGCTG ACGACGGAAG AGCCGAAGGC AAGGACGCGC GCCAAGAAGG CCGATGCCGA AAGCATTTCG GCTCCCGTCG ACGCCGCTCC GGTTCCGCCC TCCGAGGCTG CCCGCCCGAA GAGCACCGAT GCCGAAACCA ACGCTGCCCT GAAGACGCCG GCAACGGCGC CGAAGGCGGC TGCCAGGAAT GCCAAGGCTG CCGAGCAGCA GCCGGTTTCC GGCACGGCAC CTGCCGAACC GGCACCGGTG GCGGCCGCCA AGGCCGAAGC CGCCCGGGCG GCAAAGCCTG CTCTCACCGA CAAGAACCGT CCGGCCGGCA TCGAAAAGCC CGCCGCGCCG GATGACCTGA AGATGATCTC CGGCGTCGGC CCGAAGATCG AGGCGACGCT GAACGAAATC GGCATCTTCA CCTTCTCGCA GGTCGCGGGC TGGAAGAAGG CCGAACGCGA ATGGGTCGAC GGCTACCTGA ACTTCCGCGG CCGCATCGAG CGCGACGACT GGGTCAAGCA GGCCAAGGCG CTCGCCAAGG GCGGCGAAGC GGAATATATC AAGGTCTTCG GCAAGAAGCC GCGGTAA
|
Protein sequence | MSVRRLAEDQ FQPAAFAFSD ENAVWADKTI QKYPAGRQQS AVIPLLMRAQ EQDGWVTRAA IEKIADMLDM AYIRVLEVAT FYTQFQLHPV GTRAHVQVCG TTPCMLRGSE ALMSVCKSKI HAHAFERNAE GTLSWEEVEC LGACVNAPMV MIGKDTYEDL TPARLEEIID TFAAGNGASI KPGTQIDRIF SAPEGGPTSL TTEEPKARTR AKKADAESIS APVDAAPVPP SEAARPKSTD AETNAALKTP ATAPKAAARN AKAAEQQPVS GTAPAEPAPV AAAKAEAARA AKPALTDKNR PAGIEKPAAP DDLKMISGVG PKIEATLNEI GIFTFSQVAG WKKAEREWVD GYLNFRGRIE RDDWVKQAKA LAKGGEAEYI KVFGKKPR
|
| |