Gene Rleg2_1264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1264 
Symbol 
ID6979988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1279954 
End bp1281120 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content64% 
IMG OID643395981 
ProductNADH dehydrogenase subunit E 
Protein accessionYP_002280784 
Protein GI209548867 
COG category[C] Energy production and conversion
[S] Function unknown 
COG ID[COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit
[COG3743] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01958] NADH-quinone oxidoreductase, E subunit 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTTC GTCGATTAGC CGAAGATCAA TTTCAGCCTG CCGCATTCGC TTTCAGCGAT 
GAAAATGCGG TCTGGGCGGA CAAGACGATC CAGAAATACC CCGCCGGCCG CCAGCAGTCG
GCGGTCATCC CGCTGTTGAT GCGGGCGCAG GAGCAGGACG GTTGGGTCAC GCGCGCGGCG
ATCGAAAAGA TCGCCGACAT GCTCGATATG GCCTATATCC GGGTGCTTGA GGTCGCGACC
TTCTATACGC AGTTCCAGCT GCATCCTGTC GGCACCCGCG CCCATGTCCA GGTCTGCGGC
ACGACGCCCT GCATGCTGCG CGGCTCGGAA GCGCTGATGT CGGTCTGCAA GAGCAAGATC
CACGCCCATG CCTTCGAGCG CAATGCCGAG GGCACGCTGT CCTGGGAAGA GGTCGAATGT
CTTGGCGCCT GCGTCAACGC CCCGATGGTG ATGATCGGCA AGGACACCTA TGAAGACCTG
ACGCCGGCGC GTCTCGAAGA AATCATCGAT ACTTTTGCTG CCGGCAATGG CGCGAGTATC
AAGCCCGGCA CCCAGATCGA CCGGATTTTC TCCGCCCCTG AAGGCGGCCC GACTTCGCTG
ACGACGGAAG AGCCGAAGGC AAGGACGCGC GCCAAGAAGG CCGATGCCGA AAGCATTTCG
GCTCCCGTCG ACGCCGCTCC GGTTCCGCCC TCCGAGGCTG CCCGCCCGAA GAGCACCGAT
GCCGAAACCA ACGCTGCCCT GAAGACGCCG GCAACGGCGC CGAAGGCGGC TGCCAGGAAT
GCCAAGGCTG CCGAGCAGCA GCCGGTTTCC GGCACGGCAC CTGCCGAACC GGCACCGGTG
GCGGCCGCCA AGGCCGAAGC CGCCCGGGCG GCAAAGCCTG CTCTCACCGA CAAGAACCGT
CCGGCCGGCA TCGAAAAGCC CGCCGCGCCG GATGACCTGA AGATGATCTC CGGCGTCGGC
CCGAAGATCG AGGCGACGCT GAACGAAATC GGCATCTTCA CCTTCTCGCA GGTCGCGGGC
TGGAAGAAGG CCGAACGCGA ATGGGTCGAC GGCTACCTGA ACTTCCGCGG CCGCATCGAG
CGCGACGACT GGGTCAAGCA GGCCAAGGCG CTCGCCAAGG GCGGCGAAGC GGAATATATC
AAGGTCTTCG GCAAGAAGCC GCGGTAA
 
Protein sequence
MSVRRLAEDQ FQPAAFAFSD ENAVWADKTI QKYPAGRQQS AVIPLLMRAQ EQDGWVTRAA 
IEKIADMLDM AYIRVLEVAT FYTQFQLHPV GTRAHVQVCG TTPCMLRGSE ALMSVCKSKI
HAHAFERNAE GTLSWEEVEC LGACVNAPMV MIGKDTYEDL TPARLEEIID TFAAGNGASI
KPGTQIDRIF SAPEGGPTSL TTEEPKARTR AKKADAESIS APVDAAPVPP SEAARPKSTD
AETNAALKTP ATAPKAAARN AKAAEQQPVS GTAPAEPAPV AAAKAEAARA AKPALTDKNR
PAGIEKPAAP DDLKMISGVG PKIEATLNEI GIFTFSQVAG WKKAEREWVD GYLNFRGRIE
RDDWVKQAKA LAKGGEAEYI KVFGKKPR