Gene Rleg2_4891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4891 
Symbol 
ID6977985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp529629 
End bp531095 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content66% 
IMG OID643394048 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002278866 
Protein GI209546948 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.364964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.705807 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGGTG AAACGGTGTT TTCGGCAAAG CTGATGATCA ACAACGAGGC GCTGGATGCT 
TCCGAAAGGG CGACCTTCGA ACGCATCGAT CCGCTGAGCG GCGATGTCGC AACGATTGCT
TCGGCGGGAT CCATCGCCGA CATGACGCGG GCAGCCAATG CCGCCGCTGC CGCCTTTCCC
GACTGGTCGC AGACCGGGCC GGGCGAACGG CGCTGGCTGC TGAATGCCGC GGCCGATCTG
CTGGAGGCCC GCACGCCGGA ACTCATTGCC GCCATGACCG GCGAAACCGG CGCCACGGCG
CAATGGGCGG CGATCAATTG CGGGCTCGGC GCCGATATCT TTCGCGAGGC GGCGGCGATG
ACCACGCAAA TCTCAGGCGA GCTCATTCCG TCAGGCATTC CCGGCAGCCT CGCCATGGCG
GTGCGCCAGC CGGCCGGCGT CTGCGTCGGC ATCGCCCCCT GGAATGCGCC GGTCATTCTC
GGCGCCCGCG CCGTCGCCAT GCCGCTTGCC TGCGGCAACA CCGTCGTGCT CAAGGCCTCG
GAACTCTGCC CGAAGACCCA CGGCCTGATC GGCGATATCC TGCGTGACGC CGGTTTTCCG
CGCGGCGTCG TCAATGTCGT TTCCAATGCG CCGAGCGATG CCGCTGCGGT CGTCGATGCG
CTGATCGCCC ATCCGGCCGT GCGCCGCATC AATTTCACCG GCTCCACCCG TGTCGGCCGG
ATCATCGCCG AAAGCGCAGC ACGACATCTG AAGCGCTGCC TGCTCGAACT CGGCGGCAAG
GCGCCGTTCA TCGTGCTGGC CGACGCCGAT ATCGACGAGG CGGTCGGTGC CGCCGCCTTC
GGCGCCTTCA TGAACCAGGG CCAGATCTGC ATGTCCACCG AGCGGATCAT CCTGATGGAC
GAGATCGCCG ACGGCTTCGT CGGCAAGTTT CGGACGAGAG CCGCAACCCT CGTTGCAGGC
CACCCCGGAG ACGGCAACAC GCCGCTCGGC ACGCTGATCA ACGCAGAGGC CGTGCGCCGC
GTCAGGTCGC TGATCGACGA TGCCTTGCAG AAGGGCGCGG TCCTCCTCTG CGGCGGCGAG
GCCCACGGCA CGCTGATGGA TGCGACCGTC ATCGATCACG TCACCCCTGC CATGCGCGTC
TACCGCGAGG AGAGCTTCGG GCCGGTCGCG GCAATCATCC GAGTCGGCAG CGTCGACGAG
GCCGTGACGG TCGCCAACGA CAACGAATAT GGGCTTTCGG CGGCGGTGTT CAGCGCCGAT
GTCAATGCGG CCTTGGCCGT CGCCATGCGG CTTGAATCCG GCATCTGCCA CATCAACGAG
GCGACGGTTT CCGATGAGCC GCAAATGCCG TTCGGCGGCG TCAAATCGAG CGGCTACGGC
CGCTTCGGCG GCAAGGCGGC GATCGATGAA TTCACCGAGC TCCGATGGCT CACCATCGCA
TCGGGAAAAC GGCAATACCC GATCTGA
 
Protein sequence
MRGETVFSAK LMINNEALDA SERATFERID PLSGDVATIA SAGSIADMTR AANAAAAAFP 
DWSQTGPGER RWLLNAAADL LEARTPELIA AMTGETGATA QWAAINCGLG ADIFREAAAM
TTQISGELIP SGIPGSLAMA VRQPAGVCVG IAPWNAPVIL GARAVAMPLA CGNTVVLKAS
ELCPKTHGLI GDILRDAGFP RGVVNVVSNA PSDAAAVVDA LIAHPAVRRI NFTGSTRVGR
IIAESAARHL KRCLLELGGK APFIVLADAD IDEAVGAAAF GAFMNQGQIC MSTERIILMD
EIADGFVGKF RTRAATLVAG HPGDGNTPLG TLINAEAVRR VRSLIDDALQ KGAVLLCGGE
AHGTLMDATV IDHVTPAMRV YREESFGPVA AIIRVGSVDE AVTVANDNEY GLSAAVFSAD
VNAALAVAMR LESGICHINE ATVSDEPQMP FGGVKSSGYG RFGGKAAIDE FTELRWLTIA
SGKRQYPI