Gene Rleg2_4884 details

Gene Information

Locus tag: Rleg2_4884
Symbol:
ID: 6977978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 522547
End bp: 523992
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 65%
IMG OID: 643394041
Product: Aldehyde Dehydrogenase
Protein accession: YP_002278859
Protein GI: 209546941
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
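
The length fields above follow from the coordinates. A minimal sketch in Python shows the arithmetic, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(522547, 523992)
print(length, protein_length(length))  # prints: 1446 481
```

Both results match the Gene Length and Protein Length fields of this record.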


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATTT CCCTGCTGAT CAACGGCGCA GACCGCCCGG CCTCCGACGG CCGGACCTAT 
GACCGCATCG ATCCCTTCAC CGAAAAGCTC GCCAGCCGCG CCGCCGCGGC GAGCCTGCAG
GATGCAGCAG CCGCGGTCGA TGCCGCCTCC GCCGCCTTCG GCGCCTGGTC GAAGACCGGC
CCCGGCCAGC GCCGCGCGAT CCTGATGAAG GCCGCAGACA TCATGGATTC CAAGGTGGGC
GAGTTTACCC AGCTGATGAT CGAGGAAACC GGCGCCACTG CGCCCTGGGC CGGCTTCAAT
GTCATGTTCG CCGCCAACAT CCTGCGCGAA GCCGGCGCCA TGACGACTCA GATCTCAGGC
GAAATCATCC CTTCCAACAA ACCCGGCACG CTCGCCATGG GCGTTCGCCA GGCGGCAGGC
GTCTGTCTGG CAATTGCCCC CTGGAACGCG CCCGTCATCC TCGCCACCCG CGCCATCGCC
ATGCCGATCG CCTGCGGCAA CACCGCCATC CTCAAAGCCT CCGAACAATG CCCGGGCACG
CACCGGCTGA TCGCAACGGT GCTGACGGAA GCGGGCCTGC CGGCCGGCGT CCTCAATGTC
ATCACCAACG CGCCGGAGGA TGCGCCTGAA ATCGTCGCCG CGCTGATCGC CCATCCCGCC
GTCAAGCGCG TCAACTTCAC CGGATCGACC AAAGTCGGCA AGATCATCGC CGAGACCTGC
GGCAAGCATC TCAAGCCTGC CCTCCTCGAA CTCGGTGGCA AGGCTCCGCT TGTTATCCTC
GACGACGCCG ATATCGAAGG CGCCGTCAAT GCCGCCACGT TCGGCGCCTT CATGCATCAG
GGGCAGATCT GCATGTCGAC GGAGCGGATC ATCGTCGATG AGACGATCGC CGATCAGTTC
GTCGCCAAAC TGGCCGCCCG CGCCAGCCAG CTCCCGGCCG GCGACCCGCG CGGCCATGTC
GTGCTCGGCT CGCTGATCAG CCTCGACGCC GCCAAGAAGA TGGAAGAGCT GATCGCCGAT
GCGACGGCGA AGGGCGCCAA GCTTGTTGCC GGCGGCAAAC GCTCGGGCAC CGTGGTCGAG
GCGACACTGC TCGATCACGT CACTCCTGAT ATGCGCGTTT ATGCGGAAGA ATCCTTCGGC
CCGGTGAAAC CGATCATTCG CGTCTCGGGT GAGGAGGAAG CAATCCGCAT CGCCAACGAC
ACCGAATACG GCCTATCATC CGCCGTCTTC AGCCGCAACA TCCAGCGGGC GATGGCGGTT
GCCGCGCGCA TCGAATCCGG CATCTGCCAT ATCAACGGCC CGACCGTGCA CGACGAGGCG
CAAATGCCCT TCGGCGGCGT CAAGGGCAGC GGCTACGGCC GCTTCGGCGG CAAGGCGGCA
ATCGCCGAAT TCACCGATCT GCGCTGGATC ACCGTCGAGG ATTCCGCCCA GCACTATCCC
TTCTGA
 
Protein sequence
MNISLLINGA DRPASDGRTY DRIDPFTEKL ASRAAAASLQ DAAAAVDAAS AAFGAWSKTG 
PGQRRAILMK AADIMDSKVG EFTQLMIEET GATAPWAGFN VMFAANILRE AGAMTTQISG
EIIPSNKPGT LAMGVRQAAG VCLAIAPWNA PVILATRAIA MPIACGNTAI LKASEQCPGT
HRLIATVLTE AGLPAGVLNV ITNAPEDAPE IVAALIAHPA VKRVNFTGST KVGKIIAETC
GKHLKPALLE LGGKAPLVIL DDADIEGAVN AATFGAFMHQ GQICMSTERI IVDETIADQF
VAKLAARASQ LPAGDPRGHV VLGSLISLDA AKKMEELIAD ATAKGAKLVA GGKRSGTVVE
ATLLDHVTPD MRVYAEESFG PVKPIIRVSG EEEAIRIAND TEYGLSSAVF SRNIQRAMAV
AARIESGICH INGPTVHDEA QMPFGGVKGS GYGRFGGKAA IAEFTDLRWI TVEDSAQHYP
F
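
The sequences above are printed in space-separated blocks of 10 characters. Before computing anything on them, the blocks need to be joined. The sketch below shows one way to do that in Python and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an unbroken nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGAATATTT CCCTGCTGAT CAACGGCGCA GACCGCCCGG CCTCCGACGG CCGGACCTAT"
seq = join_blocks(first_line)
print(len(seq), seq[:3])  # prints: 60 ATG
```

Applying `gc_fraction` to the full joined gene sequence gives a value to compare against the GC content field.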