Gene Rleg_4795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4795 
Symbol 
ID8007479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp164510 
End bp165955 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content65% 
IMG OID644821725 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002972985 
Protein GI241113150 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0338248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTT CCCTCCTCAT AAACGGCGCC GACCGCGCGG CCTCCGGCGG CCGGACCTTT 
GATCGCATCG ATCCCTTCAC CGAGAAGCTC GCAAGCCGCG CCGCCGCGGC AAGCCTGGAT
GATGCGGCCG CTGCCGTCGA AGCCGCCGCC GCTGCCTTCG GCGCCTGGTC GAAGACCGGC
CCCGGCCAGC GCCGCGCGAT CCTGATGAAG GCTGCCGATA TCATGGATTC CAAAGTCGGC
GAGTTTACCC GGCTGATGAT CGAGGAGACC GGCTCAACCG CACCCTGGGC CGGCTTCAAT
GTCATGCTCG CCGCCAACAT CCTGCGCGAG GCCGGCGCCA TGACGACGCA GATATCAGGC
GAAATCATCC CTTCCGATAA GCCGGGCACG CTCGCCATGG GTGTTCGCCA GGCCGCCGGC
GTCTGCCTGG CGATCGCTCC CTGGAACGCG CCGGTCATCC TCGCCACCCG CGCCATCGCC
ATGCCGATCG CCTGCGGCAA TACCGTCGTG CTCAAGGCCT CGGAACAATG CCCCGGCACG
CATAGGCTGA TCGCCACCGC GCTGACCGAA GCCGGCCTGC CGGCCGGCGT CATCAACGTC
CTCACCAACG CGCCGGAAGA TGCGCCCGAG ATCGTCGCCG CGCTGATTGC CCATCCGGCC
GTCAAGCGCG TGAACTTCAC CGGGTCGACC AAGGTCGGCA AGATCATCGC CGAGACCTGC
GGCAGGTATC TCAAACCCGC CCTTCTTGAG CTCGGCGGCA AGGCACCGCT GGTCATCCTC
GACGACGCCG ATATCGACGG TGCCGTCAAT GCCGCCATTT TCGGCGCCTT CATGCATCAG
GGCCAGATCT GCATGTCGAC CGAGCGGATC ATCGTCGACG AGACGATCGC CGATCAGTTC
GTCGCCAAAC TGGCCGCCCG CGCCAGCCAG CTGCCGGCCG GCGACCCACG TGGCCACGTC
GTTCTCGGCT CGCTGATCAG CCTCGATGCG GCGAAGAAAA TGGAGGAGCT GATCGCCGAT
GCGACGGCCA AGGGGGCAAA ACTCGTGGCC GGCGGCAAAC GCTCGGGCAC CGTGGTCGAG
GCGACGCTGC TTGATCATGT CACGCCCGAG ATGCGCGTCT ATGCAGAAGA ATCCTTCGGC
CCGGTCAAGC CGATCATTCG CGTTACCAGT GAGGAAGAAG CCATCCGCAT CGCCAACGAC
ACCGAATACG GCCTGTCATC CGCCGTCTTC AGCCGCAATG TCCAGCGGGC AATGGCGGTT
GCGGCGCGGA TCGAATCCGG CATCTGCCAC ATCAATGGCC CGACGGTAAA CGACGAGGCG
CAAATGCCCT TCGGCGGCGT CAAGGGCAGC GGCTACGGCC GCTTCGGCGG CAAGGCGGCA
ATCGCTGAAT TCACCGACCT GCGCTGGATC ACGGTCGAGG ATTCCGCCCA GCACTATCCT
TTCTGA
 
Protein sequence
MNISLLINGA DRAASGGRTF DRIDPFTEKL ASRAAAASLD DAAAAVEAAA AAFGAWSKTG 
PGQRRAILMK AADIMDSKVG EFTRLMIEET GSTAPWAGFN VMLAANILRE AGAMTTQISG
EIIPSDKPGT LAMGVRQAAG VCLAIAPWNA PVILATRAIA MPIACGNTVV LKASEQCPGT
HRLIATALTE AGLPAGVINV LTNAPEDAPE IVAALIAHPA VKRVNFTGST KVGKIIAETC
GRYLKPALLE LGGKAPLVIL DDADIDGAVN AAIFGAFMHQ GQICMSTERI IVDETIADQF
VAKLAARASQ LPAGDPRGHV VLGSLISLDA AKKMEELIAD ATAKGAKLVA GGKRSGTVVE
ATLLDHVTPE MRVYAEESFG PVKPIIRVTS EEEAIRIAND TEYGLSSAVF SRNVQRAMAV
AARIESGICH INGPTVNDEA QMPFGGVKGS GYGRFGGKAA IAEFTDLRWI TVEDSAQHYP
F