Gene Rleg2_5505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5505 
Symbol 
ID6978599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1154916 
End bp1156430 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content64% 
IMG OID643394604 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002279422 
Protein GI209547504 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000574808 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGACGA CAGTCAGTTC CATCGAAGGT TCGCGACAGG GCCTGTTCAT CAACGGTGAG 
TTCGTGGCGC CCAAGGCCGG CAACTACATT GCCAGCTACG ACCCGACGAC CGGCGAGCGC
TGGTACGATC TGGCCGAAGC CGACGCCGAT GACGTCTCTG CGGCCGTTGC CGCTGCGAAT
GCCGCCTTCC GCAATCCCGC CTGGCGGCGG ATGACGCAGA CCGACCGCGG TGCCCTGGTG
CGCACGCTCG CCGAACTTGT CCGCACCAAT GCCGACACGC TTGCCGAAAT CGAAACCCGC
GACAACGGCA AGCTTCTCAA GGAAACCCGG GCGCAGATGC GCTCGATGCC GGACAGCTAT
CATTATTTCG CCGGGATGGC CGACAAGCTG CAGGGCGATA CGATCCCGAT CAACAGGGCC
GATACGCTGA ACATCAACCT GCGCGAAGCG CTCGGCGTCG TCGGCATGAT TACGCCCTGG
AATTCGCCGC TGATGCTTTT GACCGGCACG CTGGCGCCGT GCCTGGCGAT CGGCAACACC
GTCGTCATCA AGCCATCGGA ACATGCGACC GCCTCGACGC TGGCGCTTGC CGAACTGATC
CATGAGGCGG GCTTCCCCGC CGGGGTCGTC AACGTCGTCA CCGGCACCGG CAAGAGCGCC
GGTGAGGCGC TGACCCGCCA TCCCGGTGTT TCAAAATATG TCTTTACCGG CAGCACCGCC
ACCGGCCGCC GCATCGCCGG AAACGCGGCG CAGAACCTCG TGCCCTGCTC GATGGAGCTC
GGCGGAAAGT CGCCGCATGT GATCTTCGGC GATGTCGAGC TCGAGCATGC CGTCAATGGC
GTCGTTTCCG GCGTGTTTGC CGCTGCCGGC CAGACCTGCG TGGCCGGTTC ACGCTGCTTC
GTCGAGGCCA GCATCTACGA CAAGTTCATC GACGCGCTGA TTGCCCGCAC CGGCCGCATC
CGCGTCGGTC TGCCGACGGC AGAGGATACC GATATCGGCC CGCTGGCGCT TTCCGATCAG
TTGACGAAGG TCGAGGGCTA TGTGGCGTCC GGCGTCAAGG AAGGCGCGAA GATCGCCGCC
GGCGGGCGCC GTCCGCAGAA GGAAGGTCTG TCGCGTGCCG GCTGGTACTT CGAGCCGACG
GTGATGGTCG ATGTGCACAA CGACATGGGC TTCATGCGCG ACGAGATTTT CGGCCCGGTC
GTCGGCGTCA TGCCGTTCCG CGACGAGGCC GAGATGATCG CGCTCGCTAA CGACAGCCAT
TACGGCCTCG CTTCCGGGAT TTGGACCAAG GATATCGACC GCGCCCTGCG CTTCGCCAAC
CAGATCGAAG CCGGCACCGT CTGGGTCAAT ACCTATCGCT CGGCCTCCTT CATGTCGGCC
AATGGCGGCT TCAAGGAGAG CGGCTACGGC CGGCGCGGCG GCTTCGAGGT GATGCACGAA
TTCTCCCGGC TGAAAAACGT CATCATCGAT TATTCCGGGG CGATGCAGGA CCCCTTCGTC
ATCCGTCTGA AGTGA
 
Protein sequence
MKTTVSSIEG SRQGLFINGE FVAPKAGNYI ASYDPTTGER WYDLAEADAD DVSAAVAAAN 
AAFRNPAWRR MTQTDRGALV RTLAELVRTN ADTLAEIETR DNGKLLKETR AQMRSMPDSY
HYFAGMADKL QGDTIPINRA DTLNINLREA LGVVGMITPW NSPLMLLTGT LAPCLAIGNT
VVIKPSEHAT ASTLALAELI HEAGFPAGVV NVVTGTGKSA GEALTRHPGV SKYVFTGSTA
TGRRIAGNAA QNLVPCSMEL GGKSPHVIFG DVELEHAVNG VVSGVFAAAG QTCVAGSRCF
VEASIYDKFI DALIARTGRI RVGLPTAEDT DIGPLALSDQ LTKVEGYVAS GVKEGAKIAA
GGRRPQKEGL SRAGWYFEPT VMVDVHNDMG FMRDEIFGPV VGVMPFRDEA EMIALANDSH
YGLASGIWTK DIDRALRFAN QIEAGTVWVN TYRSASFMSA NGGFKESGYG RRGGFEVMHE
FSRLKNVIID YSGAMQDPFV IRLK