Gene Rleg_4802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4802 
Symbol 
ID8007486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp171623 
End bp173119 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content64% 
IMG OID644821732 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002972992 
Protein GI241113157 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.217691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.597938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGTCAA TACGTGGCGC CAATGCGTGG CAAACGGGTA GAACAGTGTT CGCGGCAAAG 
CTGATGATCA ACAACGAGGC GATGGATGCT TCCGAAGGGG CCACCTTCGA ACGCATCGAT
CCGTTGACCG GCGACGTCGC AACGATCGCC TCGGCAGGAT CCGTTACCGA CATGACGCGG
GCAGCCAATG CGGCGGCAGC CGCTTTTCCG GACTGGTCGC AGACCGGCCC GGGCGAACGG
CGCAGGCTTC TAAATGCCGC CGCCGACATC CTGGAGGCCC GCACATCCGA GCTCGTTGCC
GCCATGACCG GCGAAACCGG CGCCACCGCG CAGTGGGCGG CGATCAATTG CGGCCTCGGC
GCGGACATTC TTCGCGAAGC GGCGGCGATG ACCACGCAAA TATCAGGCGA GCTCATTCCT
TCCGGCATTC CGGGAAGCCT CGCCATGGCG GTCCGCCAGC CGGCAGGCGT CTGCGTCGGC
ATTGCCCCCT GGAATGCACC TATTATCCTC GGCACCCGTG CCGTCGCCAT GCCGCTTGCC
TGCGGCAACA CCGTCATCCT GAAGGCTTCC GAACTCTGCC CGAAGACCCA TGGCCTGATC
GGCGACATCC TGCGTGACGC CGGCTTTCCG CGCGGTGTCG TCAATGTCGT TTCCAATGCG
CCGAGCGATG CCGCCGCTGT CGTCGATGCG CTGATCGCCC ATCCGGCGGT GCGCCGTATC
AACTTCACCG GCTCCACCCG TGTCGGCAGG ATCATTGCCG AATCCTCGGC AAGGCATCTC
AAGCGCTGCC TGCTCGAACT CGGCGGCAAG GCGCCGTTCA TCGTCCTTGC AGACGCCGAT
ATCGATGAGG CCGTGCGTGC CGCCGCCTTC GGCGCCTTCA TGAACCAGGG CCAGATCTGC
ATGTCGACCG AACGGATCAT CCTCATGGAC GAGATTGCCG ATGGCTTCGT CGGCAAGTTC
CGGACGAAAG CCGCAACCCT CGTCGCCGGC AATCCCAAGG ACGGCAACAC GCCGCTCGGC
ACGCTGATCA ACACTGAGGC CGTCCGCCGC GTCAGGTCAC TCGTCGATGA TGCCCTGCAG
AAGGGCGCGG TCCTCGTCTG CGGTGGCCAG GCCAACGGCA CGCTAATGGA CGCAACCGTC
ATCGATCACG TGACGCCTGC CATGCGCATC TATCGCGAAG AGAGCTTTGG GCCGGTCGCG
GCGATCATCC GGGTCGGCAG CGTCGACGAG GCCGTGACGG TTGCCAATGA CAATGAATAC
GGGCTGTCGG CCGCCGTTTT CAGCGCCGAC GTCAATGCAG CACTTGCCGT CGCCATGCGG
CTTGAATCCG GCATCTGCCA CATCAACGAA GCGACAGTTT CCGACGAGCC GCAAATGCCG
TTCGGCGGCG TCAAATCAAG CGGCTACGGC CGCTTCGGCG GCAAGGCGGC GATCGACGAA
TTCACCGAGC TCAGATGGAT CACCATGGCC TCGGCAAAAC GACACTACCC GATCTGA
 
Protein sequence
MASIRGANAW QTGRTVFAAK LMINNEAMDA SEGATFERID PLTGDVATIA SAGSVTDMTR 
AANAAAAAFP DWSQTGPGER RRLLNAAADI LEARTSELVA AMTGETGATA QWAAINCGLG
ADILREAAAM TTQISGELIP SGIPGSLAMA VRQPAGVCVG IAPWNAPIIL GTRAVAMPLA
CGNTVILKAS ELCPKTHGLI GDILRDAGFP RGVVNVVSNA PSDAAAVVDA LIAHPAVRRI
NFTGSTRVGR IIAESSARHL KRCLLELGGK APFIVLADAD IDEAVRAAAF GAFMNQGQIC
MSTERIILMD EIADGFVGKF RTKAATLVAG NPKDGNTPLG TLINTEAVRR VRSLVDDALQ
KGAVLVCGGQ ANGTLMDATV IDHVTPAMRI YREESFGPVA AIIRVGSVDE AVTVANDNEY
GLSAAVFSAD VNAALAVAMR LESGICHINE ATVSDEPQMP FGGVKSSGYG RFGGKAAIDE
FTELRWITMA SAKRHYPI