Gene Rleg_5393 details

Gene Information

Locus tag: Rleg_5393
Symbol:
ID: 8007351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 807267
End bp: 808658
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 62%
IMG OID: 644822297
Product: Aldehyde Dehydrogenase
Protein accession: YP_002973557
Protein GI: 241113722
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
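
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminating stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(807267, 808658)
print(length)                     # 1392
print(protein_length_aa(length))  # 463
```

Both results match the Gene Length (1392 bp) and Protein Length (463 aa) fields reported in this record.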


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.587112
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTCTA CCATGAAGGT CTACCAGGCG TTCAACCGCA AGCCGATTGC AGAGCTGCCG 
GCAGACGACG TGGCCGCACT CGAGCGCAAG CTTCAGCTCG CCGCCAAAAG CTTCGCCGAT
CGCGATGGCT GGCTGCCCCC GCATCAACGG ATGGCGATCC TCAGAAAGGC GTCGGCGCTT
CTGCAGGAGA ACCGCGATCG TTTTGCGATG ATGATCGCCC GTGAAGGCGG CAAGCCGCTG
ACGGATGCGA TCATCGAGGT GACGCGCGGG ATCGACGGCC TCCTCAATGC GGCGGACGAG
CTGCGCAATT TCGGCGGCAA GGAAATCCCC ATGGGGCTGA CCGCGGCCAG CGCCAACAGA
TGGGCTTTCA CCACCAAGGA GCCGATCGGC GTCGTGGCGG CGATCTCGGC CTTCAACCAT
CCGCTCAATC TCATCATCCA CCAGATCGCG CCGGCGATCG CGGTCGGCTG CCCGGTTATC
GTGAAGCCGG CGGCAACGAC GCCGATTTCC TGCATCGAGA TCGTCAAGCT GTTCTGGGAG
GCAGGTCTCG ATGAGCGCTG GTGCCAGACC CTCATCACCG AGGACAACGC GCTTGCCGAA
GCCTTTGCAA CCGATCACCG CGTCGCGTTT TTGAGCTTCA TCGGTTCCGC CAAGGTCGGC
TGGTACCTCA AGGGCAAGCT GCCGCCGGGG ACCCGATGCG CGCTTGAGCA CGGTGGAGCG
GCACCCGTCA TCGTCGACCG CAGCGCCAAT GTCGATGCGA TCGTCGGCAC CATTGTCAAA
GGTGGCTATT ATCACGCCGG GCAAGTCTGC GTGTCGGCCC AGCGTCTCTT CGTGCATGAA
GATATTCTGG CTTCCTTCAC CGAGGCTCTC GCTGCCAGGG TGGCAGCTCT GCATGTCGGC
GATCCCACGC TTATGCAAAC CGAGGTAGGG CCGCTCATCC TGCCACGCGA GGCAGACCGC
GTCGCCGCCT GGATCAAGGA GGCGACGGAC GCCGGAACCA GGCAGATCGG CGGCGGTCGG
ATGTCCGAGA CGACACTTCT GCCTTCGGTT TTGCTGGACC CGCCTACTGA AGCGAAAGTA
TCCATGCTCG AGGTCTTTGG ACCGCTGACC TGCGTCTACG GCTACCGCGA CCTCGACGAA
GCGATCCGCA TTGCCAACTC GCTGCCCTAT GCCTTCCAGG CGAGCGTCTT TTCCGCCGAC
ATAGCGGTTG CTCTCAGGGC GGCAAAACAT TTGGATGCAT CGGCCGTTCT CGTCAACGAC
CACACCGCGT TCCGCACCGA TTGGATGCCT TTCGCCGGAC GCAGACAGTC AGGATACGGC
GTCGGCGGCA TTCCCTGGAC GATGGAAGAA ATGGCCGACG ACAAGATGGT GGTATTCAAC
CAGGTGACTT AG
 
Protein sequence
MQSTMKVYQA FNRKPIAELP ADDVAALERK LQLAAKSFAD RDGWLPPHQR MAILRKASAL 
LQENRDRFAM MIAREGGKPL TDAIIEVTRG IDGLLNAADE LRNFGGKEIP MGLTAASANR
WAFTTKEPIG VVAAISAFNH PLNLIIHQIA PAIAVGCPVI VKPAATTPIS CIEIVKLFWE
AGLDERWCQT LITEDNALAE AFATDHRVAF LSFIGSAKVG WYLKGKLPPG TRCALEHGGA
APVIVDRSAN VDAIVGTIVK GGYYHAGQVC VSAQRLFVHE DILASFTEAL AARVAALHVG
DPTLMQTEVG PLILPREADR VAAWIKEATD AGTRQIGGGR MSETTLLPSV LLDPPTEAKV
SMLEVFGPLT CVYGYRDLDE AIRIANSLPY AFQASVFSAD IAVALRAAKH LDASAVLVND
HTAFRTDWMP FAGRRQSGYG VGGIPWTMEE MADDKMVVFN QVT
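
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the method used to generate this record: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, the codon table is written in the standard NCBI ordering (translation table 11 assigns the same amino acids as the standard code), and alternative start codons such as GTG or TTG are not given special handling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCAGTCTA CCATGAAGGT CTACCAGGCG"))  # MQSTMKVYQA
```

Passing the full gene sequence to `translate` in the same way should yield the listed protein sequence, and `gc_content` on the full sequence can be compared against the GC content field.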