Gene Rleg2_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4420 
Symbol 
ID6977514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp50624 
End bp52078 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content64% 
IMG OID643393598 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002278416 
Protein GI209546498 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGC CTCTCGAACA CCTGTCCCGC AACTGTCGTC TGGACAAGTT CTTCATCGAC 
GGTCGATGGC TCGAACCGAG GGGCAGCGCC AAGGGCGTCG TGGTCAATCC TGCCACGGAA
GAGGTGGTGA CCCAGTTTGC GCTTGGCAAC GGCGAGGATG TCGACGCCGC AGTTTCCGCC
GCCCGGCGGG CCTTCGCCAG CTGGGGCCGG ACCACGCCGG AATACCGCGC GGCTCTGCTC
GACCGTCTGC AGACGCTGCT CGAAGCGCGC AGCGAACTGT TCGCGCAGTG CCTGAGCCTC
GAAATGGGCG CAGCGATCGG ATATGCACGC AACGCGCAGG TGCCGCTGGC GATCGCCCAT
GTCAAGGTCG CCCGCGATCT CCTGGAGGCC TTCCCCTTCG TCAAGCAGCG CGGCCACACC
GCCGTCACCC ACGAACCGAT CGGCGTCTGC GCACTGATCA CTCCGTGGAA CTGGCCGCTC
TACCAGATCA CCGCCAAAGT TTCGCCGGCA CTCGCCGCGG GCTGCACGGT CGTGCTGAAG
CCCAGCGAGC TTTCGCCTCT CGGCGCGCTG CTGTTTGCAG AGACGATCGA GGAGGCAGGC
TTCCCCAACG GCGTCTTCAA CCTCGTCAAT GGCGACGGCC CCTGCGTCGG TTCGGCTCTT
GCGTCCCATC CGGATGTCGA CATGATTTCG ATTACCGGCT CCACCCGTGC CGGCATCGCC
GTGGCCCAGG CTGCCGCACC GACCGTCAAG CGCGTCGCAC AGGAGCTTGG CGGCAAGTCG
CCGAACGTCA TCCTGCCGGA CGCCGATCTC AACCGCGCCG TTTCCATGGG CGTAGCGGCT
GCCTTCCGCA ATCTTGGCCA ATCATGCAGC GCACCGACAC GGATGATCGT GCCGCGCGCG
CTGCTGTCCG AGAGTGAGTC CATCGCCAGG CGAGCCGCAG ATGAGATCGT TGTCGGTGAT
CCCCTTTCAG AAGCCAGCAC GCATGGCGCG ATCGCCAACC GCGCCCAGTT CGATCGGATC
CAAACCATGA TTGGCGTCGG GATTGCCGAG GGCGCGACGC TCATCGCCGG CGGCGAGGGG
CGTCCGGCCG GTCTGAATGC CGGATTCTAC GTTAAGCCGA CAATTTTCTC GAAAGTGCGC
ACCGACATGC GCATTGCGCA GGAGGAAATC TTCGGCCCGG TTCTCTGCCT CATCCCCTAC
GACACGGTGG AGGAAGCCAT CGCCATCGCC AACCATACCG TCTACGGCCT GGGTGCGCAT
GTGCAGGGCA AGGATATAGA GGTCGTCAAG GACGTCGCGT CACAGATCCG CTGCGGTCAA
GTGCATCTCA ACTATCCCGC CTGGGATCCA CACGCGCCTT TTGGCGGCTT CAAACAATCT
GGAAATGGAC GCGAATTCGG CATTGAAGGC ATGCTGGAAT ACCTCGAGGT CAAATCCATC
CTTGGCTACT TCTAG
 
Protein sequence
MIAPLEHLSR NCRLDKFFID GRWLEPRGSA KGVVVNPATE EVVTQFALGN GEDVDAAVSA 
ARRAFASWGR TTPEYRAALL DRLQTLLEAR SELFAQCLSL EMGAAIGYAR NAQVPLAIAH
VKVARDLLEA FPFVKQRGHT AVTHEPIGVC ALITPWNWPL YQITAKVSPA LAAGCTVVLK
PSELSPLGAL LFAETIEEAG FPNGVFNLVN GDGPCVGSAL ASHPDVDMIS ITGSTRAGIA
VAQAAAPTVK RVAQELGGKS PNVILPDADL NRAVSMGVAA AFRNLGQSCS APTRMIVPRA
LLSESESIAR RAADEIVVGD PLSEASTHGA IANRAQFDRI QTMIGVGIAE GATLIAGGEG
RPAGLNAGFY VKPTIFSKVR TDMRIAQEEI FGPVLCLIPY DTVEEAIAIA NHTVYGLGAH
VQGKDIEVVK DVASQIRCGQ VHLNYPAWDP HAPFGGFKQS GNGREFGIEG MLEYLEVKSI
LGYF