Gene Rleg_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3801 
Symbol 
ID8014626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3859630 
End bp3861138 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content61% 
IMG OID644826364 
ProductAldehyde dehydrogenase (NAD(+)) 
Protein accessionYP_002977583 
Protein GI241206487 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.804674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCATC AGAAAATCGT CGAGTCGCCG TTCAAGCTGA AATACGGCAA CTATATCGGC 
GGCGAATGGC GCGAGCCGGT CGAGGGAAAA TACTTCGAAA ACCTCACGCC CGTCACCGGC
GGCAAGCTCT GCGACATTCC CCGCTCCAAT GAAAAGGACA TTAACCTCGC ACTCGACGCC
GCTCATGCGG CAAAGGAAAA ATGGGGTCGC ACCTCGGTTG CCGAGCGCTC CAATATCCTC
ATGAAGATCG CCCAGCGCAT GGAAGACAAG CTTGAATTGC TCGCCCAGGC TGAGACCTGG
GACAATGGCA AGCCGATCCG TGAAACCATG GCGGCCGACA TTCCGCTGGC GATCGACCAT
TTCCGTTATT TCGCCTCCTG CATTCGCGCC CAGGAAGGTT CGATCGGCGA GATTGACCAC
GACACTGTCG CCTATCACTT CCATGAGCCG CTCGGCGTCG TCGGCCAGAT CATTCCGTGG
AACTTCCCGA TCCTGATGGC CACCTGGAAG CTGGCGCCCG CACTTGCCGC CGGCAATTGC
GTCGTACTGA AACCCGCCGA GCAGACCCCG GCCTCGATCC TGGTCTGGGC TGAACTCGTC
GGCGATCTCC TGCCGGCAGG GGTCCTCAAC ATCGTCAACG GTTTCGGCCT CGAAGCCGGC
AAGCCGCTGG CGACCAGCCC GCGCGTCGCT AAGATCGCCT TCACCGGCGA GACGACGACA
GGCCGGCTTA TCATGCAATA TGCCAGCCAG AACCTCATTC CGGTGACGCT GGAGCTCGGC
GGCAAATCGC CGAACATCTT CTTCGCCGAC GTGATGGCGG AAGACGACGA CTTCCTCGAC
AAGGCGTTCG AAGGCTTTGC GATGTTTGCC TTGAACCAGG GCGAAGTCTG CACCTGCCCG
AGCCGCGCCC TCGTCCAGGA ATCGATCTAC GACCGTTTCA TGGAAAAGGC CGTCAAACGC
GTTGAGGCGA TCAAGCAGGG CAACCCGCTC GATAGCGCAA CGATGATCGG CGCCCAGGCC
TCGACCGAGC AGCTGGAAAA GATCCTCGCC TATCTCGACA TCGGCAAGCA GGAAGGCGCG
GAAGTGCTGA CCGGCGGCTC GCGCAACGAT CTCGGCGGCG AGCTGGCGAA CGGCTACTAT
GTCAAGCCGA CGATCTTCAA GGGTCACAAC AAGATGCGCG TGTTCCAGGA GGAAATCTTC
GGGCCGGTGG TTTCGGTGAC GACCTTCAAG AACGAGAAGG AAGCGCTCGA AATCGCTAAC
GACACGCTCT ACGGCCTCGG CGCCGGCGTC TGGAGCCGCG ATGCCAATCG CTGCTACCGT
TTCGGCCGCG AGATCCAGGC CGGCCGCGTC TGGACCAACT GCTACCACGC CTACCCGGCC
CATGCCGCCT TCGGCGGCTA CAAGCAGTCG GGCATCGGCC GTGAAACCCA TAAGATGATG
CTCGACCACT ACCAGCAGAC CAAGAACATG CTGGTGAGCT ACAGCCCGAA GGCACTCGGC
TTCTTCTGA
 
Protein sequence
MLHQKIVESP FKLKYGNYIG GEWREPVEGK YFENLTPVTG GKLCDIPRSN EKDINLALDA 
AHAAKEKWGR TSVAERSNIL MKIAQRMEDK LELLAQAETW DNGKPIRETM AADIPLAIDH
FRYFASCIRA QEGSIGEIDH DTVAYHFHEP LGVVGQIIPW NFPILMATWK LAPALAAGNC
VVLKPAEQTP ASILVWAELV GDLLPAGVLN IVNGFGLEAG KPLATSPRVA KIAFTGETTT
GRLIMQYASQ NLIPVTLELG GKSPNIFFAD VMAEDDDFLD KAFEGFAMFA LNQGEVCTCP
SRALVQESIY DRFMEKAVKR VEAIKQGNPL DSATMIGAQA STEQLEKILA YLDIGKQEGA
EVLTGGSRND LGGELANGYY VKPTIFKGHN KMRVFQEEIF GPVVSVTTFK NEKEALEIAN
DTLYGLGAGV WSRDANRCYR FGREIQAGRV WTNCYHAYPA HAAFGGYKQS GIGRETHKMM
LDHYQQTKNM LVSYSPKALG FF