Gene Rleg2_3624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3624 
Symbol 
ID6982386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3746407 
End bp3747621 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content62% 
IMG OID643398348 
Product5-aminolevulinate synthase 
Protein accessionYP_002283115 
Protein GI209551198 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes 
TIGRFAM ID[TIGR00858] 8-amino-7-oxononanoate synthase
[TIGR01821] 5-aminolevulinic acid synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.576117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.979947 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTCG AAGCGTTTTT CAAAAACGAG TTGGACGGGC TTCATGCCGA GGGCCGTTAC 
CGCGTTTTTG CCGATCTCGA GCGTCACCGC GGCAATTTTC CGCGCGCGAC GCGCCATACG
GCCGATGGCG AAAAGGAAGT CACGGTCTGG TGCTCCAACG ACTATCTCGG CATGGGCCAG
AACCCGAAGG TGATCGAGGC GATGAAAAAC GCCATCGACC ACTGTGGCGC GGGTGCGGGA
GGCACCCGGA ATATTTCTGG CACCAACCAC TACCACGTCA TGCTCGAGCG TGAGCTTGCC
GATCTGCACG CCAAGGAAGC AGCGCTGATC TTCACCTCGG GCTATGTGTC CAACTGGGCA
GCGCTCGGCA CGCTTGGCGC CAAGATCCCC GGCCTGATCA TCTTCTCCGA CGCGCTGAAC
CATGCCTCGA TGATCGAGGG CATCCGCCAT GCCAAATGCG ACAAGGTGAT CTGGAAGCAC
AATGACGTCG CCGATCTCGA AGCCAAGCTT AAGGCCGCCG ATCCGAAGGC GCCGAAGCTG
ATCGCCTTCG AAAGCGTCTA TTCGATGGAC GGCGATATCG CCCCGATCAA GGAAATCTGC
GACCTCGCCG ATAAATACGG CGCGATGACC TATCTCGACG AAGTGCATGC GGTCGGCATG
TACGGCCCGC GCGGCGGCGG CATAGCCGAA CGCGAGGGGC TGATGGACCG GCTGACTGTC
ATCGAGGGCA CGCTCGGCAA GGCTTTCGGG GTGATGGGCG GTTATATCGC CGCGTCTTCA
GCGCTTTGCG ATTTCATCCG TTCCTTCGCC TCCGGCTTCA TCTTCACTAC GGCGCTGCCG
CCGGCGCTTG CCGCCGGTGC CGTCGCCTCG ATCCAGCATC TGAAGGTCAG CCAGTTCGAG
CGCGCCCGCC ATCAGGACCG GGTGCGCAAA CTCCGGATGC TGCTCGATCA GAGCGGCATT
CCGCATGTGC CCAATCCGAG CCATATCGTG CCGGTGCTTG TCGGCGATGC CGCCAAATGC
AAGTGGATTT CCGACCTGCT GCTCGACAAT TGCGGCGTCT ACGTCCAGCC GATCAACTAT
CCCACCGTGC CGAAGAAGAC CGAACGCTTG CGCATCACGC CGACGCCGCT GCATTCCGAT
GCCGACATCG CCCATCTGGT CGAGGCGCTG CATTCGCTCT GGTCGCGCTG CGCGCTGGCA
AGACACGTCG CCTGA
 
Protein sequence
MDFEAFFKNE LDGLHAEGRY RVFADLERHR GNFPRATRHT ADGEKEVTVW CSNDYLGMGQ 
NPKVIEAMKN AIDHCGAGAG GTRNISGTNH YHVMLERELA DLHAKEAALI FTSGYVSNWA
ALGTLGAKIP GLIIFSDALN HASMIEGIRH AKCDKVIWKH NDVADLEAKL KAADPKAPKL
IAFESVYSMD GDIAPIKEIC DLADKYGAMT YLDEVHAVGM YGPRGGGIAE REGLMDRLTV
IEGTLGKAFG VMGGYIAASS ALCDFIRSFA SGFIFTTALP PALAAGAVAS IQHLKVSQFE
RARHQDRVRK LRMLLDQSGI PHVPNPSHIV PVLVGDAAKC KWISDLLLDN CGVYVQPINY
PTVPKKTERL RITPTPLHSD ADIAHLVEAL HSLWSRCALA RHVA