Gene Rleg_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4020 
SymbolhemC 
ID8014826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4097007 
End bp4097936 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content67% 
IMG OID644826589 
Productporphobilinogen deaminase 
Protein accessionYP_002977800 
Protein GI241206704 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.780867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACAA AACCTTTCCG GATCGGCACG CGAGGCAGCC CGCTGGCGCT TGCCCAGGCG 
CATGAGGCCC GCGACAGGCT GATGGCGGCG CATCATCTGC CCGAGGACAT GTTCGAGATC
GTCGTGCTGA CGACCAAGGG CGACCGCATC ACCGACCGGT CGCTGGCCGA GATCGGCGGC
AAGGGGCTGT TCACCGAAGA GCTCGAACAG AAGCTTGCCG CCGGCGAGCT CGATTTCGCC
GTGCATTCCG CCAAGGATAT GGCGACGAAG CTGCCCGAGG GGCTTTATCT CTCTGCCTAT
CTGCCCCGCG AGGATATCCG CGACGCCGTC ATCGGCCGCA CCGCGCGCAA ACTAATCGAC
CTGCCGCATG GCGCCACCGT CGGTTCCTCC TCGCTCCGCC GCCAGGCGCT GATCCGCCGC
ATGCGGCCGG ATATCAATGT CATCACCTTC CGCGGCCTGG TCGAAACCCG CCTGCGCAAG
CTCGAACAGG GCGAGGTGGA TGCGACCCTG CTGGCGCTTG CCGGCCTGAA ACGGCTCGGC
AAGGTCGACG TGCTGACCGA TATCCTCGAT CCCGACACCT TCCCGCCGGC CCCGGCGCAG
GGGGCGATCT GCATCGAAAG CCGCATCGGC GATGCCAGGG TCGACGATTT GCTGGCGCCG
GTTAACGATG GCCCGACTTT CGACACCGTC TCCTGCGAAC GCGCCTTCCT CGCCGCACTC
GACGGCTCCT GCCGCACGCC GATCGGCGGT TATGCCGTCT GCGAAGGCGA CCTGATCCGG
TTCTCCGGCC TCATCATCAC CCCCGACGGC CGCAGCCAGC ATGCGGTGAC GACTGACGGC
CACCGCCGCG ATGCGGCAGC GCTCGGCACC CGCGCCGGCC AGGACGTGCG CGCCAGGGCC
GGCAGCGCCT TTTTCGACGA CTGGCACTGA
 
Protein sequence
MQTKPFRIGT RGSPLALAQA HEARDRLMAA HHLPEDMFEI VVLTTKGDRI TDRSLAEIGG 
KGLFTEELEQ KLAAGELDFA VHSAKDMATK LPEGLYLSAY LPREDIRDAV IGRTARKLID
LPHGATVGSS SLRRQALIRR MRPDINVITF RGLVETRLRK LEQGEVDATL LALAGLKRLG
KVDVLTDILD PDTFPPAPAQ GAICIESRIG DARVDDLLAP VNDGPTFDTV SCERAFLAAL
DGSCRTPIGG YAVCEGDLIR FSGLIITPDG RSQHAVTTDG HRRDAAALGT RAGQDVRARA
GSAFFDDWH