Gene Information |
Locus tag | Rleg_4020 |
Symbol | hemC |
ID | 8014826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4097007 |
End bp | 4097936 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644826589 |
Product | porphobilinogen deaminase |
Protein accession | YP_002977800 |
Protein GI | 241206704 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
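The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(4097007, 4097936)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 930 309
```

Under these assumptions the result agrees with the Gene Length (930 bp) and Protein Length (309 aa) fields.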
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.780867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACAA AACCTTTCCG GATCGGCACG CGAGGCAGCC CGCTGGCGCT TGCCCAGGCG CATGAGGCCC GCGACAGGCT GATGGCGGCG CATCATCTGC CCGAGGACAT GTTCGAGATC GTCGTGCTGA CGACCAAGGG CGACCGCATC ACCGACCGGT CGCTGGCCGA GATCGGCGGC AAGGGGCTGT TCACCGAAGA GCTCGAACAG AAGCTTGCCG CCGGCGAGCT CGATTTCGCC GTGCATTCCG CCAAGGATAT GGCGACGAAG CTGCCCGAGG GGCTTTATCT CTCTGCCTAT CTGCCCCGCG AGGATATCCG CGACGCCGTC ATCGGCCGCA CCGCGCGCAA ACTAATCGAC CTGCCGCATG GCGCCACCGT CGGTTCCTCC TCGCTCCGCC GCCAGGCGCT GATCCGCCGC ATGCGGCCGG ATATCAATGT CATCACCTTC CGCGGCCTGG TCGAAACCCG CCTGCGCAAG CTCGAACAGG GCGAGGTGGA TGCGACCCTG CTGGCGCTTG CCGGCCTGAA ACGGCTCGGC AAGGTCGACG TGCTGACCGA TATCCTCGAT CCCGACACCT TCCCGCCGGC CCCGGCGCAG GGGGCGATCT GCATCGAAAG CCGCATCGGC GATGCCAGGG TCGACGATTT GCTGGCGCCG GTTAACGATG GCCCGACTTT CGACACCGTC TCCTGCGAAC GCGCCTTCCT CGCCGCACTC GACGGCTCCT GCCGCACGCC GATCGGCGGT TATGCCGTCT GCGAAGGCGA CCTGATCCGG TTCTCCGGCC TCATCATCAC CCCCGACGGC CGCAGCCAGC ATGCGGTGAC GACTGACGGC CACCGCCGCG ATGCGGCAGC GCTCGGCACC CGCGCCGGCC AGGACGTGCG CGCCAGGGCC GGCAGCGCCT TTTTCGACGA CTGGCACTGA
Protein sequence | MQTKPFRIGT RGSPLALAQA HEARDRLMAA HHLPEDMFEI VVLTTKGDRI TDRSLAEIGG KGLFTEELEQ KLAAGELDFA VHSAKDMATK LPEGLYLSAY LPREDIRDAV IGRTARKLID LPHGATVGSS SLRRQALIRR MRPDINVITF RGLVETRLRK LEQGEVDATL LALAGLKRLG KVDVLTDILD PDTFPPAPAQ GAICIESRIG DARVDDLLAP VNDGPTFDTV SCERAFLAAL DGSCRTPIGG YAVCEGDLIR FSGLIITPDG RSQHAVTTDG HRRDAAALGT RAGQDVRARA GSAFFDDWH
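The sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one possible way to strip that formatting, compute a GC fraction, and translate codons using the amino acid assignments of translation table 11 (which are the same as those of the standard table 1). It is an illustration under stated assumptions, not the pipeline that produced this record: the helper names are hypothetical, and the alternative start codons that table 11 permits are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11,
# with codons enumerated in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGCAAACAA AACCTTTCCG GATCGGCACG"
print(translate(first_blocks))  # MQTKPFRIGT
```

The ten residues printed match the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.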