Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_1978 |
Symbol | |
ID | 4115731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 1999235 |
End bp | 2000152 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 638036764 |
Product | porphobilinogen deaminase |
Protein accession | YP_644737 |
Protein GI | 108804800 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000417837 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGGGC GGCTGGTGCT GGGCACCCGG GGCTCGCCGC TGGCCCTGGC TCAGGCGGAG GCCTGCGCCG CGGGGCTGCG GGCGGCGGGC TTTGCGGTGG AGCTGCGCCG GATAAGGACC ACCAGCGACC GGCGTCCCGA CGACCCCCTC TCGGTGATAG ACCAGCGGGA CGTGTTCACC CGCCAGCTGG ACGAGGCGCT GCTCGCCGGG GAGGTGGACC TCGCCGTCCA CTCCATGAAG GACGTGCCCA CCGAGGTGCC GGAGGGGATC GTGCTGGCCG CCGTCGCCGG CCGGGCCGAC CCCTCTGACG CGCTGGTCTC GGAGGGGGGC TGGGGCGTTG ACGGGCTGCC CGAAGGGGCG CGGGTCGCGA CCTCCAGCCT GCGCCGCCGG GCACAGCTTC TGCACCGCAG GCCGGACCTC AGGGTGGTCG AGATCCGGGG CAACGTGGAC ACCCGGATCC GCAAGATGCG CGCGGGGGCC GCGGAGGCGG TGGTGCTGGC CCGGGCCGGG CTCGTGCGGC TGGGGCTCGA GGTGCCCCAC GCCGTGATCC CGCACGACGT CCTGCTGCCC GCGGTCGGGC AGGGGGCGCT CGCCGTCGCG GTGCGCCGGG GGGATCCGCG TCTGGAGGAG ATCCGGCGGG CCCTCAACGA CCCGGCCGCC GAGCGCGAGG TCGCGGCCGA GCGGGCGCTG CTGCGGGCGC TCGAGGGGGG CTGCCGGGTC CCGGTGGGGG CGCGCGCCGT CGCCGGGGGG CGGGGGGTGC TCCTGCGGGG GGTGGTCGTC TCCCCCGACG GGGCGGCGCT GTGCGGGGGC GAGGAGCGCG GGGAGGAGCC GGAGGAGGTC GGGCGGCGGC TGGCCGCGAG GCTCCTCGAG CGCGGGGCCG CCGGTATACT GGGGTTCGTG CGGGGGGTGA AGCCGTGA
|
Protein sequence | MAGRLVLGTR GSPLALAQAE ACAAGLRAAG FAVELRRIRT TSDRRPDDPL SVIDQRDVFT RQLDEALLAG EVDLAVHSMK DVPTEVPEGI VLAAVAGRAD PSDALVSEGG WGVDGLPEGA RVATSSLRRR AQLLHRRPDL RVVEIRGNVD TRIRKMRAGA AEAVVLARAG LVRLGLEVPH AVIPHDVLLP AVGQGALAVA VRRGDPRLEE IRRALNDPAA EREVAAERAL LRALEGGCRV PVGARAVAGG RGVLLRGVVV SPDGAALCGG EERGEEPEEV GRRLAARLLE RGAAGILGFV RGVKP
|
| |