Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1697 |
Symbol | |
ID | 5105343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1635708 |
End bp | 1637162 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507591 |
Product | 3-octaprenyl-4hydroxybenzoate decarboxylase |
Protein accession | YP_001191776 |
Protein GI | 146304460 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.373117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTTCA AGGATCTTCG AGACTATCTA GAGAATATGC GTTCCAAGGG CAAACTTATC GAAGTGGAGC ACGAGGTTGA TGTGAACCTG GAAATAGCCG AGCTAGGGAG AAGGGCCACA TATTCTCATC TCCCACCGCT ACTCTTCACC AGGGTTAAGG GTTATCCTGG CTGGAAAGTG CTCACTAACG TTTACTATTC AATGGAGGGA ATCTATGAGT TGTTTGGGAC CAAGAACCTT GAGGGGATAG CGGACAACTT TCTAGCGAGT TTCGGGGAGG TACCAATCAC AATTCTCGAT AAAGCGAAAT CTCTCCCATC ACTTCTGAAG CTGGGAAAGT ACATGCCGAA GGGTAAGAAG GCCTTGTTCA AAGAGGACAA GAGCCTGAAC CTGGAGGCAT TGCCTGCAAC AAAGACTTGG CCAAAGGATG CAGGCCGATA CATGACGTTT TCTATTGTAA TAACGAAGGA TCCAGAAAAA AACACAAATA ACTTAAGTAT ATATAGAGTA CAAATATTGA ATGAAAGGGA AGCGTTAGTG CACTGGCAAG CGTTCAAGAG AGGATCCCTG GCAGCCTCCA AGTACAGGGA TCTGGGCTTG TCTAAGGTAC CGGTGGCGAT AGTCAATGGA GTGGACCCTA TCATAACTTT CACTGCTGCG TCTCCTGTGC CTCCTGGACT GGACAAGTAT CTGTTTGCCG GGATACTGAG GGATGAGGGC GTGGACGTGG TGGAGTTGGA CAACGGAATA CTGGTTCCAG CCCACGCCGA GTCAGTACTT GAGGGTTACG TAGATTTGAA CGACATGAGG CCTGAAGGAC CCTTTGGGGA TCACCTAGGA TACTACACTC CTCAGGATTA CTACCCAGTG TTTAAGCTCG AGAAAGCCTA CGTGAGGGAG AACCCCATCT ACCATGTAAC TTCCGTGGGA AAACCACCGC TCGAGGACAC TTGGATAGGG AAGGGTGTTG AGAGGATATT CCTGCCTTTC CTGAAGATGA TAATTCCAGA CCTAGTTGAC ATGAACCTGC CAGAGTATGG TCTTTTCACT AGTATAGGAA TATTTTCAAT ACGGAAAAGG TATCCAGGAC AGGCCAGGAG GGCCATGATG TCAATTTGGG GTACTGGGCA ACTTAGCTTC CTGAAGCTGG TGATCATTGT TGATAGCGAC GTTAACATTC ATGATATGAA CCAGGTTCTT TATGCCATTG CCAGTACGGT GGATCCTCAA CGCGACGTTA TGATAGTTAC CAATGCCCTA AATGACAGTT TAGATCATAC AGTTCCTAAT CCCCCTCTTG GGAGCAAAAT GGGGATAGAC GCAACTAGAA AGTTCAAGGA GGAGTTAGGG AAGGATTGGC CAGAGCCCGT AGAAAGTGAT CCAGACGTGG TTAAGAGGAT ATCCGAGGTC TGGGAGAAGA TAGCCTCTAA ATGGCCAAGT CCTCCCTCAA GGTAG
|
Protein sequence | MGFKDLRDYL ENMRSKGKLI EVEHEVDVNL EIAELGRRAT YSHLPPLLFT RVKGYPGWKV LTNVYYSMEG IYELFGTKNL EGIADNFLAS FGEVPITILD KAKSLPSLLK LGKYMPKGKK ALFKEDKSLN LEALPATKTW PKDAGRYMTF SIVITKDPEK NTNNLSIYRV QILNEREALV HWQAFKRGSL AASKYRDLGL SKVPVAIVNG VDPIITFTAA SPVPPGLDKY LFAGILRDEG VDVVELDNGI LVPAHAESVL EGYVDLNDMR PEGPFGDHLG YYTPQDYYPV FKLEKAYVRE NPIYHVTSVG KPPLEDTWIG KGVERIFLPF LKMIIPDLVD MNLPEYGLFT SIGIFSIRKR YPGQARRAMM SIWGTGQLSF LKLVIIVDSD VNIHDMNQVL YAIASTVDPQ RDVMIVTNAL NDSLDHTVPN PPLGSKMGID ATRKFKEELG KDWPEPVESD PDVVKRISEV WEKIASKWPS PPSR
|
| |