Gene Msed_1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1697 
Symbol 
ID5105343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1635708 
End bp1637162 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content48% 
IMG OID640507591 
Product3-octaprenyl-4hydroxybenzoate decarboxylase 
Protein accessionYP_001191776 
Protein GI146304460 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.373117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTCA AGGATCTTCG AGACTATCTA GAGAATATGC GTTCCAAGGG CAAACTTATC 
GAAGTGGAGC ACGAGGTTGA TGTGAACCTG GAAATAGCCG AGCTAGGGAG AAGGGCCACA
TATTCTCATC TCCCACCGCT ACTCTTCACC AGGGTTAAGG GTTATCCTGG CTGGAAAGTG
CTCACTAACG TTTACTATTC AATGGAGGGA ATCTATGAGT TGTTTGGGAC CAAGAACCTT
GAGGGGATAG CGGACAACTT TCTAGCGAGT TTCGGGGAGG TACCAATCAC AATTCTCGAT
AAAGCGAAAT CTCTCCCATC ACTTCTGAAG CTGGGAAAGT ACATGCCGAA GGGTAAGAAG
GCCTTGTTCA AAGAGGACAA GAGCCTGAAC CTGGAGGCAT TGCCTGCAAC AAAGACTTGG
CCAAAGGATG CAGGCCGATA CATGACGTTT TCTATTGTAA TAACGAAGGA TCCAGAAAAA
AACACAAATA ACTTAAGTAT ATATAGAGTA CAAATATTGA ATGAAAGGGA AGCGTTAGTG
CACTGGCAAG CGTTCAAGAG AGGATCCCTG GCAGCCTCCA AGTACAGGGA TCTGGGCTTG
TCTAAGGTAC CGGTGGCGAT AGTCAATGGA GTGGACCCTA TCATAACTTT CACTGCTGCG
TCTCCTGTGC CTCCTGGACT GGACAAGTAT CTGTTTGCCG GGATACTGAG GGATGAGGGC
GTGGACGTGG TGGAGTTGGA CAACGGAATA CTGGTTCCAG CCCACGCCGA GTCAGTACTT
GAGGGTTACG TAGATTTGAA CGACATGAGG CCTGAAGGAC CCTTTGGGGA TCACCTAGGA
TACTACACTC CTCAGGATTA CTACCCAGTG TTTAAGCTCG AGAAAGCCTA CGTGAGGGAG
AACCCCATCT ACCATGTAAC TTCCGTGGGA AAACCACCGC TCGAGGACAC TTGGATAGGG
AAGGGTGTTG AGAGGATATT CCTGCCTTTC CTGAAGATGA TAATTCCAGA CCTAGTTGAC
ATGAACCTGC CAGAGTATGG TCTTTTCACT AGTATAGGAA TATTTTCAAT ACGGAAAAGG
TATCCAGGAC AGGCCAGGAG GGCCATGATG TCAATTTGGG GTACTGGGCA ACTTAGCTTC
CTGAAGCTGG TGATCATTGT TGATAGCGAC GTTAACATTC ATGATATGAA CCAGGTTCTT
TATGCCATTG CCAGTACGGT GGATCCTCAA CGCGACGTTA TGATAGTTAC CAATGCCCTA
AATGACAGTT TAGATCATAC AGTTCCTAAT CCCCCTCTTG GGAGCAAAAT GGGGATAGAC
GCAACTAGAA AGTTCAAGGA GGAGTTAGGG AAGGATTGGC CAGAGCCCGT AGAAAGTGAT
CCAGACGTGG TTAAGAGGAT ATCCGAGGTC TGGGAGAAGA TAGCCTCTAA ATGGCCAAGT
CCTCCCTCAA GGTAG
 
Protein sequence
MGFKDLRDYL ENMRSKGKLI EVEHEVDVNL EIAELGRRAT YSHLPPLLFT RVKGYPGWKV 
LTNVYYSMEG IYELFGTKNL EGIADNFLAS FGEVPITILD KAKSLPSLLK LGKYMPKGKK
ALFKEDKSLN LEALPATKTW PKDAGRYMTF SIVITKDPEK NTNNLSIYRV QILNEREALV
HWQAFKRGSL AASKYRDLGL SKVPVAIVNG VDPIITFTAA SPVPPGLDKY LFAGILRDEG
VDVVELDNGI LVPAHAESVL EGYVDLNDMR PEGPFGDHLG YYTPQDYYPV FKLEKAYVRE
NPIYHVTSVG KPPLEDTWIG KGVERIFLPF LKMIIPDLVD MNLPEYGLFT SIGIFSIRKR
YPGQARRAMM SIWGTGQLSF LKLVIIVDSD VNIHDMNQVL YAIASTVDPQ RDVMIVTNAL
NDSLDHTVPN PPLGSKMGID ATRKFKEELG KDWPEPVESD PDVVKRISEV WEKIASKWPS
PPSR