Gene Hore_14820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_14820 
Symbol 
ID7313073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp1577169 
End bp1578614 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content46% 
IMG OID643611923 
ProductUbiD family decarboxylase 
Protein accessionYP_002509226 
Protein GI220932318 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTACC GTGATTTAAG TGACTTTATT AAGACCCTTG AAAGGGAAAA TGAACTGGTG 
AGGGTCGGGG TTGAAGTAGA CCCGGTCCTG GAAATAACAG AAATAACCGA CCGTGTTGTT
AAAAATGGGG GGCCGGCCCT TTTATTTGAA AATGTTAAGG GTTCTGATTT TCCGGTTTTA
ATTAATGCCT TTGGTTCTAT CAGGAGAATG GAGCTGGCTC TGGAGGCAGA AAGCCTTGAT
GATATAGGGG ACAGGATAAA TCAGTATTTA CAGCTGCCGA CCCCGGAAAA TTTTCTGGAT
AAGTTAAAAA TGCTCCCCCT GTTGAAAGAA CTGGGGGATT TTTTACCGCG TACAGTAAAG
AAGGCTCCCT GCCAGGAAGT GGTTGAGGAG AATGTTGACC TGGGAAGTCT TCCCATATTG
AAGTGCTGGC CCGGTGACGG GGGTCGTTAT ATTACCCTTC CCCTGGTTTT TACAAAAGAC
CCGGGGACAG GTCGCCAGAA TGTGGGGATG TACCGCCTTC AGGTTTTTGA TAAAAAAACG
ACCGGTATGC ACTGGCATAT ACATAAAGAT GGTGCCGAGA ATTATCGGAA ACATCTTGGC
AACAGGGAAA AAATGGAGGT TGCTGTAGCC ATCGGGGCCG ATCCCGCTAC TATATATGCG
GCCACCGCTC CCCTGCCAGC CGGGATAGAT GAGATGCTTT TTGCCGGGTT TTTACGTAAG
GAGCCGGTTG AGATGGTAAA ATGCCGGACT GTGGACCTTA AAGTCCCGGC CAATGCTGAA
ATAATTCTCG AGGGTTATGT TAAACCGGGG GAGTTAAGGA AAGAAGGACC CTTTGGTGAC
CATACGGGTT ATTATTCCCT GGTTGATGAG TATCCTGTCT TTCATGTCCA GGCCATTACC
AGGCGGAATC AGCCCGTTTA TAATGCAACA GTAGTTGGCA AACCTCCCAT GGAAGATTGT
TTTATGGCTA AAGCAACGGA GAGGATCTTC TTACCACTCC TCAAGATGCA ACTTCCGGAA
ATTGTAGATA TGAATTTACC CCTCGAGGGG GTATTCCATA ACTGTGCTAT AATTTCTATA
AAAAAATCCT ACCCCGGCCA TGCTAAAAAA GTAATGCACG CCCTGTGGGG GCTGGGTCAG
ATGATGTATA CTAAAATGAT TATTGTAGTT GATGAAGATG TCGATGTTCA GGACCTGTCA
ACAGTGGCCT GGAAGGTGTT TAATAATATA GATGCCAGAC GGGATGTAGT TATTGTCGAT
GGTCCGCTTG ATGCCCTGGA CCACTCTTCC CCTGTCCGCC ATTACGGCTC CAAGATGGGG
ATTGATGCTA CTAAAGCCTG GCCGGAAGAG GGTCATACAA GGGAGTGGCC CCCTGAAATA
GAGATGAATA AAAAGATAAA AAAGCTGGTC GATAAGAGGT GGCAGGAGTA TGGAATTAAT
ATATAA
 
Protein sequence
MAYRDLSDFI KTLERENELV RVGVEVDPVL EITEITDRVV KNGGPALLFE NVKGSDFPVL 
INAFGSIRRM ELALEAESLD DIGDRINQYL QLPTPENFLD KLKMLPLLKE LGDFLPRTVK
KAPCQEVVEE NVDLGSLPIL KCWPGDGGRY ITLPLVFTKD PGTGRQNVGM YRLQVFDKKT
TGMHWHIHKD GAENYRKHLG NREKMEVAVA IGADPATIYA ATAPLPAGID EMLFAGFLRK
EPVEMVKCRT VDLKVPANAE IILEGYVKPG ELRKEGPFGD HTGYYSLVDE YPVFHVQAIT
RRNQPVYNAT VVGKPPMEDC FMAKATERIF LPLLKMQLPE IVDMNLPLEG VFHNCAIISI
KKSYPGHAKK VMHALWGLGQ MMYTKMIIVV DEDVDVQDLS TVAWKVFNNI DARRDVVIVD
GPLDALDHSS PVRHYGSKMG IDATKAWPEE GHTREWPPEI EMNKKIKKLV DKRWQEYGIN
I