Gene HS_0474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0474 
SymbolubiD 
ID4239956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp504994 
End bp506469 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content39% 
IMG OID638104022 
Product3-octaprenyl-4hydroxybenzoate decarboxylase 
Protein accessionYP_718685 
Protein GI113460619 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATA AAAATCTACG AGATTTTCTT GAATTATTAG AGAAACAAGG TGAGCTCAAA 
AGAATTACAC AGGAAATCGA CCCTTATTTA GAAATGACAG AAATTGCTGA CCGCACTTTG
CGTGCCGGTG GTCCCGCATT ACTTTTTGAA AATCCAAAAG GCTATGAAAT TCCTGTGCTT
TGTAATTTAT TTGGTACTCC TAAACGTGTT GCTCTTGGAA TGGGGCAGGA AGATGTTACC
GCACTGCGTG ATGTAGGGAG ATTACTGGCT TTTCTAAAAG AACCTGAACA ACCAAAAAGT
TTTAAAGATT TATGGTCAAC TCTTCCTCAA TTTAAACAAG TGCTAAATAT GCCAACGAAA
GTTTTGAGTA AAGCAGAGTG TCAGCAAATT GTATTCTCTG ATGCTGAAGT TGATTTATAT
AAATTACCTA TTATGCACTG TTGGAAAGAT GATGTTGCAC CTTTAGTTAC ATGGGGATTA
ACCATCACTA AAGGACCAAG TAAAAAAAGA CAAAATTTAG GTATTTATCG CCAACAATTA
ATAGGAAAAA ATAAACTCAT TATGCGTTGG CTATCTCACC GTGGCGGTGC GTTGGATTTT
CAAGAATGGA AAGAAGCACG CCCTAATCAA CCCTTTCCTA TTTCAGTTGC TTTAGGGGCA
GATCCTGCCA CTATTCTAGG TGCGGTCACA CCAGTTCCGG ATACCTTATC GGAATATGCT
TTTGCCGGAT TATTACGTGG TAATAAAACG GAAGTGGTAA AATCAATCAG TAATGATCTT
GAAATACCTG CAAGTGCGGA GATTATTTTG GAAGGTTATA TTGATCCAAC GGAGACCGCA
CTTGAAGGTC CATACGGAGA TCATACGGGT TATTACAATG AACAAGAATA TTTTCCTGTA
TTTACCGTGA CACATCTTAC CATGCGTAAA GATCCGATTT ATCATTCAAC TTACACAGGT
CGTCCACCGG ATGAGCCTGC AGTTTTGGGT GAAGCACTGA ACGAGGTTTT TATTCCTATT
TTGCAAAAGC AGTTTCCGGA AATTGTCGAT TTCTATCTTC CTCCGGAAGG ATGCTCTTAC
CGTCTTGCAG TTGTTACAAT AAAAAAACAA TATGCAGGCC ACGCTAAGAG AGTCATGATG
GGAGTATGGT CATTTTTACG CCAGTTTATG TACACAAAAT TTGTGATTGT CTGTGATGAC
GATATAAATG CACGAGATTG GAAAGATGTG ATTTGGGCAA TTACAACACG TAGCGATCCC
GCCAGAGATT GTACAATTAT AGAAAATACG CCTATTGATT ATCTTGATTT TGCCTCACCG
ATTGCTGGTC TCGGCTCAAA AATGGGAATA GATGCGACAA ACAAATGGAT TGGAGAAACG
CAACGTGAAT GGGGAACCCC AATTAAAAAA GCCCCTAATG TAGTTAAACG CATTGATGAT
ATTTGGGAGA GTCTAAATAT TTTTGCTCCC AAATAA
 
Protein sequence
MKYKNLRDFL ELLEKQGELK RITQEIDPYL EMTEIADRTL RAGGPALLFE NPKGYEIPVL 
CNLFGTPKRV ALGMGQEDVT ALRDVGRLLA FLKEPEQPKS FKDLWSTLPQ FKQVLNMPTK
VLSKAECQQI VFSDAEVDLY KLPIMHCWKD DVAPLVTWGL TITKGPSKKR QNLGIYRQQL
IGKNKLIMRW LSHRGGALDF QEWKEARPNQ PFPISVALGA DPATILGAVT PVPDTLSEYA
FAGLLRGNKT EVVKSISNDL EIPASAEIIL EGYIDPTETA LEGPYGDHTG YYNEQEYFPV
FTVTHLTMRK DPIYHSTYTG RPPDEPAVLG EALNEVFIPI LQKQFPEIVD FYLPPEGCSY
RLAVVTIKKQ YAGHAKRVMM GVWSFLRQFM YTKFVIVCDD DINARDWKDV IWAITTRSDP
ARDCTIIENT PIDYLDFASP IAGLGSKMGI DATNKWIGET QREWGTPIKK APNVVKRIDD
IWESLNIFAP K