Gene YpsIP31758_2357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2357 
SymbolhpaB 
ID5384902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2656388 
End bp2657950 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content49% 
IMG OID640865346 
Product4-hydroxyphenylacetate 3-monooxygenase, oxygenase component 
Protein accessionYP_001401326 
Protein GI153948091 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2368] Aromatic ring hydroxylase 
TIGRFAM ID[TIGR02310] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCAG AAAATTTCCG AGCGAATAAC CAACGTCCTT TTACCCATAA AGAATATCTG 
GACAGTTTAC AAGATGGCCG AGAGATTTAC ATTTACGGCG AGCGAGTGAA AAACGTTACC
ACTCATCCGG CTTTTCGCAA TGCTGCGGCG TCCATTGGCC AGCTATATGA TGCACTCCAT
GACCCTCAGT CCCATGAACA GCTCTGTTGG AATACTGATA CGGGTAATGG CGGCTATACA
CATAAATTTT TTCGCTATGC CACTAGCCCG GCAGAGTTAC GTCAGCAACG GGATGCTATT
GCCCACTGGT CACGTCAGAA TTACGGTTGG ATGGGGCGCT CACCGGATTA CAAGGCAGCG
TTTGGCAGCG TATTGGGCGC ATTTCCCGAG TTTTATGGTC AATTTGCGGA TAATGCCCGC
AATTGGTATC AGCGTATCCA GGAATCCGGC CTCTATTTTA ATCATGCCAT CGTCAACCCG
CCCCTTGATC GCCATAAGTC CGCAGATGAA GTGAAAGATG TGTATATCCA TATCGAAAAA
GAGACCGATG CCGGAATTAT TGTCAGTGGA GCCAAAGTGG TCGCCACTAA CTCGGCATTA
ACACATTACA ATTTTATCGG TTTTGGATCC GCTCAAGTGA TGGGGGAAGA CCCTGATTTT
GCGCTGATGT TTGTCGCACC AATGGATGCT GACGGTATGA AGTTGATTTC CCGCGGATCT
TACGAACTGA TGGTGGGCGC GACAGGCTCC CCGTTTGATT ATCCGCTCTC CAGTCGATTT
GATGAGAACG ATGCGATTTT AGTGATGGAT AACGTACTTA TCCCTTGGGA GAATATTTTA
ATCTACCGTG ATTTTGATCG CTGCCGTCGG TGGTCTACTC AGGGAGGATT CGCCCGGTTA
TTCCCACTTC AGGCTTGTAC GCGTCTCGGC GTGAAACTGG ACTTTATTAC CGCATTGCTG
AAAAAGAGTT TAATGTGCAC TGGCTCACTG GAGTTTCGTG GTGTCCAAGC GGATTTAGGT
GAAGTGGTCG CTTGGCGTAA CGTGATCTGG GCGTTAAGCG ATGCCATGTG CGCCGAGGCG
AAACCTTGGG TGAACGGCGC TTATCTACCC GATCTCGCAG CATTGCAAGC CTACCGCGTC
ATCGCACCGA TGGCATACTG CAAAATCAAG AATATCATTG AGCGTACGCT CGCCAGTGGT
CTGATTTATC TCCCCTCCAG TGTCCGTGAT CTACAAAATC CGGCTATCGA CAAATACCTG
TCGCGTTATG TGCGGGGTTC AAACGGTATC GACCATGTAG AACGCATTAA GGTCCTGAAA
TTGATGTGGG ATGCGATGGG CAGTGAGTTT GGCGGCCGCC ATGAATTATA TGAAATTAAC
TATGCGGGCA GCCAGGATGA GATCCGTCTG CAATGTTTGC GTCAGGCGCA GGGAGACGGA
TCTATGGACC AGATGATGGC GATGGTTGAT CGGTGTTTGA GTGATTATGA TACTCAGGGC
TGGACGGTTT CTCATCTGCA CAATAATAAC GATATTAATC AACTAAATAG GTTAATGAAA
TAA
 
Protein sequence
MKPENFRANN QRPFTHKEYL DSLQDGREIY IYGERVKNVT THPAFRNAAA SIGQLYDALH 
DPQSHEQLCW NTDTGNGGYT HKFFRYATSP AELRQQRDAI AHWSRQNYGW MGRSPDYKAA
FGSVLGAFPE FYGQFADNAR NWYQRIQESG LYFNHAIVNP PLDRHKSADE VKDVYIHIEK
ETDAGIIVSG AKVVATNSAL THYNFIGFGS AQVMGEDPDF ALMFVAPMDA DGMKLISRGS
YELMVGATGS PFDYPLSSRF DENDAILVMD NVLIPWENIL IYRDFDRCRR WSTQGGFARL
FPLQACTRLG VKLDFITALL KKSLMCTGSL EFRGVQADLG EVVAWRNVIW ALSDAMCAEA
KPWVNGAYLP DLAALQAYRV IAPMAYCKIK NIIERTLASG LIYLPSSVRD LQNPAIDKYL
SRYVRGSNGI DHVERIKVLK LMWDAMGSEF GGRHELYEIN YAGSQDEIRL QCLRQAQGDG
SMDQMMAMVD RCLSDYDTQG WTVSHLHNNN DINQLNRLMK