Gene YpsIP31758_2358 details

Gene Information

Locus tag: YpsIP31758_2358
Symbol: hpaX
ID: 5385889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 2658018
End bp: 2659385
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 49%
IMG OID: 640865347
Product: 4-hydroxyphenylacetate permease
Protein accession: YP_001401327
Protein GI: 153949197
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID: [TIGR02332] 4-hydroxyphenylacetate permease
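
The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single untranslated stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2658018
end_bp = 2659385
protein_length_aa = 455

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon that is not translated.
expected_gene_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)           # 1368
print(expected_gene_length_bp)  # 1368
```

Both figures agree with the 1368 bp gene length reported in the table.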


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value: 0.876442
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGATT CACTGTCTAC CGCACCGGGC GTTACCCCGC CGGTCAATAA AAACACACCA 
TTAAGTGCGC AGCAGCAGTC CGTCATTAAT AAACTGTTTC GCCGTCTGAT CCCGTTCTTA
TTCTTGTTGT TTGTTTTTGC CTTCCTTGAT CGTATCAATA TTGGTTTTGC GGGTTTGACG
ATGGGGAAGG ATTTAGGGCT AAGTGCCACG ACCTTTGGTT TAGCGACTAC ATTATTCTAC
GTGACATATA TCTTATTCGC GATCCCCAGC AATATCATGT TGGGGATCGT GGGTGCCAGG
CGTTGGATTG CGACCATTAT GGTGCTCTGG GGTATCGCCT CTACCGCGAC CTTATTTGCC
GTCGGCCCCA ACAGTTTGTA TCTCTTACGA ATGATTGTTG GGATCACCGA AGCCGGTTTT
TTACCCGGCA TTCTGGTGTA TTTAACTTAT TGGTTTCCGG CCCACTTTCG AGCCAGAGCC
AATGCGCTGT TTATGGTGGC GATGCCAGTG ACTATGGCGC TCGGCTCTCT GGTTTCCGGC
TATATTTTGG CGCTTGATGG TTTTTTGAAT ATGCGTGGTT GGCAGTGGCT ATTTCTGCTG
GAAGGCTTTC CATCGGTCTT GCTGGGGGGG GTGGTCTGGT TCTATCTGGA CGATACCCCG
CAGAAAGCGC GCTGGTTAAC GAAAGAAGAT AAACAGTGTC TGCAAGAGAT GCTGGAGAGT
GACCGTTTGC AATTGGCGAA ACAGGCGGAT TATGGTGCCT TACCACAATC AGGGATGTGG
CGGGAAATTT TCACCCCCGT GGTGCTGATG TATACACTGG CTTACTTCTG TTTAACCAAT
ACCCTAAGTG CGGTGAATAT TTGGACGCCG CAGATCCTGC AAAGTTTTAA TCAGAGCAGC
AGCAATATCA CTATCGGTCT GCTGGCCGCT ATCCCACAAG TTTGTACCAT TGCCGGCATG
ATCTGGTGGA GTAGACGCTC GGATCGGGTT CAGGAACGCA AAATGCACAC GGTTTTGCCG
TATCTGTTTG CCGCAGCGGG GTGGGTGCTG GCATCGGCCA CACAAAATAG CGTAATCCAG
TTGTTAGGGA TCATTATGGC CTCGACCGGG TCATTTACGG CGATGGCGAT TTTCTGGACT
ACACCGGATC AATCCATCAG CCTGAGAGCC AGAGCTGTTG GTATTGCAGT GATAAATGCC
ACTGGAAATA TTGGTTCGGC TGTCAGCCCG GTTTTAATTG GTTGGTTGAA AGACCAGACC
GGTAACTTTA ATTCCGGGCT GTATTTCGTT GCCGGTTTAT TGGTTATCGG GGCTGTTATT
TTCTTGATGA TTCCAATGAA AAAGGCACCT CCAAAAGCCA TTTTCTAA
 
Protein sequence
MSDSLSTAPG VTPPVNKNTP LSAQQQSVIN KLFRRLIPFL FLLFVFAFLD RINIGFAGLT 
MGKDLGLSAT TFGLATTLFY VTYILFAIPS NIMLGIVGAR RWIATIMVLW GIASTATLFA
VGPNSLYLLR MIVGITEAGF LPGILVYLTY WFPAHFRARA NALFMVAMPV TMALGSLVSG
YILALDGFLN MRGWQWLFLL EGFPSVLLGG VVWFYLDDTP QKARWLTKED KQCLQEMLES
DRLQLAKQAD YGALPQSGMW REIFTPVVLM YTLAYFCLTN TLSAVNIWTP QILQSFNQSS
SNITIGLLAA IPQVCTIAGM IWWSRRSDRV QERKMHTVLP YLFAAAGWVL ASATQNSVIQ
LLGIIMASTG SFTAMAIFWT TPDQSISLRA RAVGIAVINA TGNIGSAVSP VLIGWLKDQT
GNFNSGLYFV AGLLVIGAVI FLMIPMKKAP PKAIF
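
The gene and protein listings above are printed in blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one possible way to strip the formatting, compute GC content, and translate the coding sequence. It assumes the amino acid assignments of NCBI translation table 11 (the same codon-to-residue mapping as the standard code) and ignores the table's alternative start codons, which does not matter here because the gene starts with ATG. The function names `clean_sequence`, `gc_percent`, and `translate` are hypothetical helpers, not part of any database tool.

```python
# Amino acids for NCBI translation table 11, with codons ordered
# T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to print blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate in reading frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the listing.
first_line = "ATGAGCGATT CACTGTCTAC CGCACCGGGC GTTACCCCGC CGGTCAATAA AAACACACCA"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MSDSLSTAPGVTPPVNKNTP
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene listing through `clean_sequence` and `translate` in the same way should reproduce the whole protein, and `gc_percent` can be compared against the GC content reported in the Gene Information section.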