Gene YpAngola_A1641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1641 
SymbolhpaX 
ID5800112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1695297 
End bp1696664 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content49% 
IMG OID641339587 
Product4-hydroxyphenylacetate permease 
Protein accessionYP_001606144 
Protein GI162420137 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR02332] 4-hydroxyphenylacetate permease 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATT CACTGTCTAC CGCACCGGGC GTTACCCCGC CGGTCAATAA AAACACACCA 
TTAAGTGCGC AGCAGCAGTC CGTCATTAAT AAACTGTTTC GCCGTCTGAT CCCGTTCTTA
TTCTTGTTGT TTGTTTTTGC CTTCCTTGAT CGTATCAATA TAGGTTTTGC GGGTTTGACG
ATGGGGAAGG ATTTAGGGCT AAGTGCCACA ACCTTTGGTT TAGCGACTAC ATTATTCTAC
GTGACATATA TCTTATTCGC GATCCCCAGC AATATCATGT TGGGGATCGT GGGTGCCAGG
CGTTGGATTG CGACCATTAT GGTGCTCTGG GGGATCGCTT CTACCGCGAC CTTATTTGCC
GTCGGCCCCA ACAGTTTGTA TCTCTTACGA ATGATTGTTG GGATCACCGA AGCCGGTTTT
TTACCCGGCA TTCTGGTGTA TTTAACTTAT TGGTTTCCGG CCCACTTTCG AGCCAGAGCC
AATGCGCTGT TTATGGTGGC GATGCCAGTG ACTATGGCGC TCGGCTCTCT GGTTTCCGGC
TATATTTTGG CGCTTGATGG TTTTTTGAAT ATGCGTGGTT GGCAGTGGCT ATTTCTGCTG
GAAGGCTTTC CATCGGTCTT GCTGGGGGGG GTGGTCTGGT TCTATCTGGA CGATACCCCG
CAGAAAGCGC GCTGGTTAAC GAAAGAAGAT AAACAGTGTC TGCAAGAGAT GCTGGAGAGT
GACCGTTTGC AATTGGCGAA ACAGGCGGAT TATGGTGCCT TACCACAATC CGCGATGTGG
CGGGAAATTT TCACCCCCGT GGTGCTGATG TATACACTGG CTTACTTCTG TTTAACCAAT
ACCCTAAGTG CGGTGAATAT TTGGACGCCG CAGATCCTGC AAAGTTTTAA TCAGAGCAGC
AGCAATATCA CTATCGGTCT GCTGGCCGCT ATCCCACAAG TTTGTACCAT TGCCGGCATG
ATCTGGTGGA GTAGACGCTC GGATCGGGTT CAGGAACGCA AAATGCACAC GGTTTTGCCG
TATCTGTTTG CCGCAGCGGG GTGGGTGCTG GCATCGGCCA CACAAAATAG CGTAATCCAG
TTGTTAGGGA TCATTATGGC CTCGACCGGG TCATTTACGG CGATGGCGAT TTTCTGGACT
ACACCGGATC AATCCATCAG CCTGAGAGCC AGAGCTGTTG GTATTGCAGT GATAAATGCC
ACTGGAAATA TTGGTTCGGC TGTCAGCCCG GTTTTAATTG GTTGGTTGAA AGACCAGACC
GGTAACTTTA ATTCCGGGCT GTATTTCGTT GCCGGTTTAT TGGTTATCGG GGCTGTTATT
TTCTTGATGA TTCCAATGAA AAAGGCACCT CCAAAAGCCA TTTTCTAA
 
Protein sequence
MSDSLSTAPG VTPPVNKNTP LSAQQQSVIN KLFRRLIPFL FLLFVFAFLD RINIGFAGLT 
MGKDLGLSAT TFGLATTLFY VTYILFAIPS NIMLGIVGAR RWIATIMVLW GIASTATLFA
VGPNSLYLLR MIVGITEAGF LPGILVYLTY WFPAHFRARA NALFMVAMPV TMALGSLVSG
YILALDGFLN MRGWQWLFLL EGFPSVLLGG VVWFYLDDTP QKARWLTKED KQCLQEMLES
DRLQLAKQAD YGALPQSAMW REIFTPVVLM YTLAYFCLTN TLSAVNIWTP QILQSFNQSS
SNITIGLLAA IPQVCTIAGM IWWSRRSDRV QERKMHTVLP YLFAAAGWVL ASATQNSVIQ
LLGIIMASTG SFTAMAIFWT TPDQSISLRA RAVGIAVINA TGNIGSAVSP VLIGWLKDQT
GNFNSGLYFV AGLLVIGAVI FLMIPMKKAP PKAIF