Gene YpsIP31758_1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1871 
Symbol 
ID5385841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2167277 
End bp2168545 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content47% 
IMG OID640864855 
Productmajor facilitator transporter 
Protein accessionYP_001400846 
Protein GI153949397 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.00512651 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGCCAG CATCAACAAA AAATGATGAT GACACTGTTT TCACTACGAT CCCTAAACTG 
CCTCCCCTAA ATAAGCGTCC TTATATACAA CAAGGTACGC CAGAATTCAT CCGCGTTGCT
CTGGCTTTAT TTTCGGCGGG GTTAGCAACT TTCGCCCTGC TTTACTGCGT ACAACCTATT
TTGCCCATGT TGTCGCAGGA CTTTAGTACC TCTCCAGCGT CCAGCAGTTT GTCACTTTCG
ATAGCCACCG GGATGCTGGC GTTGGGTTTG ATGTTTACCG GCCCCCTTTC TGATGCGATA
GGCCGAAAAT CAGTCATGGT CGTGGCGTTG CTATTGGCCG CGGTTTGCAC AATAGTTTGC
TCTTTTATGA CCAGTTGGCA TGGGATTTTG CTAATGCGCG CATTGACCGG TCTATCCTTA
AGCGGAGTTG CCGCGGTGGC AATGACCTAT TTGAGTGAAG AGATCCATCC TAATTTTATT
GCGCTATCAA TGGGGTTGTA TATCAGCGGT AGTTCTATTG GTGGCATGAG TGGGCGTTTG
GTGGCTGGGG TATTAAGCGA TCTCTTTTCC TGGCGCGTAT CACTGCTAGT ACTCGGATTA
TTTGCTTTAG CTGCTGCTTG CTTGTTTTGG TTTATCCTCC CAGCGTCTAA ACACTTTCGT
GCAAGTTCAT TGCGCCCCAG AACGTTGTTA ATCAATTTTA AACTGCACTG GCGTGACTCC
GGCTTACCCC TACTATTTGC TGAAGGTTTT CTCATTATGG GGGGGTTCGT CACCTTATTT
AATTATATCG GCTATCGGTT ACTGGATGGG CCTTATTATC TCAGCCCGAC CATCGTAGGG
CTATTATCCA TTGTTTATTT AACGGGTTCT TATAGTTCAC CTAAAGCGGG TTCACTCAGT
AATCGCTACG GGCGAGGCCT AATTTTATTG GCCTCTATCG GTATGATGTT GGTTGGTGTT
GTGATCACCA GTTTTCCGTC AGTGATCATG ATTTTTATTG GTATGATGTT CGTTGCAGCA
GGATTCTTTG CCGCTCACTC CGTTGTCAGT AGCTGGGTTG GTTGCAGAGC ACGTCGTGCT
AAGGCACAAG CTTCGTCACT TTATCTGTTT TGTTACTATG CCGGTTCCAG CGTAGCCGGT
ACGTTAGGTG GCGTTTTTTG GTTACATTTA GGCTGGACAG GTGTTGTTGT TTTTATTACC
GCCCTTTTAG TTATCGCGTT GTTTATCGCT CAGCGATTAC GAAAGTTGGT AGGAACAGCC
AAGCGTTGA
 
Protein sequence
MAPASTKNDD DTVFTTIPKL PPLNKRPYIQ QGTPEFIRVA LALFSAGLAT FALLYCVQPI 
LPMLSQDFST SPASSSLSLS IATGMLALGL MFTGPLSDAI GRKSVMVVAL LLAAVCTIVC
SFMTSWHGIL LMRALTGLSL SGVAAVAMTY LSEEIHPNFI ALSMGLYISG SSIGGMSGRL
VAGVLSDLFS WRVSLLVLGL FALAAACLFW FILPASKHFR ASSLRPRTLL INFKLHWRDS
GLPLLFAEGF LIMGGFVTLF NYIGYRLLDG PYYLSPTIVG LLSIVYLTGS YSSPKAGSLS
NRYGRGLILL ASIGMMLVGV VITSFPSVIM IFIGMMFVAA GFFAAHSVVS SWVGCRARRA
KAQASSLYLF CYYAGSSVAG TLGGVFWLHL GWTGVVVFIT ALLVIALFIA QRLRKLVGTA
KR