Gene YpsIP31758_1298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1298 
Symbol 
ID5386951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp1517681 
End bp1519213 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content51% 
IMG OID640864274 
Productputative sialic acid transporter 
Protein accessionYP_001400277 
Protein GI153947886 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00891] putative sialic acid transporter 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTT CAGTAGGTCC ATCTCGTGAA GATAAACCAT TATCGGGTGG CGCTAAACCA 
CCCCGTTGGT ACAAACAACT TACCCCGGCG CAATGGAAGG CCTTTGTTGC CGCTTGGATC
GGTTACGCCC TGGATGGCTT TGACTTTGTT CTGATTACTC TGGTTCTGAC CGATATTAAA
CAAGAATTTG GCCTGACACT GATTCAGGCG ACCAGCCTGA TTTCTGCTGC CTTCATCTCA
CGCTGGTTTG GTGGGTTGGT ACTGGGCGCG ATGGGGGATC GCTATGGCCG TAAACTGGCC
ATGATCACCA GTATTGTGTT GTTCTCCTTC GGTACGTTGG CCTGTGGCTT AGCACCTGGC
TACACCACGC TGTTTATTGC TCGCTTGATT ATCGGTATTG GCATGGCGGG TGAGTATGGT
TCCAGCTCGA CCTATGTGAT GGAAAGCTGG CCTAAAAACA TGCGTAATAA AGCCAGTGGC
TTCCTGATTT CTGGCTTCTC TATCGGTGCG GTACTCGCGG CGCAAGCCTA CAGCTACGTG
GTGCCCGCAT TTGGTTGGCG TATGTTGTTC TACATTGGAT TATTGCCAAT TATCTTTGCA
CTGTGGTTGC GTAAAAATCT ACCGGAAGCA GAGGACTGGG AAAAGGCACA AAGTAAGCAG
AAAAAAGGTA AACAGGTCAC TGACCGGAAT ATGGTGGATA TTCTGTATCG CAGTCACCTC
AGTTATCTGA ATATTGGCCT GACGATATTT GCCGCCGTCT CACTTTACCT CTGTTTTACT
GGCATGGTCT CGACGTTGCT GGTGGTGGTT CTCGGTATTC TTTGCGCTGC AATATTTATC
TATTTTATGG TTCAAACCAG TGGCGATCGC TGGCCTACGG GCGTCATGCT GATGGTCGTG
GTGTTCTGTG CGTTCCTCTA CTCTTGGCCG ATCCAGGCGT TGTTACCGAC CTACCTGAAA
ATGGATCTCG GCTATGACCC ACACACCGTA GGCAATATAT TGTTCTTCAG TGGTTTTGGT
GCGGCCGTGG GTTGTTGTGT TGGCGGTTTC CTTGGCGATT GGTTGGGTAC CCGCAAAGCC
TATGTGACCA GTTTGCTGAT ATCACAGCTC TTGATCATCC CGCTGTTTGC CATCCAAGGC
AGCAGTATTT TGTTCCTGGG GGGATTACTG TTCTTACAAC AGATGCTGGG GCAGGGGATT
GCGGGCCTGT TGCCGAAACT GTTGGGCGGT TATTTTGATA CCGAACAGCG AGCCGCAGGA
CTGGGCTTTA CCTACAACGT CGGCGCATTG GGAGGGGCAT TGGCCCCCAT ACTGGGGGCA
TCGATTGCTC AACATCTCAG TTTAGGCACC GCGTTGGGAT CGCTCTCTTT CAGTCTGACA
TTCGTGGTGA TCCTACTGAT TGGTTTTGAT ATGCCATCCC GTGTACAGCG TTGGGTCCGC
CCATCAGGTT TACGGATGGT GGATGCCATC GATGGCAAAC CATTCAGTGG TGCCATTACG
GCCCAGCATG CGCGAGTAGT GACACAGAAA TAA
 
Protein sequence
MSISVGPSRE DKPLSGGAKP PRWYKQLTPA QWKAFVAAWI GYALDGFDFV LITLVLTDIK 
QEFGLTLIQA TSLISAAFIS RWFGGLVLGA MGDRYGRKLA MITSIVLFSF GTLACGLAPG
YTTLFIARLI IGIGMAGEYG SSSTYVMESW PKNMRNKASG FLISGFSIGA VLAAQAYSYV
VPAFGWRMLF YIGLLPIIFA LWLRKNLPEA EDWEKAQSKQ KKGKQVTDRN MVDILYRSHL
SYLNIGLTIF AAVSLYLCFT GMVSTLLVVV LGILCAAIFI YFMVQTSGDR WPTGVMLMVV
VFCAFLYSWP IQALLPTYLK MDLGYDPHTV GNILFFSGFG AAVGCCVGGF LGDWLGTRKA
YVTSLLISQL LIIPLFAIQG SSILFLGGLL FLQQMLGQGI AGLLPKLLGG YFDTEQRAAG
LGFTYNVGAL GGALAPILGA SIAQHLSLGT ALGSLSFSLT FVVILLIGFD MPSRVQRWVR
PSGLRMVDAI DGKPFSGAIT AQHARVVTQK