Gene YPK_1407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_1407 
Symbol 
ID6088640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp1550617 
End bp1552149 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content51% 
IMG OID641596471 
Productputative sialic acid transporter 
Protein accessionYP_001720154 
Protein GI170023649 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00891] putative sialic acid transporter 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTT CAGTAGGTCC ATCTCGTGAA GATAAACCAT TATCGGGTGG CGCTAAACCA 
CCCCGTTGGT ACAAACAACT TACCCCGGCG CAATGGAAGG CCTTTGTTGC CGCTTGGATC
GGTTATGCCC TGGATGGCTT TGACTTTGTT CTGATTACTC TGGTTCTGAC CGATATTAAA
CAAGAATTTG GCCTGACACT GATTCAGGCG ACCAGCCTGA TTTCTGCTGC CTTCATCTCA
CGCTGGTTTG GTGGGTTGGT ACTGGGTGCG ATGGGGGATC GCTATGGCCG TAAACTGGCC
ATGATCATCA GTATTGTGTT GTTCTCCTTC GGTACGTTGG CCTGTGGCTT AGCACCTGGC
TACACCACGC TGTTTATTGC TCGCTTGATT ATCGGTATTG GCATGGCGGG TGAGTATGGT
TCCAGCTCGA CCTATGTGAT GGAAAGCTGG CCTAAAAACA TGCGTAATAA AGCCAGTGGC
TTCCTGATTT CTGGCTTCTC TATCGGTGCG GTACTCGCGG CGCAAGCCTA CAGCTACGTG
GTGCCCGCAT TTGGTTGGCG TATGTTGTTC TACATTGGAT TATTGCCAAT TATCTTTGCA
CTGTGGTTGC GTAAAAACCT ACCGGAAGCA GAGGACTGGG AAAAGGCACA AAGTAAGCAG
AAAAAAGGTA AACAGGTCAC TGACCGGAAT ATGGTGGATA TTCTGTATCG CAGTCACCTC
AGTTATCTGA ATATTGGCCT GACGATATTT GCCGCTGTCT CACTTTACCT CTGTTTTACT
GGCATGGTCT CGACGTTGCT GGTAGTGGTT CTCGGTATTC TTTGCGCTGC AATATTTATC
TATTTTATGG TTCAAACCAG TGGCGATCGC TGGCCTACGG GCGTCATGCT GATGGTCGTG
GTGTTCTGTG CGTTCCTCTA CTCTTGGCCG ATCCAGGCGT TGTTACCGAC CTACCTGAAA
ATGGATCTCG GCTATGACCC ACACACCGTA GGCAATATAT TGTTCTTCAG TGGTTTTGGT
GCGGCCGTGG GTTGTTGTGT TGGCGGTTTC CTTGGCGATT GGTTGGGTAC CCGCAAAGCC
TATGTGACCA GTTTGCTGAT ATCACAGCTC TTGATCATCC CGCTGTTTGC CATCCAAGGC
AGCAGTATTT TGTTCTTAGG GGGATTACTG TTCTTACAAC AGATGCTGGG GCAGGGGATT
GCGGGCCTGT TGCCGAAACT GCTGGGCGGT TATTTTGATA CCGAACAGCG AGCCGCAGGA
CTGGGCTTTA CCTACAACGT CGGCGCATTG GGAGGGGCAT TGGCCCCCAT ACTGGGGGCA
TCGATTGCTC AACATCTCAG TTTAGGCACC GCGTTGGGAT CGCTCTCTTT CAGTCTGACA
TTCGTGGTGA TCCTACTGAT TGGTTTTGAT ATGCCATCCC GTGTACAGCG TTGGGTCCGC
CCATCAGGTT TACGGATGGT GGATGCCATC GATGGCAAAC CATTCAGTGG TGCCATTACG
GCCCAGCATG CGCGAGTAGT GACACAGAAA TAA
 
Protein sequence
MSISVGPSRE DKPLSGGAKP PRWYKQLTPA QWKAFVAAWI GYALDGFDFV LITLVLTDIK 
QEFGLTLIQA TSLISAAFIS RWFGGLVLGA MGDRYGRKLA MIISIVLFSF GTLACGLAPG
YTTLFIARLI IGIGMAGEYG SSSTYVMESW PKNMRNKASG FLISGFSIGA VLAAQAYSYV
VPAFGWRMLF YIGLLPIIFA LWLRKNLPEA EDWEKAQSKQ KKGKQVTDRN MVDILYRSHL
SYLNIGLTIF AAVSLYLCFT GMVSTLLVVV LGILCAAIFI YFMVQTSGDR WPTGVMLMVV
VFCAFLYSWP IQALLPTYLK MDLGYDPHTV GNILFFSGFG AAVGCCVGGF LGDWLGTRKA
YVTSLLISQL LIIPLFAIQG SSILFLGGLL FLQQMLGQGI AGLLPKLLGG YFDTEQRAAG
LGFTYNVGAL GGALAPILGA SIAQHLSLGT ALGSLSFSLT FVVILLIGFD MPSRVQRWVR
PSGLRMVDAI DGKPFSGAIT AQHARVVTQK