Gene YPK_0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_0421 
Symbol 
ID6087528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp459914 
End bp461035 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content51% 
IMG OID641595483 
Productalkanesulfonate transporter substrate-binding subunit 
Protein accessionYP_001719177 
Protein GI170022672 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCTAT CTTTCTTCTT TCGTCGTGGG TTCAACGTTC ACCGCTGGCT CAATATCGGG 
GCAATGGCCG CTATTATTAC GTTAGCGTTT ACTAATACAG TGATTGCACA GGATAGCGCC
CCGGCCCAGT TCCGCATTGG GTACCAAAAA GGCTCGGTGA ATCTGGTGTT GGCGAAAACT
CACCAATTGC TGGAAAAACG CTTTCCTGAT ACCCAAATTA GCTGGATTGA ATTCCCGGCA
GGCCCGCAAA TGCTCGAAGC GTTGAACGTC AACAGCATCG ATTTGGGCAG TACTGGCGAT
ATTCCCCCCA TTTTTGCTCA GGCTGCTGGC GCTGATCTGC TGTATGTCGG CATGGAGCCG
CCGAAACCGA AGGCAGAAGT GATTTTGGTG CCGGAAAACA GCGCGATTAA CAGCGTTGCT
GAACTGAAAG GCCATAAAGT GGCTTTCCAG AAAGGTTCCA GCTCACACAA CCTGCTCTTG
CAGGCACTGC AAAAAGCGGG GCTAAAATTT ACCGATATCC AACCTGTCTA CCTCACGCCT
GCTGATGCAC GAGCGGCTTT CCAACAGGGC AATGTTGATG CTTGGGTCAT TTGGGATCCC
TACTATTCCG CCGCCTTGTT GCAAGGGGGC ATACGGGTAC TGATAGATGG CAGCCAATTA
AACCAGACAG GCTCTTTCTA TCTGGCTTCT CGTCCTTATA CCGAAGCTAA CGGCCCATTT
ATTCAGCAGG TGCTGGAGGT ACTGACGCAG GCTGATGCGT TGACGCTTAG CGATCGAGCA
CAAAGTATCA CGCTACTGGC CAACGCGATG GGGCTACCAG AGGCCGTGAT TGCCAGCTAT
CTGGATCATC GTCCCCCCAC GGCTATCCAG CCTTTGAGTC AGGCGACGGT TGCTGCCCAA
CAAAGAACGG CCGATCTGTT TTTCGCCAAC CGGCTGTTAC CGGTCAAAGT GGATATTTCG
CAACGTGTTT GGCTGCCAGC CGGGCAATTA TCCTCAAAGC CACCATCTTC AAAGCCATCA
TCTTCAAACC AATCATCTCC AAGCCAATTA CCTACAGATC AACCGTCTAT AGCGCAGACA
TCTATAGAGC AATCATCTAC AGCAAAATCA CAGACCAAAT AA
 
Protein sequence
MSLSFFFRRG FNVHRWLNIG AMAAIITLAF TNTVIAQDSA PAQFRIGYQK GSVNLVLAKT 
HQLLEKRFPD TQISWIEFPA GPQMLEALNV NSIDLGSTGD IPPIFAQAAG ADLLYVGMEP
PKPKAEVILV PENSAINSVA ELKGHKVAFQ KGSSSHNLLL QALQKAGLKF TDIQPVYLTP
ADARAAFQQG NVDAWVIWDP YYSAALLQGG IRVLIDGSQL NQTGSFYLAS RPYTEANGPF
IQQVLEVLTQ ADALTLSDRA QSITLLANAM GLPEAVIASY LDHRPPTAIQ PLSQATVAAQ
QRTADLFFAN RLLPVKVDIS QRVWLPAGQL SSKPPSSKPS SSNQSSPSQL PTDQPSIAQT
SIEQSSTAKS QTK