Gene YPK_3226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_3226 
Symbol 
ID6089541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp3526960 
End bp3528663 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content49% 
IMG OID641598303 
Productextracellular solute-binding protein 
Protein accessionYP_001721948 
Protein GI170025443 
COG category[R] General function prediction only 
COG ID[COG4533] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATTA TTCACCGGCT TACCCAATAT GAGCGTTTAT ATCAAAAATT TGGTGACCAC 
CCGGTGGCGA CTACCGTGGC TGACGTGGCT AGTTTACTCT TTTGTAGCGA ACGGCATGCC
CGCACCCTGA TTCAGCAACT ACAGATGAAG AGTTGGCTAA GCTGGCATTC ACAAGTGGGT
AGAGGGAAAC GAGCGCAACT GCAATGCCTG AAAAAACCTG ATGCATTACG GGCTATTTAC
CTGCAACAGT TCCTTGAGCA AGGCGATCAT CAGGCGGCAT TCTCAATAGC ACAATTGGAA
CCCGAGCGCC TACAGACCTT ACTTACCCCC CATATGGGCG GACAATGGCA AGCCGATAGC
CCTATTCTGC GTATCCCCTA CTACCGCGAG CTGGAACCGC TTAATCCCAT GAATGCTTCA
GGGCGGGCAG AACAGCACCT GATTTATACT TTGCATGCTG GGCTGACACG GTTTAATACA
GGTGACCCGT TGCCTAAACC TGATTTGGCT CATCACTGGC AAATCAGCGC AGATGGCTTA
ACCTGGCAGT TCTTCTTACG CAGCCAACTA CGTTGGCATA ATGGCGACCA CATTCATGGT
AAGCAATTAT TGCAAACACT GGAGATTCTG CGCGCAAACC GACGTAGCCA CCCCAGTTTT
GCTAATATTG TTACTATCAC TCTCCCCCAC GCTTTATGCC TACAATTTAC CCTTTCCCAA
CCAGATTATT GGCTAGCACA CCGGCTGGCT GATTTACCCT GTAGGCTTTT TCATCCAGAC
GATCCCTTTT TAGGTGCGGG TCCTTTTAAA TTAGCGACCT TTGATAAACA TTTAGTTCGA
CTAAAGCAGC ACGAATTTTA CCATTTGCAA CATCCCTATC TGGACATTAT CGAGTACTGG
ATCACCCCTA GCCTGACGGT AAATTCAACA AATGGCAGTT GCCAGCATCC GGTTCGCATC
ACCATCGGCC AAGAGGAAGA GTTCCCACTG GCCCGCCCCG TACAGCGCAG CATGAGCCTC
GGATTCTGCT ATCTGGCTAT TAATCGCCAT CGTAGCAACC TCACTCCACA GCAAATAGCC
AAGCTACTGA TGTTAGTCCA AACCTCGGGT ATATTAGAGG CGCTCTCCAT CAGCCGTGAC
GTAATAACGC CCTGCCATGA AATCCTTCCA GGCTGGCCTA TTCCACAGTT TTCGACGGAT
GAAAATCCCT CCCTTCCCGC CTGTTTGGTT CTGACCTATC AACCGCCGAT GGAGCTTGAG
AGTGTCGCTG AGCAACTAAA AATAGTATTA GCCGCTCATG GCTGTACATT AGAGATCCGC
GCCTGCCATG ATAAACAGTG GCAAGATGTT GACAAAATTA AAGAGAGCGA TTTACTGTTG
GCCGATCATT TAGTCGGTGA ATCGCCAGAG GCCACAATGG AGAGCTGGCT ACGGCTGGAC
CCTCTGTGGC GCGGAATTTT ACAGAACGAA CAGTGGAACC AGCAGCAAAA AACGCTGACC
TTCATTCAGC AGATAGAAAG CGCGCCAGAA CGTTTTCGCC AATTACAGGC ACATTACGAT
GACCTGATGT TAGCGGGACT GATTTTGCCG TTGTTTAACT ATGAATATCA GGTCAATGCC
CCATCACGCA TCAATGGGGT TACATTAACG GCATATGGTT GGTTCGATTT CTGTCAAGCC
TGGCTACCGC CAATAACGAA TTAA
 
Protein sequence
MRIIHRLTQY ERLYQKFGDH PVATTVADVA SLLFCSERHA RTLIQQLQMK SWLSWHSQVG 
RGKRAQLQCL KKPDALRAIY LQQFLEQGDH QAAFSIAQLE PERLQTLLTP HMGGQWQADS
PILRIPYYRE LEPLNPMNAS GRAEQHLIYT LHAGLTRFNT GDPLPKPDLA HHWQISADGL
TWQFFLRSQL RWHNGDHIHG KQLLQTLEIL RANRRSHPSF ANIVTITLPH ALCLQFTLSQ
PDYWLAHRLA DLPCRLFHPD DPFLGAGPFK LATFDKHLVR LKQHEFYHLQ HPYLDIIEYW
ITPSLTVNST NGSCQHPVRI TIGQEEEFPL ARPVQRSMSL GFCYLAINRH RSNLTPQQIA
KLLMLVQTSG ILEALSISRD VITPCHEILP GWPIPQFSTD ENPSLPACLV LTYQPPMELE
SVAEQLKIVL AAHGCTLEIR ACHDKQWQDV DKIKESDLLL ADHLVGESPE ATMESWLRLD
PLWRGILQNE QWNQQQKTLT FIQQIESAPE RFRQLQAHYD DLMLAGLILP LFNYEYQVNA
PSRINGVTLT AYGWFDFCQA WLPPITN