Gene YpsIP31758_2172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2172 
Symbol 
ID5387186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2497608 
End bp2498876 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content52% 
IMG OID640865159 
Productcarbohydrate ABC transporter periplasmic-binding protein 
Protein accessionYP_001401145 
Protein GI153948120 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.00849198 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATAA AAAAAATAGG TATCGCAGGT ATTATCGGCA CGTTGCTGAT GGCGAGTAAC 
GCCAGCGCAC AGGAAACCCT CCGTGTACTG CTCGAAGGGC ACAGCACCAG CGACTCGATA
AAAGCACTGT TACCCGAATT CGAAAAGCAG ACCGGTATTA AGGTTCAGGC AGAGATAGTA
CCTTACAGCG ATCTGACCTC TAAAGCCCTG CTGGCCTTCT CCTCGCACAG TGGACGTTAC
GACGTGGTTA TGGATGACTG GGTGCATGCA GTAGGTTACG CCTCTGCTGG TTATATCACA
CCTGTAGATC AGTGGATGGA GAGTGATACC GCCTTCTACG ATGGTGCGGA TTTCGTCAAA
AGCTATGCTG ATACGCTGCG TTATAAAGAC GGTTATTACG GGCTGCCAGT CTATGGTGAA
AGTACCTTCC TGATGTACCG CAAAGACCTG TTTGAACAGT ACGGTATCGC CGTGCCGAAA
ACCTTTGATG AGCTGACCGC TGCGGCAAAA ACCATCAAAG AGAAGACCGA AGGTAAGGTG
GCGGGTATTA CGCTCCGTGG AGCTCAGGGG ATCCAGAACA CCTTTGCATG GGCGTCATTC
CTCTGGGGTT ACGGCGGCCA GTGGATTGAC GACAACGGAA AATCTGCAAT TGCTTCGCCA
CAGGCGGTAG AAGCCACCAA GTCATTCGTC AATATCCTGA AAAACTACGG GCCGATCGGC
GCGGCTAACT TCGGCTGGCA GGAAAACCGC TTGGTATTCC AGCAGGGTAA AGCGGCAATG
ACTATCGATT CGACAGTGAA CGGGGGCTTC AACGAAGACC CGAAAGAGTC TACGGTCGTC
GGTAAAGTGG GCTATGCCCC GGTACCGGTA CAGCCAGGCG ATCATCCAGG TAACAGCGGC
GCACTTCAGG TGCATGGCTT GTATATCTCC AGCGACAGTA AGAAGCAGGA TGCTGCCTGG
AAATTTATCA GTTGGGCAAC GGACAAACAG ACGCAGATGA AGTCGGTCGA ACTGAATCCT
AACGCCGGTG TGAGTTCACT CAGTGCCATC AACAGTGATG CCTTCACCAA GCGTTACGGG
GCCTTTAAGG ACGGTATGCT CGCAGCATTG CAAAACGGCA ATGCGAAATA CCTCCCAACC
ATTCCGCAGT CTACACAGAT TATCAACATA ACCGGTATTG CTCTATCCGA GGCACTGGCA
GGTACTCAGA CAGTAGAAAA TGCCCTTCAG CAAGCCAACA CCCGTAATGA TAAAGCGTTG
TCCCGTTAA
 
Protein sequence
MSIKKIGIAG IIGTLLMASN ASAQETLRVL LEGHSTSDSI KALLPEFEKQ TGIKVQAEIV 
PYSDLTSKAL LAFSSHSGRY DVVMDDWVHA VGYASAGYIT PVDQWMESDT AFYDGADFVK
SYADTLRYKD GYYGLPVYGE STFLMYRKDL FEQYGIAVPK TFDELTAAAK TIKEKTEGKV
AGITLRGAQG IQNTFAWASF LWGYGGQWID DNGKSAIASP QAVEATKSFV NILKNYGPIG
AANFGWQENR LVFQQGKAAM TIDSTVNGGF NEDPKESTVV GKVGYAPVPV QPGDHPGNSG
ALQVHGLYIS SDSKKQDAAW KFISWATDKQ TQMKSVELNP NAGVSSLSAI NSDAFTKRYG
AFKDGMLAAL QNGNAKYLPT IPQSTQIINI TGIALSEALA GTQTVENALQ QANTRNDKAL
SR