Gene YPK_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_3044 
Symbol 
ID6089114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp3345405 
End bp3346952 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content54% 
IMG OID641598124 
ProductABC transporter related 
Protein accessionYP_001721770 
Protein GI170025265 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCAAC CTTTATTGAA AATCACCGAT ATGGCGAAAA GCTTCTCTGG TGTCTGGGCG 
CTCAGTAACG TACAGCTCAC CGTAGAGCAG GGTGAAATAC ATGCACTCTT GGGTGAGAAC
GGCGCAGGTA AATCAACACT CTTGAAAGCA CTCTCGGGTG CTCAACCCCA GACTCACGGC
GAAATCTGGT TTAACGGTGA AATGCTGGCA TTGGACGACT CGCCAGTGGA ACGCCAGAAC
AAAGGCATTA TCACCATCTA TCAAGAAGTT AACCTACTGC CCAACATGAC GGTGGCAGAA
AACATGTTTC TTGGTCGTGA ACCGCGCCGC CGTCAGGTAT TTGTCGACGA AAAAGCCGTC
AATCAGGAAG CCCAAGCGAT CCTCGATTAC CTGCAACTTA ACGTGTCACC CACCACGGCG
GTGGCGCGCT TGAGTATCGC GCAGCAACAG ATGGTAGAGA TCGCCCGGGC GCTGACCTTG
AACGCGAAGC TCATTGTCAT GGATGAGCCT TCGGCAGCAC TCAGCGACAG CGAAGTCGAA
AGCCTGCATC GCGTGGTACG GGAACTGAAA GGCCGTGGTG TGAGCATTAT CTATGTCACC
CACCGCTTGC ACGAAGTGTT CCAACTCTGT GATCGTTTCA CGGTGTTTCA GGATGGGCGT
TACACCGGTT CTGATGAGGT TGCAGGCACC AACGTTGAGA AGATTATCCG CCTGATGGTG
GGGCGAGACG TCGTATTTAA CCGCCGCCCC GCCAGTGAGA CCCATCACCA AGACCAGCCC
ATTCGCCTAT CTGTGCAAGG GCTGTGTCGT GAAAAACCCC CGCTCGATCC ACATGGTGTG
GCGCTAAAAG ACATCAGCTT TCACGTCCAC GCCGGGGAAG TCCTGGGTAT CGCCGGGTTG
GTAGGGGCAG GGCGTACCGA AGTGGCACGT TGTCTGTTTG GGGCGGGGGC TTTCACCTCT
GGCAATTTTG AGATAGACGG TATGCCCTAT CAGCCACGGG ATCCAATGTT CGCGCTGGAA
CAGGGGATCG CACTGGTGCC GGAAGACCGT AAAAAAGAGG GGGCAGTGCA AGGGCTTTCT
ATTCGCGACA ATCTGACACT TTCGAGCCTG GCCGGGCTGT TACAGTGGCG TTTTTTCGTC
AATACCCGCA AAGAAGATCA ACTGATTGAG ACCTACCGTT TAGCACTGCA AATCAAGATG
GTGAACAGCG AACAGGCGGT GCGTAAGCTC TCTGGCGGTA ACCAGCAGAA GGTGATCTTG
GCCCGCTGCA TGGCGCTCAA TCCACGGATC CTGATCGTCG ATGAACCGAC ACGGGGCATT
GATGTGGGCA CGAAATCGGA AGTGCATCAG GTGTTGTTTG ATATGGCTAA ACAGGGCGTG
GCAGTGATCG TCATCTCCTC GGATTTACCG GAAGTTCTCG CGGTTTCTGA CCGGATCATC
ACGCTAAGCG AAGGGCGAGT CACTGGAGAG ATTCACGGTG ATGACGCCAG CGAAGAACGG
CTGATGACCA TGATGGCCAT CAATCATAAC GCCTTAAATG CCGCCTAA
 
Protein sequence
MSQPLLKITD MAKSFSGVWA LSNVQLTVEQ GEIHALLGEN GAGKSTLLKA LSGAQPQTHG 
EIWFNGEMLA LDDSPVERQN KGIITIYQEV NLLPNMTVAE NMFLGREPRR RQVFVDEKAV
NQEAQAILDY LQLNVSPTTA VARLSIAQQQ MVEIARALTL NAKLIVMDEP SAALSDSEVE
SLHRVVRELK GRGVSIIYVT HRLHEVFQLC DRFTVFQDGR YTGSDEVAGT NVEKIIRLMV
GRDVVFNRRP ASETHHQDQP IRLSVQGLCR EKPPLDPHGV ALKDISFHVH AGEVLGIAGL
VGAGRTEVAR CLFGAGAFTS GNFEIDGMPY QPRDPMFALE QGIALVPEDR KKEGAVQGLS
IRDNLTLSSL AGLLQWRFFV NTRKEDQLIE TYRLALQIKM VNSEQAVRKL SGGNQQKVIL
ARCMALNPRI LIVDEPTRGI DVGTKSEVHQ VLFDMAKQGV AVIVISSDLP EVLAVSDRII
TLSEGRVTGE IHGDDASEER LMTMMAINHN ALNAA