Gene YPK_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_2002 
Symbol 
ID6088137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2227141 
End bp2229279 
Gene Length2139 bp 
Protein Length712 aa 
Translation table11 
GC content51% 
IMG OID641597069 
Producttype I secretion system ATPase 
Protein accessionYP_001720742 
Protein GI170024237 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID[TIGR01846] type I secretion system ABC transporter, HlyB family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAGT CTTATTCAAC TCTGGCCGCC GAGGGCACCA AGGTTATTCA GCCACTGGAG 
GCGTTGGCGT GTGCTGCTAC CTGTTTTGAT TTAACCATCG AAGCTTCGCA ATTGGCACAC
CAACTAGGCT TAGCTCCCGA TGAGATAGAC AGTATTGCCT TGTGCCGTTG TGCTGCTTGG
ATAGGGTTAC GAGCGCGGAA AGTTAATCAG TCTTTTGAAC GTGTGGGAAA GCTGGTATTA
CCCGTTTTAT TCAGTGATGG TGCTCAGTGG TATGTTTTGC TGTCGCTGAC CGCGCAGGAG
GCCACTGTCT ATTTTGCAGG CAGCGACCAG ACGCGGAAAA TAAGTCCAGA AGTCCTTGCG
AAATTGTGGC GCAATGAGGT GATTTTGCTG GCGAAAGCGG AACCGATAAC GGCCAAAAAG
CAGTTTTTCG GTTTTAGCTG GTTTATTCCG GTGGTGATGA AGTCCCGGCA TCAGCTACGT
AACATTATTC TTATCTCATT ACTGCTACAA GCCATTTTAC TCGTCACGCC AATGCTATTT
GAAACCGTTA TTGATAAAGT GTTGGTAAGC CGCGGGGTCG ATAGTTTGGT GGTTCTAGGT
GGAGCCATGG TGGCACTGGC GATTACTGAA CCGGGGTATA CCTTATTACG TAGCTGGCTG
TTTGCGCATT TGTCCAGCAG GGTGGGGGCT GAACTCAATA CACAGTTATA TCGGCATCTT
CTGGGGCTGC CACTGGGGTA TTTTACCGGC CAACAGACAG GGCAAACTAT CGCCAAGATG
CGAGAAATGG AGCAAATTCG CAGCTTCCTT ACGGGCTCAG CCCTGACTAT GGTGTTGGAT
CTGTTTTTTG TCGTCACCTT TATTGCCGTG ATGTTTCGCT ACAGTGGTCA ATTGACGGGG
ATCGTATTGA TTTCCCTGCT TTGTTATCTG CTATTTTGGT CGGCAGTTGG GGGGCGGTTA
CGCAAGCGAG TGGCGCAGCA ATACGAAACC AGTGCGCAAG CAACCTCGTT CCTGACCGAG
GCGGTCAGCG GGGTGGAAAC CATCAAAACC TCGGCAACCG AAAGTCAGTT TAACCGGCGT
TGGCGTCAAG TATTAGCACG CTATGTCCGG GCCTCTTTTG GCAGTTCGCA GGCAGGGAAC
TTGGCAGGCC AAGGGATCTC GCTCATTAAT AAAATAACGT CAGCAATTCT GTTGTGGTTT
GGCGTGACCC TCGTATTGGC TGGAAAACTC AGCCCAGGTG AACTGGTGGC GTTTAATATG
TTTGCCGGTT ACGTTACCCA ACCTATATTA CGATTGGCTC AGGCATGGCA AGACTTCCAA
CATACGCAGA TTGCGCTGGA GCGGATTGGC ACCATTCTGG ATAGCCCCAC CGAACCCGGA
AGTGCCGGGC TGGTTGCCAA CTGTAAACGG GCAGGGAGCT TAAGTTTTAA GCAAGTGCGC
TTCCGTTATC GCCCTGATAC CGCAGAAGTT TTACAAAATC TCAATCTGGA GATTGCCGGT
GGAGAATTTA TCGGCATTAC CGGGCCATCG GGGTGCGGTA AATCGACACT GACCAAGCTA
ATGCAACGCT TATATACGCC ACAACATGGG CAGATTCTGG TCGATGGGCA GGATTTGGCG
ATTACTGACC CCACCACGCT GCGCCGTCAA ATGAGTGTGG TATTGCAGGA GAGCGTTCTA
TTTTCGGGCA GCATTCGGGA CAATATTTGT CAATGTCGGC CCAATGCAGA TGAGGCTCAT
GTGATACATA TTGCTCGGTT GGCCGGGGCG CATGATTTTA TTATAGCGTT GCCGCAGGGT
TATCAAACTC AGGTCGGTGA AAAGGGGGGG CTGCTTTCTG GCGGGCAACG TCAGCGGATT
GCACTGGCAA GGGCGCTGAT GGCTGATCCC AAGATTCTTA TCCTTGATGA AGCAACCAGT
GCGTTGGACT ATGAATCGGA GGCGGCGATT ATGCGCCAAT TACCGGAAAT CACCCGTAAC
CGCACGGTTA TCTGCATTGC ACACCGTTTG AATACACTAC GTCAGTGCCA CCGCATTTTG
TTACTAAAAG ATGGGCAAAT TGCGGAGCAA GGTAGTCATG AAGTCTTGGT AGCAAGTGGT
GGGAATTACG CTCGTCTGTG GCAACAGCAG ACAGAGTAG
 
Protein sequence
MTESYSTLAA EGTKVIQPLE ALACAATCFD LTIEASQLAH QLGLAPDEID SIALCRCAAW 
IGLRARKVNQ SFERVGKLVL PVLFSDGAQW YVLLSLTAQE ATVYFAGSDQ TRKISPEVLA
KLWRNEVILL AKAEPITAKK QFFGFSWFIP VVMKSRHQLR NIILISLLLQ AILLVTPMLF
ETVIDKVLVS RGVDSLVVLG GAMVALAITE PGYTLLRSWL FAHLSSRVGA ELNTQLYRHL
LGLPLGYFTG QQTGQTIAKM REMEQIRSFL TGSALTMVLD LFFVVTFIAV MFRYSGQLTG
IVLISLLCYL LFWSAVGGRL RKRVAQQYET SAQATSFLTE AVSGVETIKT SATESQFNRR
WRQVLARYVR ASFGSSQAGN LAGQGISLIN KITSAILLWF GVTLVLAGKL SPGELVAFNM
FAGYVTQPIL RLAQAWQDFQ HTQIALERIG TILDSPTEPG SAGLVANCKR AGSLSFKQVR
FRYRPDTAEV LQNLNLEIAG GEFIGITGPS GCGKSTLTKL MQRLYTPQHG QILVDGQDLA
ITDPTTLRRQ MSVVLQESVL FSGSIRDNIC QCRPNADEAH VIHIARLAGA HDFIIALPQG
YQTQVGEKGG LLSGGQRQRI ALARALMADP KILILDEATS ALDYESEAAI MRQLPEITRN
RTVICIAHRL NTLRQCHRIL LLKDGQIAEQ GSHEVLVASG GNYARLWQQQ TE