Gene YPK_2429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_2429 
Symbol 
ID6087298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2659892 
End bp2662801 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content48% 
IMG OID641597495 
Productinvasin region 3 
Protein accessionYP_001721159 
Protein GI170024654 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.621215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTATGT ATTTTAATAA AATAATTTCA TTTAATATTA TTTCACGAAT AGTTATTTGT 
ATCTTTTTGA TATGTGGAAT GTTCATGGCT GGGGCTTCAG AAAAATATGA TGCTAACGCA
CCGCAACAGG TCCAGCCTTA TTCTGTCTCT TCATCTGCAT TTGAAAATCT CCATCCTAAT
AATGAAATGG AGAGTTCAAT CAATCCCTTT TCCGCATCGG ATACAGAAAG AAATGCTGCA
ATAATAGATC GCGCCAATAA GGAGCAGGAG ACTGAAGCGG TGAATAAGAT GATAAGCACC
GGGGCCAGGT TAGCTGCATC AGGCAGGGCA TCTGATGTTG CTCACTCAAT GGTGGGCGAT
GCGGTTAATC AAGAAATCAA ACAGTGGTTA AATCGATTCG GTACGGCTCA AGTTAATCTG
AATTTTGACA AAAATTTTTC GCTAAAAGAA AGCTCTCTTG ATTGGCTGGC TCCTTGGTAT
GACTCTGCTT CATTCCTCTT TTTTAGTCAG TTAGGTATTC GCAATAAAGA CAGCCGCAAC
ACACTTAACC TTGGCGTCGG GATACGTACA TTGGAGAACG GTTGGCTGTA CGGACTTAAT
ACTTTTTATG ATAATGATTT GACCGGCCAC AACCACCGTA TCGGTCTTGG TGCCGAGGCC
TGGACCGATT ATTTACAGTT GGCTGCCAAT GGGTATTTTC GCCTCAATGG ATGGCACTCG
TCGCGTGATT TCTCCGACTA TAAAGAGCGC CCAGCCACTG GGGGGGATTT GCGCGCGAAT
GCTTATTTAC CTGCACTCCC ACAACTGGGG GGGAAGTTGA TGTATGAGCA ATACACCGGT
GAGCGTGTTG CTTTATTTGG TAAAGATAAT CTGCAACGCA ACCCTTATGC CGTGACTGCC
GGGATCAATT ACACCCCCGT GCCTCTACTC ACTGTCGGGG TAGATCAGCG TATGGGGAAA
AGCAGTAAGC ATGAAACACA GTGGAACCTC CAAATGAACT ATCGCCTGGG CGAGAGTTTT
CAGTCGCAAC TTAGCCCTTC AGCGGTGGCA GGAACACGTC TACTGGCGGA GAGCCGCTAT
AACCTTGTCG ATCGTAACAA TAATATCGTG TTGGAGTATC AGAAACAGCA GGTGGTTAAA
CTGACATTAT CGCCAGCAAC TATCTCCGGC CTGCCGGGTC AGGTTTATCA GGTGAACGCA
CAAGTACAAG GGGCATCTGC TGTAAGGGAA ATTGTCTGGA GTGATGCCGA ACTGATTGCC
GCTGGCGGCA CATTAACACC ACTGAGTACC ACACAATTCA ACTTGGTTTT ACCGCCTTAT
AAACGCACAG CACAAGTGAG TCGGGTAACG GACGACCTGA CAGCCAACTT TTATTCGCTT
AGTGCGCTCG CGGTTGATCA CCAAGGAAAC CGATCTAACT CATTCACATT GAGCGTCACC
GTTCAGCAGC CTCAGTTGAC ATTAACGGCG GCCGTCATTG GTGATGGCGC ACCGGCTAAT
GGGAAAACTG CAATCACCGT TGAGTTCACC GTTGCTGATT TTGAGGGGAA ACCCTTAGCC
GGGCAGGAGG TGGTGATAAC CACCAATAAT GGTGCGCTAC CGAATAAAAT CACGGAAAAG
ACAGATGCAA ATGGCGTCGC GCGCATTGCA TTAACCAATA CGACAGATGG CGTGACGGTA
GTCACAGCAG AAGTGGAGGG GCAACGGCAA AGTGTTGATA CCCACTTTGT TAAGGGTACT
ATCGCGGCGG ATAAATCCAC TCTGGCTGCG GTACCGACAT CTATCATCGC TGATGGTCTA
ATGGCTTCAA CCATCACGTT GGAGTTGAAG GATACCTATG GGGACCCGCA GGCTGGCGCG
AATGTGGCTT TTGACACAAC CTTAGGCAAT ATGGGCGTTA TCACGGATCA CAATGACGGC
ACTTATAGCG CACCATTGAC CAGTACCACG TTGGGGGTAG CAACAGTAAC GGTGAAAGTG
GATGGGGCTG CGTTCAGTGT GCCGAGTGTG ACGGTTAATT TCACGGCAGA TCCTATTCCA
GATGCTGGCC GCTCCAGTTT CACCGTCTCC ACACCGGATA TCTTGGCTGA TGGCACGATG
AGTTCCACAT TATCCTTTGT CCCTGTCGAT AAGAATGGCC ATTTTATCAG TGGGATGCAG
GGCTTGAGTT TTACTCAAAA CGGTGTGCCG GTGAGTATTA GCCCCATTAC CGAGCAGCCA
GATAGCTATA CCGCGACGGT GGTTGGGAAT AGTGTCGGTG ATGTCACAAT CACGCCGCAG
GTTGATACCC TGATACTGAG TACATTGCAG AAAAAAATAT CCCTATTCCC GGTACCTACG
CTGACCGGTA TTCTGGTTAA CGGGCAAAAT TTCGCTACGG ATAAAGGGTT CCCGAAAACG
ATCTTTAAAA ACGCCACATT CCAGTTACAG ATGGATAACG ATGTTGCTAA TAATACTCAG
TATGAGTGGT CGTCGTCATT CACACCCAAT GTATCGGTTA ACGATCAGGG TCAGGTGACG
ATTACCTACC AAACCTATAG CGAAGTGGCT GTGACGGCGA AAAGTAAAAA ATTCCCAAGT
TATTCGGTGA GTTATCGGTT CTACCCAAAT CGGTGGATAT ACGATGGCGG CAGATCGCTG
GTATCCAGTC TCGAGGCCAG CAGACAATGC CAAGGTTCAG ATATGTCTGC GGTTCTTGAA
TCCTCACGTG CAACCAACGG AACGCGTGCG CCTGACGGGA CATTGTGGGG CGAGTGGGGG
AGCTTGACCG CGTATAGTTC TGATTGGCAA TCTGGTGAAT ATTGGGTCAA AAAGACCAGC
ACGGATTTTG AAACCATGAA TATGGACACA GGCGCACTGC AACCAGGGCC TGCATACTTG
GCGTTCCCGC TCTGTGCGCT GTCAATATAA
 
Protein sequence
MSMYFNKIIS FNIISRIVIC IFLICGMFMA GASEKYDANA PQQVQPYSVS SSAFENLHPN 
NEMESSINPF SASDTERNAA IIDRANKEQE TEAVNKMIST GARLAASGRA SDVAHSMVGD
AVNQEIKQWL NRFGTAQVNL NFDKNFSLKE SSLDWLAPWY DSASFLFFSQ LGIRNKDSRN
TLNLGVGIRT LENGWLYGLN TFYDNDLTGH NHRIGLGAEA WTDYLQLAAN GYFRLNGWHS
SRDFSDYKER PATGGDLRAN AYLPALPQLG GKLMYEQYTG ERVALFGKDN LQRNPYAVTA
GINYTPVPLL TVGVDQRMGK SSKHETQWNL QMNYRLGESF QSQLSPSAVA GTRLLAESRY
NLVDRNNNIV LEYQKQQVVK LTLSPATISG LPGQVYQVNA QVQGASAVRE IVWSDAELIA
AGGTLTPLST TQFNLVLPPY KRTAQVSRVT DDLTANFYSL SALAVDHQGN RSNSFTLSVT
VQQPQLTLTA AVIGDGAPAN GKTAITVEFT VADFEGKPLA GQEVVITTNN GALPNKITEK
TDANGVARIA LTNTTDGVTV VTAEVEGQRQ SVDTHFVKGT IAADKSTLAA VPTSIIADGL
MASTITLELK DTYGDPQAGA NVAFDTTLGN MGVITDHNDG TYSAPLTSTT LGVATVTVKV
DGAAFSVPSV TVNFTADPIP DAGRSSFTVS TPDILADGTM SSTLSFVPVD KNGHFISGMQ
GLSFTQNGVP VSISPITEQP DSYTATVVGN SVGDVTITPQ VDTLILSTLQ KKISLFPVPT
LTGILVNGQN FATDKGFPKT IFKNATFQLQ MDNDVANNTQ YEWSSSFTPN VSVNDQGQVT
ITYQTYSEVA VTAKSKKFPS YSVSYRFYPN RWIYDGGRSL VSSLEASRQC QGSDMSAVLE
SSRATNGTRA PDGTLWGEWG SLTAYSSDWQ SGEYWVKKTS TDFETMNMDT GALQPGPAYL
AFPLCALSI