Gene Information |
Locus tag | YPK_2429 |
Symbol | |
ID | 6087298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | - |
Start bp | 2659892 |
End bp | 2662801 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641597495 |
Product | invasin region 3 |
Protein accession | YP_001721159 |
Protein GI | 170024654 |
COG category | |
COG ID | |
TIGRFAM ID | |
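The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene span includes the stop codon; both assumptions match the listed 2910 bp and 969 aa but are not stated in the record. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the record above.
START_BP = 2659892
END_BP = 2662801


def gene_length(start, end):
    """Length in bp of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Amino acids encoded by an unspliced CDS, assuming the span
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2910
print(protein_length(length_bp))  # 969
```

Because the gene is not spliced, no intron lengths need to be subtracted before dividing by three.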
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.621215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCTATGT ATTTTAATAA AATAATTTCA TTTAATATTA TTTCACGAAT AGTTATTTGT ATCTTTTTGA TATGTGGAAT GTTCATGGCT GGGGCTTCAG AAAAATATGA TGCTAACGCA CCGCAACAGG TCCAGCCTTA TTCTGTCTCT TCATCTGCAT TTGAAAATCT CCATCCTAAT AATGAAATGG AGAGTTCAAT CAATCCCTTT TCCGCATCGG ATACAGAAAG AAATGCTGCA ATAATAGATC GCGCCAATAA GGAGCAGGAG ACTGAAGCGG TGAATAAGAT GATAAGCACC GGGGCCAGGT TAGCTGCATC AGGCAGGGCA TCTGATGTTG CTCACTCAAT GGTGGGCGAT GCGGTTAATC AAGAAATCAA ACAGTGGTTA AATCGATTCG GTACGGCTCA AGTTAATCTG AATTTTGACA AAAATTTTTC GCTAAAAGAA AGCTCTCTTG ATTGGCTGGC TCCTTGGTAT GACTCTGCTT CATTCCTCTT TTTTAGTCAG TTAGGTATTC GCAATAAAGA CAGCCGCAAC ACACTTAACC TTGGCGTCGG GATACGTACA TTGGAGAACG GTTGGCTGTA CGGACTTAAT ACTTTTTATG ATAATGATTT GACCGGCCAC AACCACCGTA TCGGTCTTGG TGCCGAGGCC TGGACCGATT ATTTACAGTT GGCTGCCAAT GGGTATTTTC GCCTCAATGG ATGGCACTCG TCGCGTGATT TCTCCGACTA TAAAGAGCGC CCAGCCACTG GGGGGGATTT GCGCGCGAAT GCTTATTTAC CTGCACTCCC ACAACTGGGG GGGAAGTTGA TGTATGAGCA ATACACCGGT GAGCGTGTTG CTTTATTTGG TAAAGATAAT CTGCAACGCA ACCCTTATGC CGTGACTGCC GGGATCAATT ACACCCCCGT GCCTCTACTC ACTGTCGGGG TAGATCAGCG TATGGGGAAA AGCAGTAAGC ATGAAACACA GTGGAACCTC CAAATGAACT ATCGCCTGGG CGAGAGTTTT CAGTCGCAAC TTAGCCCTTC AGCGGTGGCA GGAACACGTC TACTGGCGGA GAGCCGCTAT AACCTTGTCG ATCGTAACAA TAATATCGTG TTGGAGTATC AGAAACAGCA GGTGGTTAAA CTGACATTAT CGCCAGCAAC TATCTCCGGC CTGCCGGGTC AGGTTTATCA GGTGAACGCA CAAGTACAAG GGGCATCTGC TGTAAGGGAA ATTGTCTGGA GTGATGCCGA ACTGATTGCC GCTGGCGGCA CATTAACACC ACTGAGTACC ACACAATTCA ACTTGGTTTT ACCGCCTTAT AAACGCACAG CACAAGTGAG TCGGGTAACG GACGACCTGA CAGCCAACTT TTATTCGCTT AGTGCGCTCG CGGTTGATCA CCAAGGAAAC CGATCTAACT CATTCACATT GAGCGTCACC GTTCAGCAGC CTCAGTTGAC ATTAACGGCG GCCGTCATTG GTGATGGCGC ACCGGCTAAT GGGAAAACTG CAATCACCGT TGAGTTCACC GTTGCTGATT TTGAGGGGAA ACCCTTAGCC GGGCAGGAGG TGGTGATAAC CACCAATAAT GGTGCGCTAC CGAATAAAAT CACGGAAAAG ACAGATGCAA ATGGCGTCGC GCGCATTGCA TTAACCAATA CGACAGATGG CGTGACGGTA GTCACAGCAG AAGTGGAGGG GCAACGGCAA AGTGTTGATA CCCACTTTGT TAAGGGTACT ATCGCGGCGG ATAAATCCAC TCTGGCTGCG GTACCGACAT CTATCATCGC TGATGGTCTA 
ATGGCTTCAA CCATCACGTT GGAGTTGAAG GATACCTATG GGGACCCGCA GGCTGGCGCG AATGTGGCTT TTGACACAAC CTTAGGCAAT ATGGGCGTTA TCACGGATCA CAATGACGGC ACTTATAGCG CACCATTGAC CAGTACCACG TTGGGGGTAG CAACAGTAAC GGTGAAAGTG GATGGGGCTG CGTTCAGTGT GCCGAGTGTG ACGGTTAATT TCACGGCAGA TCCTATTCCA GATGCTGGCC GCTCCAGTTT CACCGTCTCC ACACCGGATA TCTTGGCTGA TGGCACGATG AGTTCCACAT TATCCTTTGT CCCTGTCGAT AAGAATGGCC ATTTTATCAG TGGGATGCAG GGCTTGAGTT TTACTCAAAA CGGTGTGCCG GTGAGTATTA GCCCCATTAC CGAGCAGCCA GATAGCTATA CCGCGACGGT GGTTGGGAAT AGTGTCGGTG ATGTCACAAT CACGCCGCAG GTTGATACCC TGATACTGAG TACATTGCAG AAAAAAATAT CCCTATTCCC GGTACCTACG CTGACCGGTA TTCTGGTTAA CGGGCAAAAT TTCGCTACGG ATAAAGGGTT CCCGAAAACG ATCTTTAAAA ACGCCACATT CCAGTTACAG ATGGATAACG ATGTTGCTAA TAATACTCAG TATGAGTGGT CGTCGTCATT CACACCCAAT GTATCGGTTA ACGATCAGGG TCAGGTGACG ATTACCTACC AAACCTATAG CGAAGTGGCT GTGACGGCGA AAAGTAAAAA ATTCCCAAGT TATTCGGTGA GTTATCGGTT CTACCCAAAT CGGTGGATAT ACGATGGCGG CAGATCGCTG GTATCCAGTC TCGAGGCCAG CAGACAATGC CAAGGTTCAG ATATGTCTGC GGTTCTTGAA TCCTCACGTG CAACCAACGG AACGCGTGCG CCTGACGGGA CATTGTGGGG CGAGTGGGGG AGCTTGACCG CGTATAGTTC TGATTGGCAA TCTGGTGAAT ATTGGGTCAA AAAGACCAGC ACGGATTTTG AAACCATGAA TATGGACACA GGCGCACTGC AACCAGGGCC TGCATACTTG GCGTTCCCGC TCTGTGCGCT GTCAATATAA
Protein sequence | MSMYFNKIIS FNIISRIVIC IFLICGMFMA GASEKYDANA PQQVQPYSVS SSAFENLHPN NEMESSINPF SASDTERNAA IIDRANKEQE TEAVNKMIST GARLAASGRA SDVAHSMVGD AVNQEIKQWL NRFGTAQVNL NFDKNFSLKE SSLDWLAPWY DSASFLFFSQ LGIRNKDSRN TLNLGVGIRT LENGWLYGLN TFYDNDLTGH NHRIGLGAEA WTDYLQLAAN GYFRLNGWHS SRDFSDYKER PATGGDLRAN AYLPALPQLG GKLMYEQYTG ERVALFGKDN LQRNPYAVTA GINYTPVPLL TVGVDQRMGK SSKHETQWNL QMNYRLGESF QSQLSPSAVA GTRLLAESRY NLVDRNNNIV LEYQKQQVVK LTLSPATISG LPGQVYQVNA QVQGASAVRE IVWSDAELIA AGGTLTPLST TQFNLVLPPY KRTAQVSRVT DDLTANFYSL SALAVDHQGN RSNSFTLSVT VQQPQLTLTA AVIGDGAPAN GKTAITVEFT VADFEGKPLA GQEVVITTNN GALPNKITEK TDANGVARIA LTNTTDGVTV VTAEVEGQRQ SVDTHFVKGT IAADKSTLAA VPTSIIADGL MASTITLELK DTYGDPQAGA NVAFDTTLGN MGVITDHNDG TYSAPLTSTT LGVATVTVKV DGAAFSVPSV TVNFTADPIP DAGRSSFTVS TPDILADGTM SSTLSFVPVD KNGHFISGMQ GLSFTQNGVP VSISPITEQP DSYTATVVGN SVGDVTITPQ VDTLILSTLQ KKISLFPVPT LTGILVNGQN FATDKGFPKT IFKNATFQLQ MDNDVANNTQ YEWSSSFTPN VSVNDQGQVT ITYQTYSEVA VTAKSKKFPS YSVSYRFYPN RWIYDGGRSL VSSLEASRQC QGSDMSAVLE SSRATNGTRA PDGTLWGEWG SLTAYSSDWQ SGEYWVKKTS TDFETMNMDT GALQPGPAYL AFPLCALSI
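The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a rough sketch of how that cleanup and a GC-content check might look; `clean_sequence`, `gc_percent`, and `ends_with_stop` are hypothetical helper names, not part of any database tool. The stop codon set is the standard one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw):
    """Remove the spaces and line breaks between sequence blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq):
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks and last block of the gene sequence listed above.
first_blocks = clean_sequence("ATGTCTATGT ATTTTAATAA")
last_block = clean_sequence("GTCAATATAA")
print(first_blocks[:3])  # ATG
print(last_block[-3:])   # TAA
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field in the record.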