Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_3053 |
Symbol | |
ID | 6091019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | + |
Start bp | 3354616 |
End bp | 3360594 |
Gene Length | 5979 bp |
Protein Length | 1992 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641598133 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001721779 |
Protein GI | 170025274 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTGT TGAGGTTTCT GCTGATCTCC CCCTTCGCCC TTATCAAGGG ACTCTATCGG TTAAGCGCTT ATCTTTTACG GTTAGTGGGC CGCCTATTGC GCCCAGTCGT CGGTAACCTG AATTGGCGCG CGCCACAATG GATGACGAAA ACCGCCAACG GGCTACACTG TGCTTTCAAC CGTTCAGAGC AATGGGTCGC TAAGCATCCC AAAGGGATCA GTGCGGCAAT AGTGCTGTTG ATGGCTGCCG CCAGCGCCGC ATTCTATGGC TATCACTGGT ACCTTAATCG GCCCCAACCG ATTGAACCGG CACCTATGGT TTATCAGGAA ACCAGTATCA GAGTCTCGGC ACCAAGAACG GTTAACTATC AGGCGCAAAA ACCAGAAGCC CAACCGCTTA GCCTTAATTT TATGCATTCA GCCGCACCTA TCACCGCGAT GGGTCAGGTC GTCGATCAGG GCATCTCATT AACCCCCGCG ATAGAGGGTG AATGGAAATG GGCAACTGAA CGCACGTTGG TGTTTACCCC CAAAAAAGCC TGGCCGATGG GCGCTAACTA CCAAATTACG ATCGATGCAG AAAAACTTTT AGCACCACAG ATTAAGCTCA ATCAGACCGA ACTTAATTTC ACAACGCCAG CTTTTGCTTA TCAATTGGAA AAAGCGGAAT ATTATCAGGA TCCGCAAGAG GCCCAAAAGC GCAGCACTAT TTTCCATGTG CAATTTAATG CCCCAGTTGA TGTTGCCAGC TTTGAAAAAC AGATCCTCTT GGGATTGGTC GAAGGTAAAT CCAAGTCAGA GAAGAAACTT AATTTCTCCG TCGTTTATGA TGAGAAAAAG CTTAATGCCT GGATACATTC GCAACCCTTG ATGCCAATGG ATAAAGGCGG TTCGGTCCAT CTATCGATTA ATAAAGGGGT GAATGCCAGT GTTGCCGCCA CGCCTACGAC ACAGGCACAG AATAAATGGG TATCCGTCCC TAACCTATAT AGCCTGGCGG TTAATAGTAT TAATGCCACG TTGGTTGAGT CAGATAACAA TAATGGTGAG CGGGCCTTAA TTATTGCTAT CAGCGACGCG GTTAAAGATA AAGAGATCAA AAATGCGGTC AAAGCCTGGT TACTGCCGCA ACATAATTTT CAAGCGAAAG AGAGCGCCAA AACATCAACC GATTTCTATC CTTGGGATAT GGATGATATT GACGATAATC TGCTGCAACA ATCAACGCCG CTGGCGCTGA CCCTCAATGA GGCCGAGCAA GAGTATCAGC CAATATTCAG CTTTAAGTTT GATGCCCCTT CCTATCGCAC ACTGCTGATC GAGGTTAACA ATAGCCTGAC ATCGGTGGGT GGTTATAAAA TGCCGGAAAA AATCTACCAA ATAGTCAGGG TTCCCGATTA CCCTAAGACG CTGCGCTTTA TGTCACAAGG CTCGTTATTA TCGATGCAGG GTGATAAGCA GATCAGCGTC GCCGCCCGTA ATATGACTGG CATGAAGCTG GATATTAAGC GGGTTATTCC TAGCCAGTTA CAACATATTG TGTCATTTAA AAGCAGCGAA TATTCATCAG CTCACTTTAA CCGCCTGAGT GATGAATATT TTACTGAACA CTTCCAGTAC CAAACCGCGC TGAATAATGA CAACCCCGGC GAGATCAATT ATCAAGGGGT CGATCTGTCC CGTTATCTTG CAAATAATCC GAGTGCTCGG CGTGGGGTGT TCTTACTTAC CCTGTCAGCT TGGGATCCGG AGAAAAGGGA TAATCAGCAA CACAGCGAGG AAGACTACGA CGAAGACCAG GAATGGGTCG GCGATTCACG CTTTGTGGTG ATCACGGACT TAGGCATTAT CACCAAGCAA TCACAGGATA GATCCCGTGA TGTGTTCGTG CAATCCATTC ACTCGGGTCT GCCCACCGCC GATGCCAAAG TCTCTGTGGT GGCAAAAAAT GGTGTGGTCT TACTGAGCCA AATCACCGAT AGCAAAGGGC ATGTTCATTT TCCTGCGCTG GACGCCTTTA AAAATGAACG CCAACCGGTC ATGTTCCTGG TGGAAAAAGA AGGGGATGTC TCCTTCCTGC CCACCCGAGC CACCTATGAC CGTAACCTTG ATTTCTCACG TTTTGATATT GATGGCGAAG AGACACCGTC CGACCCACGG ACTCTAAGCA GCTATCTGTT TTCTGACCGG GGAGTTTATC GCCCAGGCGA TCGCTTCAAT ATTGGTCTGA TCACCCGGAC CGCCAACTGG GCTACCGCAC TCGATGGCGT CCCCCTGCGG GCGGAGATCC GTGACCCACG TGATACCTTG ATGAGTACCC TGCCGATAAC CTTGGACAGC AGTGGTTTCA ATGAGCTCAG CTATACGACC GGTGAAAACT CACCTACCGG TGAATGGAAC GTCTATCTCT ATCTGGTTGG TAAGAATAAT GAAACGTCGA TGTTGCTGGG GCACACCACC GTAAATGTTA AAGAGTTCGA GCCTGATCGC CTAAAAGTGC AACTGCAACT GACGCCAGAG CGTCAACAAG GCTGGGTTAA ACCGCAGGAG CTGCAAGCCA ATATCAATGT ACAAAATTTA TTCGGTACAC CAGCACAGGA GCGCCGTGTC ACCTCTAGAC TGATCTTGCG GCCAATGTAC CCGAGTTTTG CCCCGTTCCC TGATTACCTG TTCTATGAGA ATCGTCATAA CAGCGATGGT TTTGAGACTG AACTGGAAGA ACAAACGACC GATCTACAAG GGATGGCGAC CATCCCATTG GATCTGAAAT CCTATGCTGA CGCCACCTAT CAACTGCAAT TGTTGTCGGA AGCCTTTGAA GCGGGTGGAG GCCGCTCCGT GGCCGCGACT GCGCGGGTTC TGGTTTCACC TTACGACTCT CTGGTTGGGG TGAAAGCCGA TGGCGATCTG AGTTATATCA ACCGTGATGC CGTGCGTAAG CTGAATATTA TTGCTGTGGA CCCGAGCCTG AATAAAATTG CGCTGCCAGA CTTGAGTCTG TCATTGATTG AGCAGAAGTA TATTTCAGTG CTAACCAAAC AGGATTCAGG CGTTTATAAA TATCAATCAC GGCTAAAGGA GCAGTTGGTC TCAGAGCAAC CGCTAAAAAT CAGCCCGACA GGGACGGATT TCACCCTAGT GACCCAGCAG CCTGGTGATT TTATTCTGGT GGTTAAGGAC AGTCAGGGGC AGGTTCTGAA CCGTATTAGT TATACGGTGG CGGGTAACGC AAACCTGACC CGCTCACTGG ATCGCAACAC CGAATTAAAG CTAAAACTGA ATCAGGCCGA ATATCTGCAA GGCGAAGAAA TTGAGATTGC GATTAATGCA CCTTATGCCG GTAGCGGTCT GATCACGATA GAAAAAGATA AAGTGTATAG CTGGCAGTGG TTCCACAGTG ATACCACCAG CTCTGTGCAG AGAATCCGCA TCCCACCGGC AATGGAAGGC AATGGCTATA TCAACGTACA ATTCGTGCGT GATGTGAATT CCGATGAGAT CTTTATGAGC CCACTGAGTT ACGGTGTAAT GCCATTTAAG ATCAGTACCA AAGCGCGTCA GGCGGCTATC GAGTTAGCGT CGCCGTCAGT CATTAAACCG GGTGAAGTGT TACCGATTAA AGTGACCACC GATTCACCAC AGCGCGTGGT GGTGTTTGCC GTCGATGAAG GTATTTTGCA GGTGGCACGC TATCGCCTGA AAGATCCACT GGATTACTTC TTCCGTAAAC GTGAACTGAG TGTACAGAGT GCACAAATTC TCGATTTGAT CCTGCCGGAA TTCAGCAAGC TGATGGCACT GACCTCCGCA CCTGGAGGCG ACGCCGGGGA AGGGCTGGAC CTGCACCTCA ATCCGTTTAA ACGCAAACAA GACAAGCCGG TGGCTTATTG GTCTGGTATC ACCGAAGTGA ATGGTGAAAC CACCTTCAAT TACCCGATTC CCGACTATTT CAATGGTAAA GTTCGCGTGA TGGCCATCTC TGCGACCCCT GATCGCATTG GTAAAGTCCA GACCTCGACC ACCGTGCGGG ATAACTTTAT TCTGACGCCG AATGTCCCCG CGATGGTAGC ACCGGGAGAT GAATTTGATG TCACCGTGGG TGTGAGTAAC AACCTGCAAG GATTGAAGGG TAAAGCGGTT GATATCACCG TGCGTCTGAC ACCACCGCCA CAATTGGAAG TGGTGGGTGA AGCGCAACAC AGCCTGTCGC TGGCAGAAAA ACGTGAAACG CTTGTCAGCT TCCGCCTACG CGCCCGTTCA GCATTGGGTG ATGCTCCACT GGTGTTTGAT GCCAGCTATG GCTCTCAATC CAGCCGCCGG ACGGTCAGTA CCTCGGTACG CCCGGCGATG CCATTCCGAA CGCAATCGGT GATGGGCCGG ATGGAGGGTA ACAAGCATAC TGTGACCAAT CTGCGCCAGA TGTTTGATAA TTATGCTCAA CGTCAGGCGA CCGCTTCCCA CTCACCGTTG GTCTTAACCC AAGGTCTGGC GCGGTACCTG GCTGATTACC CGTACTACAG TTCTGAGCAA ATTGTCAGCC GCTCGATTCC GTTGATTATG CAAAGCAAAC ATCCTGAAAT GGACAGTGCC CTCAATCAGA ATGAGGTCCG TGATCAACTG AAAAACATGC TACGTATCCT GAGCTCTCGG CAGAATAGCA CTGGTGCAAT CGGTTTGTGG CACGCCTCCC CTACCCCTGA TCCGTTTGTC ACACCTTATG TCGTGCAATT TCTGCTGGAA GCGAAATCTG CCGGTTACAG CTTGCCGAAT GACATCTTGG AGGGGGCCAA CAACGCACTG CGTCTGTTAG CGGCTCGACC TTATGATGAC CTTTACTCTC TGCGTTTGCG GGCCTTTGCT GTTTACCTGT TGACCTTGCA GGGGGAGATC ACCACCAATA CTCTGGCATC GGTGCAAAGT ACGTTACAGC AACTTTATCC TGACAGTTGG CAGACTGATC TGAGTGCCAT TTATCTGGCC TCATCATACC GTCTGCTCAA AATGGATGAC GAAGCCAATA AACTGCTGCA ACCCACCTGG AAACAACTGG GTAAAGCCTA CAGCAAGGCC TGGTGGACGC AGAATTATTT TGATCCACTG GTGCAAGATG CAACCCGGTT GTATCTGATC ACTCGCCATT TCCCAGAGAA AGTCTCTTCT ATTCCGCCAC AAGCACTGGA AAATATGGTG CTGGCACTGA GGGATGAGCA TTACACGACC TATTCATCCG CGATGAGCAT TCTGGCACTG GAAAGTTACA CCAGCCAGGT AGCCGCCCAG CAAGATACGC CAGAAACCCT GCAAATCATC GAGATCAGTA AAAGCAAAGG GATCGACCCT AACGTTATCT CAACGCTGAA CGGCCTGTTC GTTCAAGGTG ATTTTACCGG TGAGGCTAAA GCGATTCAGT TTAACAACTA TGCCTCGGCA CCCGCTTGGT ATGTGGTCAA TCAATCAGGC TATGACCTTC AGCCACCAAA AGACGCCATC TCTAATGGGC TGGAAATCAG CCGCAGCTAC ACCGATGAGC AGGGTAAGCC GGTGACCCAA GTCACCTTAG GGCAGAAAGT TAACGTGCAC CTAAAAATCC GGGCTAACGC TAAACAAGGT CAAAATAATC TGGCGATTGT CGATCTACTG CCGGGCGGTT TTGAAGTGGT ACAACAAACG GCACCTGAAC CAGAGTTTTA TGATAATCAG GATGATCAGG ATGAGGAAAC TGGCAGCGGC TGGCAGTCGC CGCTAATGGT ATCTGGCTCC AGTTGGTACC CTGACTACAG TGATATTCGT GAAGATCGCG TGATCATTTA TGGCAGTGCC AGTACCGACG TTAAAGAGTT TATCTACCAA ATCAAATCAA CCAATACGGG TCGCTTTGTG GTGCCACCGG CTTACGGCGA AGCCATGTAT GATCGTAATG TACAGGCGCT GTCGGTCGGT AAAGGGCATA TCCTTGTCGT TCCACCTGAG GCAAAATAG
|
Protein sequence | MDLLRFLLIS PFALIKGLYR LSAYLLRLVG RLLRPVVGNL NWRAPQWMTK TANGLHCAFN RSEQWVAKHP KGISAAIVLL MAAASAAFYG YHWYLNRPQP IEPAPMVYQE TSIRVSAPRT VNYQAQKPEA QPLSLNFMHS AAPITAMGQV VDQGISLTPA IEGEWKWATE RTLVFTPKKA WPMGANYQIT IDAEKLLAPQ IKLNQTELNF TTPAFAYQLE KAEYYQDPQE AQKRSTIFHV QFNAPVDVAS FEKQILLGLV EGKSKSEKKL NFSVVYDEKK LNAWIHSQPL MPMDKGGSVH LSINKGVNAS VAATPTTQAQ NKWVSVPNLY SLAVNSINAT LVESDNNNGE RALIIAISDA VKDKEIKNAV KAWLLPQHNF QAKESAKTST DFYPWDMDDI DDNLLQQSTP LALTLNEAEQ EYQPIFSFKF DAPSYRTLLI EVNNSLTSVG GYKMPEKIYQ IVRVPDYPKT LRFMSQGSLL SMQGDKQISV AARNMTGMKL DIKRVIPSQL QHIVSFKSSE YSSAHFNRLS DEYFTEHFQY QTALNNDNPG EINYQGVDLS RYLANNPSAR RGVFLLTLSA WDPEKRDNQQ HSEEDYDEDQ EWVGDSRFVV ITDLGIITKQ SQDRSRDVFV QSIHSGLPTA DAKVSVVAKN GVVLLSQITD SKGHVHFPAL DAFKNERQPV MFLVEKEGDV SFLPTRATYD RNLDFSRFDI DGEETPSDPR TLSSYLFSDR GVYRPGDRFN IGLITRTANW ATALDGVPLR AEIRDPRDTL MSTLPITLDS SGFNELSYTT GENSPTGEWN VYLYLVGKNN ETSMLLGHTT VNVKEFEPDR LKVQLQLTPE RQQGWVKPQE LQANINVQNL FGTPAQERRV TSRLILRPMY PSFAPFPDYL FYENRHNSDG FETELEEQTT DLQGMATIPL DLKSYADATY QLQLLSEAFE AGGGRSVAAT ARVLVSPYDS LVGVKADGDL SYINRDAVRK LNIIAVDPSL NKIALPDLSL SLIEQKYISV LTKQDSGVYK YQSRLKEQLV SEQPLKISPT GTDFTLVTQQ PGDFILVVKD SQGQVLNRIS YTVAGNANLT RSLDRNTELK LKLNQAEYLQ GEEIEIAINA PYAGSGLITI EKDKVYSWQW FHSDTTSSVQ RIRIPPAMEG NGYINVQFVR DVNSDEIFMS PLSYGVMPFK ISTKARQAAI ELASPSVIKP GEVLPIKVTT DSPQRVVVFA VDEGILQVAR YRLKDPLDYF FRKRELSVQS AQILDLILPE FSKLMALTSA PGGDAGEGLD LHLNPFKRKQ DKPVAYWSGI TEVNGETTFN YPIPDYFNGK VRVMAISATP DRIGKVQTST TVRDNFILTP NVPAMVAPGD EFDVTVGVSN NLQGLKGKAV DITVRLTPPP QLEVVGEAQH SLSLAEKRET LVSFRLRARS ALGDAPLVFD ASYGSQSSRR TVSTSVRPAM PFRTQSVMGR MEGNKHTVTN LRQMFDNYAQ RQATASHSPL VLTQGLARYL ADYPYYSSEQ IVSRSIPLIM QSKHPEMDSA LNQNEVRDQL KNMLRILSSR QNSTGAIGLW HASPTPDPFV TPYVVQFLLE AKSAGYSLPN DILEGANNAL RLLAARPYDD LYSLRLRAFA VYLLTLQGEI TTNTLASVQS TLQQLYPDSW QTDLSAIYLA SSYRLLKMDD EANKLLQPTW KQLGKAYSKA WWTQNYFDPL VQDATRLYLI TRHFPEKVSS IPPQALENMV LALRDEHYTT YSSAMSILAL ESYTSQVAAQ QDTPETLQII EISKSKGIDP NVISTLNGLF VQGDFTGEAK AIQFNNYASA PAWYVVNQSG YDLQPPKDAI SNGLEISRSY TDEQGKPVTQ VTLGQKVNVH LKIRANAKQG QNNLAIVDLL PGGFEVVQQT APEPEFYDNQ DDQDEETGSG WQSPLMVSGS SWYPDYSDIR EDRVIIYGSA STDVKEFIYQ IKSTNTGRFV VPPAYGEAMY DRNVQALSVG KGHILVVPPE AK
|
| |