Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_2979 |
Symbol | |
ID | 5387181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 3352172 |
End bp | 3358168 |
Gene Length | 5997 bp |
Protein Length | 1998 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640865985 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001401941 |
Protein GI | 153950405 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTGT TGAGGTTTCT GCTGATCTCC CCCTTCGCCC TTATCAAGGG ACTCTATCGG TTAAGCGCTT ATTTTTTACG GTTAGTGGGC CGCCTATTGC GCCCAGTCGT CGGTAACCTG AATTGGCGCG CGCCACAATG GATGACGAAA ACTGCCAACG GGCTACACTG TGCTTTCAAC CGTTCAGAGC AATGGGTCGC TAAGCATCCC AAAGGGATCA GTGCGGCAAT AGTGCTGTTG ATGGCTGCCG CCAGCGCCGC ATTCTATGGC TATCACTGGT ATCTTAATCG GCCCCAACCG ATTGAACCGG CACCTATGGT TTATCAGGAA ACCAGTATCA GAGTCTCGGC ACCAAGAACG GTTAACTATC AGGCGCAAAA ACCAGAAGCC CAGCCGCTTA GCCTTAATTT TATGCATTCA GCCGCACCTA TCACCGCGAT GGGTCAGGTC ATCGATCAGG GCATCTCATT AACCCCCGCG ATGGAGGGTG AATGGAAATG GGCAACTGAA CGCACGTTGG TGTTTACCCC TAAAAAAGCC TGGCCGATGG GCGCTAACTA CCAAATTACG ATCGATGCAG AAAAACTTTT AGCACCACAG ATTAAGCTCA ATCAGACCGA ACTTAATTTC ACAACGCCAG CTTTTGCTTA TCAATTGGAA AAAGCGGAAT ATTATCAGGA TCCGCAAGAG GCCCAAAAGC GCAGCACCAT TTTCCATGTG CAATTTAATG CCCCGGTTGA TGTTGCCAGC TTTGAAAAAC AGATCCTCTT GGGATTGGTC GAAGGTAAAT CCAAGTCAGA GAAGAAACTT AATTTCTCCG TCGTTTATGA TGAGAAAAAG CTTAATGCCT GGATACATTC GCAACCCTTG ATGCCAATGG ATAAAGGCGG TTCGGTCCAT CTATCGATTA ATAAAGGGGT GAATGCCAGT GTTGCCGCCA CGCCTACGAC ACAGGCACAG AATAAATGGG TATCCGTCCC TAACCTATAT AGCCTGGCGG TTAATAGTAT TAATGCCACG TTGGTTGAGT CAGATAACAA TAATGGTGAG CGGGCCTTAA TTATTGCTAT CAGCGACGCG GTTAAAGATA AAGAGATCAA AAATGCGGTC AAAGCCTGGT TACTGCCGCA ACATAATTTT CAAGCGAAAG AGAGCGCCAA AACATCAACC GATTTCTATC CTTGGGATAT GGATGATATT GACGATAATC TGCTGCAACA ATCAACGCCG CTGGCGCTGA CCCTCAATGA GGCCGAGCAA GAGTATCAGC CAATATTCAG CTTTAAGTTT GATGCCCCTT CCTATCGCAC ACTGCTGATC GAGGTTAACA ATAGCCTGAC ATCGGTGGGC GGTTATAAAA TGCCGGAAAA AATCTACCAA ATAGTCAGGG TTCCCGATTA CCCTAAGACG CTGCGCTTTA TGTCACAAGG CTCGTTATTA TCGATGCAGG GTGATAAGCA GATCAGCGTC GCCGCCCGTA ATATGACTGG CATGAAGCTG GATATTAAGC GGGTTATTCC TAGCCAGTTA CAACATATTG TGTCATTTAA AAGCAGCGAA TATTCATCAG CTCACTTTAA CCGCCTGAGT GATGAATATT TTACTGAACA CTTCCAGTAC CAAACCGCGC TGAATAATGA CAACCCCGGC GAGATCAATT ATCAAGGGGT CGATCTGTCC CGTTATCTTG CAAATAATCC GAGTGCTCGG CGTGGGGTGT TCTTACTTAC CCTGTCAGCT TGGGATCCGG AGAAAAGGGA TAATCAGCAA CACAGCGAGG AAGACTACGA CGAAGACCAG GAATGGGTCG GCGATTCACG CTTTGTGGTG ATCACGGACT TAGGCATTAT CACCAAGCAA TCGCAGGATA GATCCCGTGA TGTGTTTGTG CAATCCATTC ACTCGGGTCT GCCCACCGCC GATGCCAAAG TCTCTGTGGT GGCAAAAAAT GGTGTGGTCT TACTGAGCCA AATCACCGAT AGCAAAGGGC ATGTTCATTT TCCTGCGCTG GACGCCTTTA AAAATGAACG CCAACCGGTC ATGTTCCTGG TGGAAAAAGA AGGGGATGTC TCCTTCCTGC CCACCCGAGC CACCTATGAC CGTAACCTTG ATTTCTCACG TTTTGATATT GATGGCGAAG AGACACCGTC CGACCCACGT ACTCTAAGCA GCTATCTGTT TTCTGACCGG GGAGTTTATC GCCCAGGCGA TCGCTTCAAT ATTGGTCTGA TCACCCGGAC CGCCAACTGG GCTACCGCAC TCGATGGCGT CCCCCTGCGG GCGGAGATCC GTGACCCACG AGATACCTTG ATGAGTACCC TGCCGATAAC CTTGGACAGC AGTGGTTTCA ATGAGCTCAG CTATACGACC GGTGAAAACT CACCTACCGG TGAATGGAAC GTCTATCTCT ATCTGGTTGG TAAGAATAAT GAAACGTCGA TGTTGCTGGG GCACACCACC GTAAATGTTA AAGAGTTCGA GCCTGATCGC TTAAAAGTGC AACTGCAACT GACGCCAGAG CGTCAACAAG GCTGGGTTAA ACCGCAGGAG CTGCAAGCCA ATATCAATGT ACAAAATTTA TTCGGTACAC CAGCACAGGA GCGCCGTGTC ACCTCTAGAC TGATCTTGCG GCCAATGTAC CCGAGTTTTG CCCCGTTCCC TGATTACCTG TTCTATGAGA ATCGTCATAA CAGCGATGGT TTTGAGACTG AACTGGAAGA ACAAACGACC GATCTACAAG GGATGGCGAC CATCCCATTG GATCTGAAAT CCTATGCTGA CGCCACCTAT CAACTGCAAT TGTTGTCGGA AGCCTTTGAA GCGGGTGGAG GCCGCTCCGT GGCCGCGACT GCGCGGGTTC TGGTTTCACC TTACGACTCT CTGGTTGGGG TGAAAGCCGA TGGCGATCTG AGTTATATCA ACCGTGATGC CGTGCGTAAG CTGAATATTA TTGCTGTGGA CCCGAGCCTG AATAAAATTG CGCTGCCAGA CTTGAGTCTG TCATTGATTG AGCAGAAGTA TATTTCAGTG CTAACCAAAC AGGATTCAGG CGTTTATAAA TATCAATCAC GGCTAAAGGA GCAGTTGGTC TCAGAGCAAC CGCTAAAAAT CAGCCCGACA GGGACGGATT TCACCCTAGT GACCCAGCAG CCTGGTGATT TTATTCTGGT GGTTAAGGAC AGTCAGGGGC AGGTTCTGAA CCGTATTAGT TATACGGTGG CGGGTAACGC AAACCTGACC CGCTCACTGG ATCGCAACAC CGAATTAAAG CTAAAACTGA ATCAGGCCGA ATATCTGCAA GGCGAAGAAA TTGAGATTGC GATTAATGCA CCTTATGCCG GTAGCGGTCT GATCACGATA GAAAAAGATA AAGTGTATAG CTGGCAGTGG TTCCACAGTG ATACCACCAG CTCTGTGCAG AGAATCCGCA TCCCACCGGC AATGGAAGGC AATGGCTATA TCAACGTACA ATTCGTGCGT GATGTGAATT CCGATGAGAT CTTTATGAGC CCACTGAGTT ACGGTGTGAT GCCATTTAAG ATCAGTACCA AAGCGCGTCA GGCGGCTATC GAGTTAGCGT CGCCGTCAGT CATTAAACCG GGTGAAGTGT TACCGATTAA AGTGACCACC GATTCACCAC AGCGCGTGGT GGTGTTTGCC GTCGATGAAG GTATTTTGCA GGTGGCACGC TATCGCCTGA AAGATCCACT GGACTACTTC TTCCGTAAAC GTGAACTGAG TGTACAGAGT GCACAAATTC TCGATTTGAT CCTGCCGGAA TTCAGCAAGC TGATGGCACT GACCTCCGCA CCCGGAGGCG ATGCCGGGGA AGGGCTGGAT CTGCACCTCA ATCCGTTTAA ACGCAAACAA GACAAGCCGG TGGCTTATTG GTCTGGTATC ACCGAAGTGA ATGGTGAAAC CACCTTCAAT TACCCGATTC CCGACTATTT CAATGGTAAA ATTCGCGTGA TGGCCATCTC TGCGACCCCT GATCGCATTG GTAAAGTCCA GACCTCGACC ACCGTGCGGG ATAACTTTAT TCTGACACCG AATGTCCCCG CGATGGTAGC ACCGGGAGAT GAATTTGATG TCACCGTGGG TGTGAGTAAC AACCTGCAAG GATTGAAGGG TAAAGCGGTT GATATCACCG TGCGTCTGAC ACCACCGCCA CAACTGGAAG TGGTGGGTGA AGCGCAACAC AGCCTGTCGC TGGCAGAAAA ACGTGAAACG CTTGTCAGCT TCCGCCTGCG CGCCCGTTCA GCATTGGGTG ATGCTCCACT GGTGTTTGAT GCCAGCTATG GCTCTCAATC CAGCCGCCGG ACGGTCAGTA CCTCGGTACG CCCGGCGATG CCATTCCGAA CGCAATCGGT GATGGGCCGG ATGGAGGGTA ACAAGCATAC TGTGACCAAT CTGCGCCAGA TGTTTGATAA TTATGCTCAA CGTCAGGCGA CCGCTTCCCA CTCACCGTTG GTCTTAACCC AAGGTCTGGC GCGGTACCTG GCTGATTACC CGTACTACAG TTCTGAGCAA ATTGTCAGCC GCTCGATTCC GTTGATTATG CAAAGCAAAC ATCCTGAAAT GGACAGTGCC CTCAATCAGA ATGAGGTCCG TGATCAACTG AAAAACATGC TACGTATCCT GAGCTCTCGG CAGAATAGCA CTGGTGCAAT CGGTTTGTGG CACGCCTCCC CTACCCCTGA TCCGTTTGTC ACACCTTATG TCGTGCAATT TCTGCTGGAA GCGAAATCTG CCGGTTACAG CTTGCCGAAT GACATCTTGG AGGGGGCCAA CAACGCACTG CGTCTGTTAG CGGCTCGACC TTATGATGAC CTTTACTCTC TGCGTTTGCG GGCCTTTGCT GTTTACCTGT TGACCTTGCA GGGGGAGATC ACCACCAATA CTCTGGCATC GGTGCAAAGT ACGTTACAGC AACTTTATCC TGACAGTTGG CAGACTGATC TGAGTGCCAT TTATCTGGCC TCATCATACC GTCTGCTCAA AATGGATGAC GAAGCCAATA AACTGCTGCA ACCCACCTGG AAACAACTGG GTAAAGCCTA CAGCAAGGCC TGGTGGACGC AGAATTATTT TGATCCACTG GTGCAAGATG CAACCCGGTT GTATCTGATC ACTCGCCATT TCCCAGAGAA AGTCTCTTCT ATTCCGCCAC AAGCACTGGA AAATATGGTG CTGGCACTGA GGGATGAGCA TTACACGACC TATTCATCCG CGATGAGCAT TCTGGCACTG GAAAGTTACA CCAGCCAGGT AGCCGCCCAG CAAGATACGC CAGAAACCCT GCAAATCATC GAGATCAGTA AAAGCAAAGG GATCGACCCT AACGTTATCT CAACGCTGAA CGGCCTGTTC GTTCAAGGTG ATTTTACCGG TGAGGCTAAA GCGATTCAGT TTAACAACTA TGCCTCGGCA CCCGCTTGGT ATGTGGTCAA TCAATCAGGC TATGACCTTC AGCCACCAAA AGACGCCATC TCTAATGGGC TGGAAATCAG CCGCAGCTAC ACCGATGAGC AGGGTAAGCC GGTGACCCAA GTCACCTTAG GGCAGAAAGT TAACGTGCAC CTAAAAATCC GGGCTAACGC TAAACAAGGT CAAAATAATC TGGCGATTGT CGATCTACTG CCGGGCGGTT TTGAAGTGGT ACAACAAACG GCACCTGAAC CAGAGTTTTA TGATAATCAG GATGATCAGG ATGATCAGGA TGATCAGGAT GAGGAAACTG GCAGCGGCTG GCAGTCGCCG CTAATGGTAT CTGGCTCCAG TTGGTACCCT GACTACAGTG ATATTCGTGA AGATCGCGTG ATCATTTATG GCAGTGCCAG TACCGACGTT AAAGAGTTTA TCTACCAAAT CAAATCAACC AATACGGGTC GCTTTGTGGT GCCACCGGCT TACGGCGAAG CCATGTATGA TCGTAATGTA CAGGCGCTGT CGGTCGGTAA AGGGCATATC CTTGTCGTTC CACCTGAGGC AAAATAG
|
Protein sequence | MDLLRFLLIS PFALIKGLYR LSAYFLRLVG RLLRPVVGNL NWRAPQWMTK TANGLHCAFN RSEQWVAKHP KGISAAIVLL MAAASAAFYG YHWYLNRPQP IEPAPMVYQE TSIRVSAPRT VNYQAQKPEA QPLSLNFMHS AAPITAMGQV IDQGISLTPA MEGEWKWATE RTLVFTPKKA WPMGANYQIT IDAEKLLAPQ IKLNQTELNF TTPAFAYQLE KAEYYQDPQE AQKRSTIFHV QFNAPVDVAS FEKQILLGLV EGKSKSEKKL NFSVVYDEKK LNAWIHSQPL MPMDKGGSVH LSINKGVNAS VAATPTTQAQ NKWVSVPNLY SLAVNSINAT LVESDNNNGE RALIIAISDA VKDKEIKNAV KAWLLPQHNF QAKESAKTST DFYPWDMDDI DDNLLQQSTP LALTLNEAEQ EYQPIFSFKF DAPSYRTLLI EVNNSLTSVG GYKMPEKIYQ IVRVPDYPKT LRFMSQGSLL SMQGDKQISV AARNMTGMKL DIKRVIPSQL QHIVSFKSSE YSSAHFNRLS DEYFTEHFQY QTALNNDNPG EINYQGVDLS RYLANNPSAR RGVFLLTLSA WDPEKRDNQQ HSEEDYDEDQ EWVGDSRFVV ITDLGIITKQ SQDRSRDVFV QSIHSGLPTA DAKVSVVAKN GVVLLSQITD SKGHVHFPAL DAFKNERQPV MFLVEKEGDV SFLPTRATYD RNLDFSRFDI DGEETPSDPR TLSSYLFSDR GVYRPGDRFN IGLITRTANW ATALDGVPLR AEIRDPRDTL MSTLPITLDS SGFNELSYTT GENSPTGEWN VYLYLVGKNN ETSMLLGHTT VNVKEFEPDR LKVQLQLTPE RQQGWVKPQE LQANINVQNL FGTPAQERRV TSRLILRPMY PSFAPFPDYL FYENRHNSDG FETELEEQTT DLQGMATIPL DLKSYADATY QLQLLSEAFE AGGGRSVAAT ARVLVSPYDS LVGVKADGDL SYINRDAVRK LNIIAVDPSL NKIALPDLSL SLIEQKYISV LTKQDSGVYK YQSRLKEQLV SEQPLKISPT GTDFTLVTQQ PGDFILVVKD SQGQVLNRIS YTVAGNANLT RSLDRNTELK LKLNQAEYLQ GEEIEIAINA PYAGSGLITI EKDKVYSWQW FHSDTTSSVQ RIRIPPAMEG NGYINVQFVR DVNSDEIFMS PLSYGVMPFK ISTKARQAAI ELASPSVIKP GEVLPIKVTT DSPQRVVVFA VDEGILQVAR YRLKDPLDYF FRKRELSVQS AQILDLILPE FSKLMALTSA PGGDAGEGLD LHLNPFKRKQ DKPVAYWSGI TEVNGETTFN YPIPDYFNGK IRVMAISATP DRIGKVQTST TVRDNFILTP NVPAMVAPGD EFDVTVGVSN NLQGLKGKAV DITVRLTPPP QLEVVGEAQH SLSLAEKRET LVSFRLRARS ALGDAPLVFD ASYGSQSSRR TVSTSVRPAM PFRTQSVMGR MEGNKHTVTN LRQMFDNYAQ RQATASHSPL VLTQGLARYL ADYPYYSSEQ IVSRSIPLIM QSKHPEMDSA LNQNEVRDQL KNMLRILSSR QNSTGAIGLW HASPTPDPFV TPYVVQFLLE AKSAGYSLPN DILEGANNAL RLLAARPYDD LYSLRLRAFA VYLLTLQGEI TTNTLASVQS TLQQLYPDSW QTDLSAIYLA SSYRLLKMDD EANKLLQPTW KQLGKAYSKA WWTQNYFDPL VQDATRLYLI TRHFPEKVSS IPPQALENMV LALRDEHYTT YSSAMSILAL ESYTSQVAAQ QDTPETLQII EISKSKGIDP NVISTLNGLF VQGDFTGEAK AIQFNNYASA PAWYVVNQSG YDLQPPKDAI SNGLEISRSY TDEQGKPVTQ VTLGQKVNVH LKIRANAKQG QNNLAIVDLL PGGFEVVQQT APEPEFYDNQ DDQDDQDDQD EETGSGWQSP LMVSGSSWYP DYSDIREDRV IIYGSASTDV KEFIYQIKST NTGRFVVPPA YGEAMYDRNV QALSVGKGHI LVVPPEAK
|
| |