Gene Information |
Locus tag | YpAngola_A1880 |
Symbol | |
ID | 5800351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 1948622 |
End bp | 1954600 |
Gene Length | 5979 bp |
Protein Length | 1992 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641339811 |
Product | alpha-2-macroglobulin domain-containing protein |
Protein accession | YP_001606366 |
Protein GI | 162419079 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
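The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the final codon is a stop codon which is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon gives the amino acid count.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(1948622, 1954600)
print(gene_len, protein_len)
```

For this record the result agrees with the listed Gene Length (5979 bp) and Protein Length (1992 aa).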
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0009665 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATTTGT TGAGGTTTCT GCTGATCTCC CCCTTTGCCC TTATCAAGGG ACTCTATCGG TTAAGCGCTT ATCTTTTACG GTTAGTGGGC CGCCTATTGC GCCCAGTCGT CGGTAACCTG AATTGGCGCG CGCCACAATG GATGACGAAA ACCGCCAACG GGCTACACTG TGCCTTCAAC CGTTCAGAGC AATGGGTCGC TAAGCATCCC AAAGGGATCA GTGCGGCAAT AGTGCTGTTG ATGGCTGCCG CCAGCGCCGC CTTCTATGGC TATCACTGGT ATCTTAACCG GCCCCAACCG ATTGAACCGG CACCTATGGT TTATCAGGAA ACCAGTATCA GAGTCTCGGC GCCAAGAACG GTTAACTACC AGGCGCAAAA ACCAGAAGCC CAGCCGCTTA GTCTTAATTT TATGCATTCA GCCGCACCTA TCACCGCGAT GGGTCAGGTC GTCGATCAGG GCATCTCATT AACCCCCGCG ATAGAGGGTG AATGGAAATG GGCAACTGAA CGCACGTTGG TGTTTACTCC CAAAAAAGCC TGGCCGATGG GCGCTAACTA CCAAATTACG ATCGATACAG AAAAACTTTT AGCACCACAG ATTAAGCTCA ATCAGACCGA ACTTAATTTC ACAACGCCAG CTTTTGCTTA TCAATTGGAA AAAGCGGAAT ATTATCAGGA TCCGCAAGAG GCCCAAAAGC GCAGCACTAT TTTCCATGTG CAATTTAATG CCCCAGTTGA TGTTGCCAGC TTTGAAAAAC AGATCCTCTT GGGATTGGTC GAAGGTAAAT CCAAGTCAGA GAAGAAACTT AATTTCTCCG TCGTTTATGA TGAGAAAAAG CTTAATGCCT GGATACATTC GCAACCCTTG ATGCCAATGG ATAAAGGCGG TTCGGTCCAT CTATCGATTA ATAAAGGGGT GAATGCCAGT GTCGCCGCCA CGCCTACGAC ACAGGCACAG AATAAATGGG TATCCGTCCC TAACCTATAT AGCCTGGCGG TTAATAGTAT CAATGCCACG TTGGTCGAGT CAGATAACAA TAATGGTGAG CGGGCCTTAA TTATTGCTAT CAGCGACGCG GTTAAAGATA AAGAGATCAA AAATGCGGTC AAAGCCTGGT TACTGCCGCA ACATAATTTT CAAGCGAAAG AGAGCGCCAA AACATCAACC GATTTCTATC CTTGGGATAT GGATGATATT GACGATAATC TGCTGCAACA ATCAACGCCG CTGGCGCTGA CCCTCAATGA GGCCGAGCAA GAGTATCAGC CAATATTCAG CTTTAAGTTT GATGCCCCTT CCTATCGCAC ACTGCTGATC GAGGTTAACA ATAGCCTGAC ATCGGTGGGC GGTTATAAAA TGCCGGAAAA AATCTACCAA ATAGTCAGGG TTCCCGATTA CCCTAAGACG CTGCGCTTTA TGTCACAAGG CTCGTTATTA TCGATGCAGG GTGATAAGCA GATCAGCGTC GCCGCCCGTA ATATGACTGG CATGAAACTG GATATTAAGC GGGTTATTCC TAGCCAGTTA CAACATATTG TGTCATTTAA AAGCAGCGAA TATTCATCAG CTCACTTTAA CCGCCTGAGT GATGAATATT TTACTGAACA CTTCCAGTAC CAAACCGCGC TGAATAATGA CAACCCCGGC GAGATCAATT ATCAAGGGGT CGATCTGTCC CGTTATCTTG CAAATAATCC GAGTGCTCGG CGTGGGGTGT TCTTACTCAC CCTGTCAGCT TGGGATCCGG AGAAAAGGGA TAATCAGCAA CACAGCGAGG AAGACTACGA CGAAGACCAG 
GAATGGGTCG GCGATTCACG CTTTGTGGTG ATCACGGACT TAGGCATTAT CACCAAGCAA TCGCAGGATA GATCCCGTGA TGTGTTTGTG CAATCCATTC ACTCGGGTCT GCCCGCCGCC GATGCTAAAG TCTCTGTGGT GGCAAAAAAT GGTGTGGTCT TACTGAGCCA AATCACCGAT AGCAAAGGGC ATGTCCATTT TCCTGCGCTG GACGCCTTTA AAAATGAACG CCAACCGGTC ATGTTCCTGG TGGAAAAAGA AGGGGATGTC TCCTTCCTGC CCACCCGAGC CACCTATGAC CGTAACCTTG ATTTCTCACG TTTTGATATT GATGGCGAAG AGACCCCGTC CGACCCACGT ACTCTAAGCA GCTATCTGTT TTCTGACCGG GGAGTTTATC GCCCAGGCGA CCGCTTCAAT ATTGGTCTGA TCACCCGGAC CGCCAACTGG GCTACCGCAC TCGATGGCGT CCCCCTGCGG GCGGAGATCC GTGACCCACG AGATACCTTG ATGAGTACCC TGCCGATAAC CTTGGACAGC AGTGGTTTCA ATGAGCTCAG CTATACGACC GGTGAAAACT CACCTACCGG TGAATGGAAC GTCTATCTCT ATCTGGTTGG TAAGAATAAT GAAACGTCGA TGTTGCTGGG GCACACCACC GTAAATGTTA AAGAGTTCGA GCCTGATCGC TTAAAAGTGC AACTGCAACT GACGCCAGAG CGTCAACAAG GCTGGGTTAA ACCGCAGGAG CTGCAAGCCA ATATCAATGT ACAAAATCTA TTCGGTACAC CAGCACAGGA GCGCCGTGTC ACCTCTAGAC TGATCTTGCG GCCAATGTAC CCGAGTTTTG CCCCGTTCCC TGATTACCTG TTCTATGAGA ATCGCCATAA CAGCGATGGT TTTGAGACCG AACTGGAAGA GCAAACGACC GATCTACAGG GGATGGCGAC CATTCCATTG GATCTGAAAT CCTATGCTGA CGCCACCTAT CAACTGCAAT TGCTGTCGGA AGCCTTTGAA GCGGGTGGAG GCCGCTCTGT GGCCGCGACT GCGCGGGTTC TGGTCTCACC TTACGACTCT CTGGTTGGGG TGAAAGCCGA TGGCGATCTG AGTTATATCA ACCGTGATGC CGTGCGTAAG CTGAATATTA TTGCCGTTGA CCCGAGCCTG AATAAAATTG CGCTGCCAGA CTTGAGTCTG TCATTGATTG AGCAGAAGTA TATTTCAGTG CTAACCAAAC AGGATTCAGG CGTTTATAAA TATCAATCAC GGCTAAAGGA GCAGTTGGTC TCAGAGCAAC CGCTACAAAT CAGCCCGACA GGGACGGATT TCACCCTGGT GACCCAGCAG CCTGGTGATT TTATTCTGGT GGTTAAGGAC AGTCAGGGGC AGGTTCTGAA CCGTATTAGT TATACGGTGG CGGGTAACGC AAACCTGACC CGCTCACTGG ATCGCAACAC CGAATTAAAG CTAAAACTGA ATCAGGCCGA ATATCTGCAA GGCGAAGAAA TTGAGATTGC GATTAATGCA CCTTATGCCG GTAGCGGTCT GATCACGATA GAAAAAGATA AAGTGTATAG CTGGCAGTGG TTCCACAGTG ATACCACCAG CTCTGTGCAG AGAATCCGCA TCCCACCGGC AATGGAAGGC AATGGCTATA TCAACGTACA ATTCGTGCGT GATGTGAATT CCGATGAGAT CTTTATGAGC CCACTGAGTT ACGGTGTGAT GCCATTTAAG ATCAGTACCA AAGCGCGTCA GGCGGCTATC GAGTTAGCGT CGCCGTCAGT CATTAAACCG GGTGAAGTGT 
TACCGATTAA AGTGACCACC GATTCACCAC AGCGCGTGGT GGTGTTTGCC GTCGATGAAG GTATTTTGCA GGTGGCACGC TATCGCCTGA AAGATCCACT GGATTACTTC TTCCGTAAAC GTGAACTGAG TGTACAGAGT GCACAAATTC TCGATTTGAT CCTGCCGGAA TTCAGCAAGC TGATGGCACT GACCTCCGCA CCTGGAGGCG ACGCCGGGGA AGGGCTGGAT CTGCACCTCA ATCCGTTTAA ACGCAAACAA GACAAGCCGG TGGCTTATTG GTCTGGTATC ACCGAAGTGA ATGGTGAAAC CACCTTCAAT TACCCGATTC CCGACTATTT CAATGGTAAA ATTCGCGTGA TGGCCATCTC TGCGACCCCT GATCGCATTG GTAAAGTCCA GACCTCGACC ACCGTGCGGG ATAACTTTAT TCTGACGCCG AATGTCCCCG CGATGGTAGC ACCGGGAGAT GAATTTGATG TCACCGTGGG TGTGAGTAAC AACCTGCAAG GATTGAAGGG TAAAGCGGTT GATATCACCG TGCGTCTGAC ACCACCGCCA CAACTGGAAG TGGTGGGTGA AGCGCAACAC AGCCTGTCGC TGGCAGAAAA ACGTGAAACG CTTGTCAGCT TCCGCCTACG CGCCCGTTCA GCATTGGGTG ATGCTCCACT GGTGTTTGAT GCCAGCTATG GCTCTCAATC CAGCCGCCGG ACGGTCAGTA CCTCGGTACG CCCGGCGATG CCATTCCGAA CGCAATCGGT GATGGGCCGG ATGGAGGGTA ACAAGCATAC TGTGACCAAT CTGCGCCAGA TGTTTGATAA TTATGCTCAA CGTCAGGCGA CCGCTTCCCA CTCACCGTTG GTCTTAACCC AAGGTCTGGC GCGGTACCTG GCTGATTACC CGTACTACAG TTCTGAGCAA ATTGTCAGCC GCTCGATTCC GTTGATTATG CAAAGCAAAC ATCCTGAAAT GGACAGTGCC CTCAATCAGA ATGAGGTCCG TGATCAACTG AAAAACATGC TACGTATCCT GAGCTCTCGG CAGAATAGCA CTGGTGCAAT CGGTTTGTGG CACGCCTCCC CTACCCCTGA TCCGTTTGTC ACACCTTATG TCGTGCAATT TCTGCTGGAA GCGAAATCTG CCGGTTACAG CTTGCCGAAT GACATCTTGG AGGGGGCCAA CAACGCACTG CGTCTGTTAG CGGTTCGACC TTATGATGAC CTTTACTCTC TGCGTTTGCG GGCCTTTGCT GTTTACCTGT TGACCTTGCA GGGGGAGATC ACCACCAATA CTCTGGCATC GGTGCAAAGT ACGTTACAGC AACTTTATCC TGACAGTTGG CAGACTGATC TGAGTGCCAT TTATCTGGCC TCATCATACC GTCTGCTCAA AATGGATGAC GAAGCCAATA AACTGCTGCA ACCCACCTGG AAACAACTGG GTAAAGCCTA CAGCAAGGCC TGGTGGACGC AGAATTATTT TGATCCACTG GTGCAAGATG CAACCCGGTT GTATCTGATC ACTCGCCATT TCCCAGAGAA AGTCTCTTCT ATTCCGCCAC AAGCACTGGA AAATATGGTG CTGGCACTGA GGGATGAGCA TTACACGACC TATTCATCCG CGATGAGCAT TCTGGCACTG GAAAGTTACA CCAGCCAGGT AGCCGCCCAG CAAGATACGC CAGAAACCCT GCAAATCATC GAGATCAGTA AAAGCAAAGG GATCGACCCT AACGTTATCT CAACGCTGAA CGGCCTGTTC GTTCAAGGTG ATTTTACCGG TGAGGCTAAA GCGATTCAGT TTAACAACTA 
TGCCTCGGCA CCCGCTTGGT ATGTGGTCAA TCAATCAGGC TATGACCTTC AGCCACCAAA AGACGCCATC TCTAATGGGC TGGAAATCAG CCGCAGCTAC ACCGATGAGC AGGGTAAGCC GGTGACCCAA GTCACCTTAG GGCAGAAAGT TAACGTGCAC CTAAAAATCC GGGCTAACGC TAAACAAGGT CAAAATAATC TGGCGATTGT CGATCTACTG CCGGGCGGTT TTGAAGTGGT ACAACAAACG GCACCTGAAC CAGAGTTTTA TGATAATCAG GATGATCAGG ATGAGGAAAC TGGCAGCGGC TGGCAGTCGC CGCTAATGGT ATCTGGCTCC AGTTGGTACC CTGACTACAG TGATATTCGT GAAGATCGCG TGATCATTTA TGGCAGTGCC AGTACCGACG TTAAAGAGTT TATCTACCAA ATCAAATCAA CCAATACGGG TCGCTTTGTG GTGCCACCGG CTTACGGCGA AGCCATGTAT GATCGTAATG TACAGGCGCT GTCGGTCGGT AAAGGGCATA TCCTTGTCGT TCCACCTGAG GCAAAATAG
|
Protein sequence | MDLLRFLLIS PFALIKGLYR LSAYLLRLVG RLLRPVVGNL NWRAPQWMTK TANGLHCAFN RSEQWVAKHP KGISAAIVLL MAAASAAFYG YHWYLNRPQP IEPAPMVYQE TSIRVSAPRT VNYQAQKPEA QPLSLNFMHS AAPITAMGQV VDQGISLTPA IEGEWKWATE RTLVFTPKKA WPMGANYQIT IDTEKLLAPQ IKLNQTELNF TTPAFAYQLE KAEYYQDPQE AQKRSTIFHV QFNAPVDVAS FEKQILLGLV EGKSKSEKKL NFSVVYDEKK LNAWIHSQPL MPMDKGGSVH LSINKGVNAS VAATPTTQAQ NKWVSVPNLY SLAVNSINAT LVESDNNNGE RALIIAISDA VKDKEIKNAV KAWLLPQHNF QAKESAKTST DFYPWDMDDI DDNLLQQSTP LALTLNEAEQ EYQPIFSFKF DAPSYRTLLI EVNNSLTSVG GYKMPEKIYQ IVRVPDYPKT LRFMSQGSLL SMQGDKQISV AARNMTGMKL DIKRVIPSQL QHIVSFKSSE YSSAHFNRLS DEYFTEHFQY QTALNNDNPG EINYQGVDLS RYLANNPSAR RGVFLLTLSA WDPEKRDNQQ HSEEDYDEDQ EWVGDSRFVV ITDLGIITKQ SQDRSRDVFV QSIHSGLPAA DAKVSVVAKN GVVLLSQITD SKGHVHFPAL DAFKNERQPV MFLVEKEGDV SFLPTRATYD RNLDFSRFDI DGEETPSDPR TLSSYLFSDR GVYRPGDRFN IGLITRTANW ATALDGVPLR AEIRDPRDTL MSTLPITLDS SGFNELSYTT GENSPTGEWN VYLYLVGKNN ETSMLLGHTT VNVKEFEPDR LKVQLQLTPE RQQGWVKPQE LQANINVQNL FGTPAQERRV TSRLILRPMY PSFAPFPDYL FYENRHNSDG FETELEEQTT DLQGMATIPL DLKSYADATY QLQLLSEAFE AGGGRSVAAT ARVLVSPYDS LVGVKADGDL SYINRDAVRK LNIIAVDPSL NKIALPDLSL SLIEQKYISV LTKQDSGVYK YQSRLKEQLV SEQPLQISPT GTDFTLVTQQ PGDFILVVKD SQGQVLNRIS YTVAGNANLT RSLDRNTELK LKLNQAEYLQ GEEIEIAINA PYAGSGLITI EKDKVYSWQW FHSDTTSSVQ RIRIPPAMEG NGYINVQFVR DVNSDEIFMS PLSYGVMPFK ISTKARQAAI ELASPSVIKP GEVLPIKVTT DSPQRVVVFA VDEGILQVAR YRLKDPLDYF FRKRELSVQS AQILDLILPE FSKLMALTSA PGGDAGEGLD LHLNPFKRKQ DKPVAYWSGI TEVNGETTFN YPIPDYFNGK IRVMAISATP DRIGKVQTST TVRDNFILTP NVPAMVAPGD EFDVTVGVSN NLQGLKGKAV DITVRLTPPP QLEVVGEAQH SLSLAEKRET LVSFRLRARS ALGDAPLVFD ASYGSQSSRR TVSTSVRPAM PFRTQSVMGR MEGNKHTVTN LRQMFDNYAQ RQATASHSPL VLTQGLARYL ADYPYYSSEQ IVSRSIPLIM QSKHPEMDSA LNQNEVRDQL KNMLRILSSR QNSTGAIGLW HASPTPDPFV TPYVVQFLLE AKSAGYSLPN DILEGANNAL RLLAVRPYDD LYSLRLRAFA VYLLTLQGEI TTNTLASVQS TLQQLYPDSW QTDLSAIYLA SSYRLLKMDD EANKLLQPTW KQLGKAYSKA WWTQNYFDPL VQDATRLYLI TRHFPEKVSS IPPQALENMV LALRDEHYTT YSSAMSILAL ESYTSQVAAQ QDTPETLQII EISKSKGIDP NVISTLNGLF VQGDFTGEAK 
AIQFNNYASA PAWYVVNQSG YDLQPPKDAI SNGLEISRSY TDEQGKPVTQ VTLGQKVNVH LKIRANAKQG QNNLAIVDLL PGGFEVVQQT APEPEFYDNQ DDQDEETGSG WQSPLMVSGS SWYPDYSDIR EDRVIIYGSA STDVKEFIYQ IKSTNTGRFV VPPAYGEAMY DRNVQALSVG KGHILVVPPE AK
|
| |