Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A4077 |
Symbol | |
ID | 5802556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 4341114 |
End bp | 4344326 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641341858 |
Product | putative autotransporter protein |
Protein accession | YP_001608364 |
Protein GI | 162419377 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.00997828 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAAACAA ACCGCTCTAC GCTTTCCCCG TGCTTTCGTA AAACAATGAT AGCCAGTTTG TTGGTGCCTC TTTGCAGCCC CCTGTATAGC TGGGCGGTAC AAACGGCCAG CATAACAGAT GGCAGCACGA TGGTTATCTC TGGGGGTTAT GACACTGAGG CTAATAACCA CTCGGCGGTA TTTGTGCAAG GTTCCGGTAG CACCATTAAT GGTGGCTCCG ATGTCGTTAT TGAAACCACG GGGGTTGGTG CAATTGGTGC CTATGCCTCT GAAGGTGGGT CGTTGGGGCT GACGGGTTCG ACTATCAATA CCGAGAATGG TGTGGCTTTT GGTGTCTTAA ATGACAAAGG TACGGTGAAT TTACAAGGTG GTACGATTAC CACGAAAGGT CAGACGGCAT ATGGCGTGTA TTCCTCTGGT TTGGGCAGTA ATACCGATAT TCACAGTTCG GCGATCACGA CCAGCTACTC GTTAACCCAC GCTATTTATG GTGCCGGAGG GACGGGATTG ACATTGAACA ATACCACCCT CAATACCAGT GGCAGTGGTA GCTATGGCAT TTATCTGGAT GGTTCCGGAG GGAGCTTAAC GGGCGCGAAT AACACCATTA ATAGTACTCA TGCGACCAAG GGTGCGGGTA TCTATATTTC AGCGGGTGGC TCAAATGCGA CTTTAGATAA CACCACACTG AATATCACTA AAGGTGCTGT TGGTGTGAAT GTGGGGGAGG GTTCATCTAT TACGATGGAT GGCCTTATTG CCACCGGTAA TATCACCAAC CTATTTAAAG TGAATGGGAA TGCCTCGGTC AGTAATGCCA ATATCGAATT AGCCGCGGGT GGCGTATTAA TGGCACAGGG CAGCAGTGCG TCCAATCAAG CGGTCATCAT ATTAAATAAT GTCAATGTTA TTTCTAACAG CGGCAGCACG ACACTGGTTG ATGTTAATAA GGACGCTGAC GTCACCATTA ATGGGGGGGC TTACCACTCA AAAGGTAACA ATGCGAAGGG AATCTGGGTT CGAGATAATA ACTCATCGCT GAATGTCGAT AACGCCGTGA TTATCACCGA GGGCGTGAAT GCAACGGCGA TTGAAAATCG TGGCACCGCT ATCGTAAAAA ATACCACGGT GATAACTAAA GGGAATAACT CTCACGGCCT CTACTCTGAG CAAAGCCTTG ATGCCACCAA TATGGCAATT TCCACTGCGG GGATTGGCAG TATTGGGGTG GCGGCAGCTA AAGGCGGTAA CCTAAATCTG AATGATGCAT TCATCGAGAC GACGGGTAAT TCAGGTATGG TGCTGGGTAC TTTTGCCGAC TCATCCATCA GCGCTAAAAA TATTACAGGG ATCTCGACCG GCGCTGGTGC TTATGCCTTG TGGGTAGATG ATGGTAGCTC AATTCTGCTG GAAGAGAGCC AAATTACCAC TCAAGGCCAG GGCGCGGGAG GGATTTATGC CTCAAATACC GGGACCGGCT CTCACACCGC TTACACTCAG GTTACGCTGA ACAACTCACA GATTCATAGT GAGCAGGGGC CGGGTATCTG GGCTAATGGT GCTGACATTA ATGTTGATGT GAAGAATGGT TCGCAGTTAA CGGGAGGCAA TGGGTTATTG GTCTATGCCT CAAGTAATGC AGGGGCCGCC AGCAATGTCA ATGTGAATGG CGATAACCAC GCCGTTCTGT TGGGTGATAT TCACGCCGCA GAAAACAGCA ATATTAACCT GGCACTGAAT AATAATTCCG TTTGGACGGG GGCGGCGACT AACGCCAAAC AGGTTGATAT CGACAGCAGC AGTATCTGGA ATTTAACGGG TGATGCAGAT GTTGAGTCAA TGCATGTATT GGGCCAGATG AACTTTATCT CAAATAGCAG TGACACTAAT TCACGGGCTC CCTACGATAA TTTCAGCACC TTAACGATCA ACAGTAATGT CACCGGGAGT GGCAGTTTTA CCTTTAATGT GCAATTGGGT GATAACGACT CACCAGTGGA TAGACTCTAT GTGATTGGTA ATGCTTCTGG TGACCATGGG GTTCAGGTTA TTAACCAAGG CGGTTTGGGG GCATTGACCA CGGGCGACGG GATTAACCTG ATTACCGTTG ATGGGGAGAC CCATTCCGGC TCATTTACTA TGAGTAACTC GGTGAGCGCA GGGGCCTATG AATATTTCTT GTATAAGATA GATGACTACC GTTGGAACCT GCAATCTAAT CTCATCAATC CCGGTCCTGG TCCTGAACCA GAAATTGAAC CAGAAGAGAT AGCTTACCGC CCTGAAGTTC CTGGCTATAT TGCCGCACCT TGGTTAAATG CATTTTATGG TTTTACTACT TTGGGTAGCT TGCACGAACG CCGTGGCTCG GCCGAGGGAG CAGCCGAAGG GTTTAATCAA GACTCATGGG GCCGGATCCG TGGGCAGCAT AATAATTTTG ACGCGGGCCG TTTTAGCTAC GATTCAAATA TCTGGTTTAT GCAATTGGGT CATGATGTCT ATCAGGCCAA AAATGCCGCA GGCACTCAAG TGACTGGCGG TATGATGATC ACCCTAGGTA AGCAGAATAG CGATACACGG GATCGGGCGC GGGCGATAAA TCCGGATTTG TCGATCGATA CCGGCAAGAT CAAAACCGAG GCTTATGGGT TTGGGGGCTA TTACACCCTG ATGACCGAGG AAGGCGGTTA CCTTGATATC GTTAGCCAGG CGACGCTATA CCGCAACAAC TATGAGAGCC AACATAATAC CAAACATAAT GGCTACGGTG TTGTGATGTC TGCCGAAGTG GGTCAGCCGT ATCCACTGGC TGCTGGCTGG GTAGTGGAGC CTCAGGGGCA GCTAAAATAT CAATACCTGC ACCTGAGTCC GAAGAATTTC AACGATAGCA TTTCAGAGAT CGGGGGGACG GATTACTCTG TTGGTCAGGT ACGTGGCGGG CTGCGTCTGT TCAGTGACGC GAGTGAGAAG CGGGACATTA AGCCTTATTT GACCACCGAT GTGCTTCACC AGTTAGGCCG AAACCCACAG GTGACGGTAG CGACGGTGGA TATCCGTCCT GACTTCACAA AAACCTTCTG GCAGGGGGGC GCAGGGGTGA CCGCCAAAGT GAATAGTCAG GTTGATCTCT ATGCTGATGC GAAATACCAA AAATCCTTTG ATGGCAAATT AGATGGCTAC TTAGGTAATT TGGGCGTGAA AGTCAGTTTC TGA
|
Protein sequence | MKTNRSTLSP CFRKTMIASL LVPLCSPLYS WAVQTASITD GSTMVISGGY DTEANNHSAV FVQGSGSTIN GGSDVVIETT GVGAIGAYAS EGGSLGLTGS TINTENGVAF GVLNDKGTVN LQGGTITTKG QTAYGVYSSG LGSNTDIHSS AITTSYSLTH AIYGAGGTGL TLNNTTLNTS GSGSYGIYLD GSGGSLTGAN NTINSTHATK GAGIYISAGG SNATLDNTTL NITKGAVGVN VGEGSSITMD GLIATGNITN LFKVNGNASV SNANIELAAG GVLMAQGSSA SNQAVIILNN VNVISNSGST TLVDVNKDAD VTINGGAYHS KGNNAKGIWV RDNNSSLNVD NAVIITEGVN ATAIENRGTA IVKNTTVITK GNNSHGLYSE QSLDATNMAI STAGIGSIGV AAAKGGNLNL NDAFIETTGN SGMVLGTFAD SSISAKNITG ISTGAGAYAL WVDDGSSILL EESQITTQGQ GAGGIYASNT GTGSHTAYTQ VTLNNSQIHS EQGPGIWANG ADINVDVKNG SQLTGGNGLL VYASSNAGAA SNVNVNGDNH AVLLGDIHAA ENSNINLALN NNSVWTGAAT NAKQVDIDSS SIWNLTGDAD VESMHVLGQM NFISNSSDTN SRAPYDNFST LTINSNVTGS GSFTFNVQLG DNDSPVDRLY VIGNASGDHG VQVINQGGLG ALTTGDGINL ITVDGETHSG SFTMSNSVSA GAYEYFLYKI DDYRWNLQSN LINPGPGPEP EIEPEEIAYR PEVPGYIAAP WLNAFYGFTT LGSLHERRGS AEGAAEGFNQ DSWGRIRGQH NNFDAGRFSY DSNIWFMQLG HDVYQAKNAA GTQVTGGMMI TLGKQNSDTR DRARAINPDL SIDTGKIKTE AYGFGGYYTL MTEEGGYLDI VSQATLYRNN YESQHNTKHN GYGVVMSAEV GQPYPLAAGW VVEPQGQLKY QYLHLSPKNF NDSISEIGGT DYSVGQVRGG LRLFSDASEK RDIKPYLTTD VLHQLGRNPQ VTVATVDIRP DFTKTFWQGG AGVTAKVNSQ VDLYADAKYQ KSFDGKLDGY LGNLGVKVSF
|
| |