Gene YpAngola_A4077 details

Gene Information

Locus tag: YpAngola_A4077
Symbol:
ID: 5802556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4341114
End bp: 4344326
Gene Length: 3213 bp
Protein Length: 1070 aa
Translation table: 11
GC content: 49%
IMG OID: 641341858
Product: putative autotransporter protein
Protein accession: YP_001608364
Protein GI: 162419377
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
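
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Cross-check of the lengths reported in the Gene Information section.
# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
start_bp = 4341114
end_bp = 4344326

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon, which does not contribute an amino acid.
assert gene_length % 3 == 0
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (3213 bp) and Protein Length (1070 aa).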


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.00997828
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAAACAA ACCGCTCTAC GCTTTCCCCG TGCTTTCGTA AAACAATGAT AGCCAGTTTG 
TTGGTGCCTC TTTGCAGCCC CCTGTATAGC TGGGCGGTAC AAACGGCCAG CATAACAGAT
GGCAGCACGA TGGTTATCTC TGGGGGTTAT GACACTGAGG CTAATAACCA CTCGGCGGTA
TTTGTGCAAG GTTCCGGTAG CACCATTAAT GGTGGCTCCG ATGTCGTTAT TGAAACCACG
GGGGTTGGTG CAATTGGTGC CTATGCCTCT GAAGGTGGGT CGTTGGGGCT GACGGGTTCG
ACTATCAATA CCGAGAATGG TGTGGCTTTT GGTGTCTTAA ATGACAAAGG TACGGTGAAT
TTACAAGGTG GTACGATTAC CACGAAAGGT CAGACGGCAT ATGGCGTGTA TTCCTCTGGT
TTGGGCAGTA ATACCGATAT TCACAGTTCG GCGATCACGA CCAGCTACTC GTTAACCCAC
GCTATTTATG GTGCCGGAGG GACGGGATTG ACATTGAACA ATACCACCCT CAATACCAGT
GGCAGTGGTA GCTATGGCAT TTATCTGGAT GGTTCCGGAG GGAGCTTAAC GGGCGCGAAT
AACACCATTA ATAGTACTCA TGCGACCAAG GGTGCGGGTA TCTATATTTC AGCGGGTGGC
TCAAATGCGA CTTTAGATAA CACCACACTG AATATCACTA AAGGTGCTGT TGGTGTGAAT
GTGGGGGAGG GTTCATCTAT TACGATGGAT GGCCTTATTG CCACCGGTAA TATCACCAAC
CTATTTAAAG TGAATGGGAA TGCCTCGGTC AGTAATGCCA ATATCGAATT AGCCGCGGGT
GGCGTATTAA TGGCACAGGG CAGCAGTGCG TCCAATCAAG CGGTCATCAT ATTAAATAAT
GTCAATGTTA TTTCTAACAG CGGCAGCACG ACACTGGTTG ATGTTAATAA GGACGCTGAC
GTCACCATTA ATGGGGGGGC TTACCACTCA AAAGGTAACA ATGCGAAGGG AATCTGGGTT
CGAGATAATA ACTCATCGCT GAATGTCGAT AACGCCGTGA TTATCACCGA GGGCGTGAAT
GCAACGGCGA TTGAAAATCG TGGCACCGCT ATCGTAAAAA ATACCACGGT GATAACTAAA
GGGAATAACT CTCACGGCCT CTACTCTGAG CAAAGCCTTG ATGCCACCAA TATGGCAATT
TCCACTGCGG GGATTGGCAG TATTGGGGTG GCGGCAGCTA AAGGCGGTAA CCTAAATCTG
AATGATGCAT TCATCGAGAC GACGGGTAAT TCAGGTATGG TGCTGGGTAC TTTTGCCGAC
TCATCCATCA GCGCTAAAAA TATTACAGGG ATCTCGACCG GCGCTGGTGC TTATGCCTTG
TGGGTAGATG ATGGTAGCTC AATTCTGCTG GAAGAGAGCC AAATTACCAC TCAAGGCCAG
GGCGCGGGAG GGATTTATGC CTCAAATACC GGGACCGGCT CTCACACCGC TTACACTCAG
GTTACGCTGA ACAACTCACA GATTCATAGT GAGCAGGGGC CGGGTATCTG GGCTAATGGT
GCTGACATTA ATGTTGATGT GAAGAATGGT TCGCAGTTAA CGGGAGGCAA TGGGTTATTG
GTCTATGCCT CAAGTAATGC AGGGGCCGCC AGCAATGTCA ATGTGAATGG CGATAACCAC
GCCGTTCTGT TGGGTGATAT TCACGCCGCA GAAAACAGCA ATATTAACCT GGCACTGAAT
AATAATTCCG TTTGGACGGG GGCGGCGACT AACGCCAAAC AGGTTGATAT CGACAGCAGC
AGTATCTGGA ATTTAACGGG TGATGCAGAT GTTGAGTCAA TGCATGTATT GGGCCAGATG
AACTTTATCT CAAATAGCAG TGACACTAAT TCACGGGCTC CCTACGATAA TTTCAGCACC
TTAACGATCA ACAGTAATGT CACCGGGAGT GGCAGTTTTA CCTTTAATGT GCAATTGGGT
GATAACGACT CACCAGTGGA TAGACTCTAT GTGATTGGTA ATGCTTCTGG TGACCATGGG
GTTCAGGTTA TTAACCAAGG CGGTTTGGGG GCATTGACCA CGGGCGACGG GATTAACCTG
ATTACCGTTG ATGGGGAGAC CCATTCCGGC TCATTTACTA TGAGTAACTC GGTGAGCGCA
GGGGCCTATG AATATTTCTT GTATAAGATA GATGACTACC GTTGGAACCT GCAATCTAAT
CTCATCAATC CCGGTCCTGG TCCTGAACCA GAAATTGAAC CAGAAGAGAT AGCTTACCGC
CCTGAAGTTC CTGGCTATAT TGCCGCACCT TGGTTAAATG CATTTTATGG TTTTACTACT
TTGGGTAGCT TGCACGAACG CCGTGGCTCG GCCGAGGGAG CAGCCGAAGG GTTTAATCAA
GACTCATGGG GCCGGATCCG TGGGCAGCAT AATAATTTTG ACGCGGGCCG TTTTAGCTAC
GATTCAAATA TCTGGTTTAT GCAATTGGGT CATGATGTCT ATCAGGCCAA AAATGCCGCA
GGCACTCAAG TGACTGGCGG TATGATGATC ACCCTAGGTA AGCAGAATAG CGATACACGG
GATCGGGCGC GGGCGATAAA TCCGGATTTG TCGATCGATA CCGGCAAGAT CAAAACCGAG
GCTTATGGGT TTGGGGGCTA TTACACCCTG ATGACCGAGG AAGGCGGTTA CCTTGATATC
GTTAGCCAGG CGACGCTATA CCGCAACAAC TATGAGAGCC AACATAATAC CAAACATAAT
GGCTACGGTG TTGTGATGTC TGCCGAAGTG GGTCAGCCGT ATCCACTGGC TGCTGGCTGG
GTAGTGGAGC CTCAGGGGCA GCTAAAATAT CAATACCTGC ACCTGAGTCC GAAGAATTTC
AACGATAGCA TTTCAGAGAT CGGGGGGACG GATTACTCTG TTGGTCAGGT ACGTGGCGGG
CTGCGTCTGT TCAGTGACGC GAGTGAGAAG CGGGACATTA AGCCTTATTT GACCACCGAT
GTGCTTCACC AGTTAGGCCG AAACCCACAG GTGACGGTAG CGACGGTGGA TATCCGTCCT
GACTTCACAA AAACCTTCTG GCAGGGGGGC GCAGGGGTGA CCGCCAAAGT GAATAGTCAG
GTTGATCTCT ATGCTGATGC GAAATACCAA AAATCCTTTG ATGGCAAATT AGATGGCTAC
TTAGGTAATT TGGGCGTGAA AGTCAGTTTC TGA
 
Protein sequence
MKTNRSTLSP CFRKTMIASL LVPLCSPLYS WAVQTASITD GSTMVISGGY DTEANNHSAV 
FVQGSGSTIN GGSDVVIETT GVGAIGAYAS EGGSLGLTGS TINTENGVAF GVLNDKGTVN
LQGGTITTKG QTAYGVYSSG LGSNTDIHSS AITTSYSLTH AIYGAGGTGL TLNNTTLNTS
GSGSYGIYLD GSGGSLTGAN NTINSTHATK GAGIYISAGG SNATLDNTTL NITKGAVGVN
VGEGSSITMD GLIATGNITN LFKVNGNASV SNANIELAAG GVLMAQGSSA SNQAVIILNN
VNVISNSGST TLVDVNKDAD VTINGGAYHS KGNNAKGIWV RDNNSSLNVD NAVIITEGVN
ATAIENRGTA IVKNTTVITK GNNSHGLYSE QSLDATNMAI STAGIGSIGV AAAKGGNLNL
NDAFIETTGN SGMVLGTFAD SSISAKNITG ISTGAGAYAL WVDDGSSILL EESQITTQGQ
GAGGIYASNT GTGSHTAYTQ VTLNNSQIHS EQGPGIWANG ADINVDVKNG SQLTGGNGLL
VYASSNAGAA SNVNVNGDNH AVLLGDIHAA ENSNINLALN NNSVWTGAAT NAKQVDIDSS
SIWNLTGDAD VESMHVLGQM NFISNSSDTN SRAPYDNFST LTINSNVTGS GSFTFNVQLG
DNDSPVDRLY VIGNASGDHG VQVINQGGLG ALTTGDGINL ITVDGETHSG SFTMSNSVSA
GAYEYFLYKI DDYRWNLQSN LINPGPGPEP EIEPEEIAYR PEVPGYIAAP WLNAFYGFTT
LGSLHERRGS AEGAAEGFNQ DSWGRIRGQH NNFDAGRFSY DSNIWFMQLG HDVYQAKNAA
GTQVTGGMMI TLGKQNSDTR DRARAINPDL SIDTGKIKTE AYGFGGYYTL MTEEGGYLDI
VSQATLYRNN YESQHNTKHN GYGVVMSAEV GQPYPLAAGW VVEPQGQLKY QYLHLSPKNF
NDSISEIGGT DYSVGQVRGG LRLFSDASEK RDIKPYLTTD VLHQLGRNPQ VTVATVDIRP
DFTKTFWQGG AGVTAKVNSQ VDLYADAKYQ KSFDGKLDGY LGNLGVKVSF
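
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the codon-to-amino-acid assignments of table 11 match the standard genetic code (the two tables differ in permitted start codons, not in these assignments). The names `CODON_TABLE` and `translate` are illustrative helpers, not part of any library, and only the first and last rows of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
# Each full row holds 60 bases, so every row starts in frame.
first_row = "ATGAAAACAA ACCGCTCTAC GCTTTCCCCG TGCTTTCGTA AAACAATGAT AGCCAGTTTG"
last_row = "TTAGGTAATT TGGGCGTGAA AGTCAGTTTC TGA"

print(translate(first_row))
print(translate(last_row))
```

The two printed fragments can be compared with the first 20 and last 10 residues of the protein sequence above.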