Gene YpAngola_A0425 details

Gene Information

Locus tag: YpAngola_A0425
Symbol:
ID: 5798888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 442064
End bp: 446389
Gene Length: 4326 bp
Protein Length: 1441 aa
Translation table: 11
GC content: 49%
IMG OID: 641338431
Product: pertactin family protein
Protein accession: YP_001605030
Protein GI: 162420670
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
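
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the record states neither convention explicitly.

```python
# Values copied from the gene record above.
start_bp = 442064
end_bp = 446389
gene_length_bp = 4326
protein_length_aa = 1441

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks hold for this record: the span is 4326 bp, which is 1442 codons, or 1441 amino acids plus a stop codon.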


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0113574
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACA GTAATACTCT TAATACGCGT CTGCTACCAC TGTCTATTTT AATTTCATCG 
TTGGTTTCTG GCGGGGCTAT GGCTGTGTCA CAAATAGCAA CCACTGATAC ACCCGCAGTA
ACACCAATAA AATCAACACT AACAGGTCCA TTTGAGCGGA ATTCGGCTGG TACAAGTTTT
GGATCAAATG TAGATGTAAT TGATAATACA TCTACGGCAA CCCGCGTGAT TGCTGAAACT
ACGCCGGAAG CCGAGAGCAC AATTGGTGAA GCGACTGGGC AAGAAGGCGG TAACGCAACA
GCCGTTATCC CCCCTACTAC AACACCATCA GAGCAGGAAA TCACAGAACC TGAACAACCG
GGCCTGCTTG ATAAGATCAA AGATCTGCTG GGGTTGGGTG AAATTACTCA AGAACAAGCC
GATGCATTAG AAAAAAACGT TAAGACTAAA GTTGAGAAAG TGGACGCACA GACTGCGGCG
AAGTTGGCAC TTGAATCAGC CCAAGCAGAG GCCCAAAAAG CAGCAGAAGA TGCTTTATAT
CTAAAAACCG AAAATGTTTC ATATCAGGCA TTCGCTCAAA CTGAAGAAAA GATTAAAAAA
GAAGCTGATG AAGCAAAAAA AAAGCAAGAT AAGACTAAAG AAGATGCCAT TAAGGCCGTC
AAGGTTAATA ACACTCCATT AGTTCCTGGT GATAAGGATA TTGCTGAGAA AGTCACCAAA
GCCGTGACCG ATACAACAAA AGTACAGGGA GAAAAAGCAG TTACTCTGGC TACAAAAATA
ACTGATGCCA AAGTCGCCCA AGAAAAGAAA GACGCGAATA CAGAGGCGCT TGCAGAAATC
GACGGGCGGC TCATTTCTGT CTCAAACGCT TTAATACAAG CGACAGGTAC CGATAAGGGT
CCGTTGGATC AGAAACTTAA AGAAGCTCAA CAAGCTAAAA CCGAACAAGA CGGAAAAGAA
CTGGCGTCCG GCGGGTATAA AGAACTCTTC GAGGAGGATA AGAAAACTAG CGGTTACTTT
GGCATTGCGG AAAATGACAA CGGCTCGGGA CAGCAAGAAA AATTAGCCGA AGCAAAAAAG
AATCGAGATG CCTATAACAA AGCAGCTAAA AAAGAACTTG ACGCTATCGC CAAGGCCCAA
AAAGCAGTTG AGGCTATTGA TGCACAGATA GTAAAATTAA AAAAAGATAA AGGTGATATC
GAACAAGAAC AAAGTACCGA AAAGGGCAAA ACAGGTGGTT TAGATATTGC GCTCAGCGGT
GCTAACGACG CTAAAGACGC AGCACAGGGT GAATTCGACA CGGCAAAAAA CGCAGCTGAA
TTAGCTGAAT TAGCTGAAAC AGCAGCAAAG GCTATCGAAG CAGCAAAAAT CACCGATAAG
GCAGTTGAGG ATGCAACAGC AGCTTATAAA GAAGCCGCAG ACAAAGCGGA ACAAACTAAA
ACAGCCCTTG AAGCGGCTGA AAAAGCTAAA GAAGACGCTG ATAAACTCGT AGTCACTAAC
ACTGGCCTAT TGAATGACGC TGACCAAGCA CTTGAGCAGT TAGTGACCGC CCAAAATAAC
GCCCAACCTA CACTTGATCT GCCAGCCATT GATGTGACCA TTGCGCCTGC CAAGACACAA
GATGTGATTG AGGGCACCAG CGCCATTGCC ACCCAAGTGG CCGGTGGCAC ACAAAATGTT
GCCAAGGGCG GTAAAGCGAT TGATAGCGTT ATTACCAAAG ACGGTATTGT AAACCTTGCT
GCTGGTGCCA ATGCCAAAGG GACCGAGGTT ACTAAAGGTA CCCTGAACAA CAACGGCGGG
GTTGATACCG ATACTGTTGT CAGTACTGAA GGTAAATTGG TTCTGACGGG TGGTAGCGAA
ACGGCCATCG CAACCTCAAC CGGTGCCAAG GTTGCTGAAG GTGGTGTAGT GACCGCAGGT
GACCATTCCG TTATCGAAAA AATGATCAGT AGCGGTAACG TGACCGCCAG CGGCAATAAT
ACCATCGTGC GTGATACGAC CATTAATGAC GGTAAATTAA GCCTGGCAGG CACCGCAACC
GCCAATAACA CCACGTTCAA CGGCGGTATT TTCAGCGTTG AAGGTGATAC CGCTGCCACC
AAGACTAACA TGACTGGCGG TAAATTTGCT GTTACAGGCA ATGCCACAAT TAAAGACACC
GTGCTCAGCG CCAGTGACTT CTCGCTGGCT GACAAAGTCA CCGCAAACAA CACCACCCTG
ACTGGCGGTA CCTTTACCGT TGCAGGTGAT ACCGCTGCCA CCAAGACTAA GATGACTGGT
GGTGAATTTG CTGTTACAGG CAATGCCAAG ATTGAAGACA CCGTACTCAA CGCAAGTGAC
TTCTCGCTGG CTGACAAAGC CACCGCGAAC AACACCACCC TGACTGACGG TACTTTCACC
GTTGCAGGTG ATGCCGCGGT CACCGCGACG AACATGAGTG GCGGTAAATT TGCGGTTAAA
GGCAAAGCCA AGATCAAAGA CACCCAACTC AGTGCAGGTA ATTTCACTCT GGCTGAAAAT
GCCACAGCGA ATGACACCAC ACTGAATGGC GGTAAATTTG ACGTTTCGAA CGAGGCTACA
GCGACTAACA CCACCATTAA TAACGGCCTG TTTACGCTGA AAGATGGCGC TCACGCGGAC
AGCACCACAG TCAATAGCGG CACCTTCGTC ATGGCCGATC AATCTACGGC CAACGGCATC
CAACTGGTAG ACAGCGCCTT CACACTCGCA AGCGGTGCTA AAGCCTCCGG TATCACCAAA
TTAACTGGCG GTCAGGCACA GGTAGCCGGT TCACTGGAAA GCTTGAGCCT TACCGGTGGC
CGCGCAGACT TTGCCAACAG CGCCAAAGCC TCTGGCCTGC TTGATATCAG CGCTGATAGC
CAGATCATAA TGAACCGCGG TGCAGATACC GCACAAGCGA ACCTGAACCT GGCTGGCCGC
CTTGAATTGC TCGCCAGTGA TGTTGCTCAA GCAGTGGCTC AGCCAGTTGC CCGTGCGGCC
ATGGAGTTAT CAAATGCGCG TGCGGTAATG CCAGCCCCTG CAATGCCAGT CCCTGCCGCC
GCACCGGTTG CACACTTCGC CCTCAACGAT GTGGTTATGA CCGGGGGCAC TGTCGATATG
AGCAACGCGA AAAATGCTCA ACTGACCATG GCTTCACTGA ATGGTACAGG GAACTTTAAC
CTCGGTTCTG TCATGCAAAG CGATTCGGTC GCGCCATTAA ATGTATCCGG TGACGCGAAC
GGTGACTTCA TCATTGCAAT GAATAGCAGC GGTCAAGCAC CAACTAACCT GAATGTGGTA
AATACCAACG GTGGTGATGC ACGCTTTGCC TTAGCCAATG GTCCGGTTGC TTTAGGTAAC
TACATGACTA ACCTGGCTAA AGATGCCAAC GGTAACTTTG TCCTGACCGC AGATAAATCG
GCTATGACAC CAGGCACTGC CGGTATTCTG GCCGTGGCTA ACACCACACC GGTTATCTTT
AACGCTGAGT TAAGTTCTAT TCAACAGCGT TTGGATAAGC AAAGCACCGA AACCAACCAA
AGCGGCATGT GGGGCAGCTA CCTGAACAAC AACTTTGCAG TGAAAGGCCG CGCCGCTAAC
TTCGATCAGA AGTTGAACGG GATGACATTG GGTGGCGATA AAGCCACTGC ACTGGCAGAC
GGCGTGTTGA GCGTTGGTGG TTTCGCCAGC TACAGCAGCT CTGATATCAA AACGGATTAT
CAAAGCAAAG GTAAAGTGGA TAGCCATTCA TTCGGTGCCT ACGCACAATA CCTGGCTAAC
AGCGGTTACT ACATGAACGC GGTAGTGAAG AATAACCAGT TTAGCCAAGA CGTTAACATC
ACCTCAATTA ACGGCAGCGC CAGCGGTGTG TCTAACTTCT CGGGTATGGG TATCGCACTG
AAAGCCGGTA AGCACTTCAA CTTCAATGAG GCTTACGTCT CGCCATACGT TGCAATGAGC
GCCTTTAGCT CGGGTAAGAG CAACATCTCC TTGTCTAACG GCATGGAAGC ACAGAGCAGC
AGCACCCGCT CTGCGATGGG TACCCTTGGG GTGAATGCAG GTTACCGCTT CGTGATGAAC
AACGGCGCAG AACTCAAGCC ATACGCTATC TTCGCGGTGG ATCATGAGTT CGCGAAAAAC
AACCAAGTGA CGGTGAATCA GGAAGTGTTT GACAATAACT TGAGCGGGAC CCGTGTGAAC
ACCGGCGCCG GCATGAACGT CAACATCACC CCTAATCTGT CTGTCGGTTC TGAAGTGAAG
TTGTCCAGCG GTAAAGATAT CAAGACACCA GTAACCATTA ATCTGAACGT GGGTTACAGC
TTCTAA
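
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to compute GC content from text in that layout. The function name `gc_percent` is hypothetical, and the definition used here (G plus C as a percentage of all bases) is an assumption about how the record's GC content figure was derived.

```python
def gc_percent(seq_text):
    """Return the percentage of G and C bases in a sequence.

    Whitespace (spaces and line breaks between the 10-base blocks)
    is ignored. Assumes the text contains only nucleotide letters.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example using the final line of the gene sequence only.
last_line = "TTCTAA"
print(round(gc_percent(last_line), 1))
```

Pasting the full gene sequence into `gc_percent` instead of the single line gives a value to compare with the GC content reported in the record.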
 
Protein sequence
MKNSNTLNTR LLPLSILISS LVSGGAMAVS QIATTDTPAV TPIKSTLTGP FERNSAGTSF 
GSNVDVIDNT STATRVIAET TPEAESTIGE ATGQEGGNAT AVIPPTTTPS EQEITEPEQP
GLLDKIKDLL GLGEITQEQA DALEKNVKTK VEKVDAQTAA KLALESAQAE AQKAAEDALY
LKTENVSYQA FAQTEEKIKK EADEAKKKQD KTKEDAIKAV KVNNTPLVPG DKDIAEKVTK
AVTDTTKVQG EKAVTLATKI TDAKVAQEKK DANTEALAEI DGRLISVSNA LIQATGTDKG
PLDQKLKEAQ QAKTEQDGKE LASGGYKELF EEDKKTSGYF GIAENDNGSG QQEKLAEAKK
NRDAYNKAAK KELDAIAKAQ KAVEAIDAQI VKLKKDKGDI EQEQSTEKGK TGGLDIALSG
ANDAKDAAQG EFDTAKNAAE LAELAETAAK AIEAAKITDK AVEDATAAYK EAADKAEQTK
TALEAAEKAK EDADKLVVTN TGLLNDADQA LEQLVTAQNN AQPTLDLPAI DVTIAPAKTQ
DVIEGTSAIA TQVAGGTQNV AKGGKAIDSV ITKDGIVNLA AGANAKGTEV TKGTLNNNGG
VDTDTVVSTE GKLVLTGGSE TAIATSTGAK VAEGGVVTAG DHSVIEKMIS SGNVTASGNN
TIVRDTTIND GKLSLAGTAT ANNTTFNGGI FSVEGDTAAT KTNMTGGKFA VTGNATIKDT
VLSASDFSLA DKVTANNTTL TGGTFTVAGD TAATKTKMTG GEFAVTGNAK IEDTVLNASD
FSLADKATAN NTTLTDGTFT VAGDAAVTAT NMSGGKFAVK GKAKIKDTQL SAGNFTLAEN
ATANDTTLNG GKFDVSNEAT ATNTTINNGL FTLKDGAHAD STTVNSGTFV MADQSTANGI
QLVDSAFTLA SGAKASGITK LTGGQAQVAG SLESLSLTGG RADFANSAKA SGLLDISADS
QIIMNRGADT AQANLNLAGR LELLASDVAQ AVAQPVARAA MELSNARAVM PAPAMPVPAA
APVAHFALND VVMTGGTVDM SNAKNAQLTM ASLNGTGNFN LGSVMQSDSV APLNVSGDAN
GDFIIAMNSS GQAPTNLNVV NTNGGDARFA LANGPVALGN YMTNLAKDAN GNFVLTADKS
AMTPGTAGIL AVANTTPVIF NAELSSIQQR LDKQSTETNQ SGMWGSYLNN NFAVKGRAAN
FDQKLNGMTL GGDKATALAD GVLSVGGFAS YSSSDIKTDY QSKGKVDSHS FGAYAQYLAN
SGYYMNAVVK NNQFSQDVNI TSINGSASGV SNFSGMGIAL KAGKHFNFNE AYVSPYVAMS
AFSSGKSNIS LSNGMEAQSS STRSAMGTLG VNAGYRFVMN NGAELKPYAI FAVDHEFAKN
NQVTVNQEVF DNNLSGTRVN TGAGMNVNIT PNLSVGSEVK LSSGKDIKTP VTINLNVGYS
F
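
The protein sequence can be compared against a translation of the gene sequence. The sketch below is illustrative only: `CODON_TABLE`, `clean`, and `translate` are hypothetical names. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. Because this gene starts with ATG, the sketch does not special-case alternative start codons.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. These are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text):
    """Strip the spaces and line breaks used in the sequence layout."""
    return "".join(text.split()).upper()


def translate(dna_text):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna_text)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
first_blocks = "ATGAAAAACA GTAATACTCT TAATACGCGT"
print(translate(first_blocks))  # MKNSNTLNTR, the first block of the protein sequence
```

Running `translate` on the full gene sequence and comparing the result with `clean` applied to the protein sequence checks the two listings against each other.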