Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A1750 |
Symbol | |
ID | 5800221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 1804164 |
End bp | 1811756 |
Gene Length | 7593 bp |
Protein Length | 2530 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641339685 |
Product | hemagglutination activity domain-containing protein |
Protein accession | YP_001606240 |
Protein GI | 162420846 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0000776384 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAAGA ACCTGTACCG TATTATTTTT AACAAAGTGC GGGGGATGAT GATCGTCGTC GCAGATATTG CTGCTTCTGG TCGGGCGTCC TCTTCGCCTT CATCAGGATT AGGGCACACG CAACACCGCC GTATCAGTGC CTTGTCCACG CTAAGTTTCA GCCTGTTACT GGCGCTGGGC TGTGTCTCGC TCTCCGTTCA GGCGGCGATT GTCGCCGATG CCAGTGCGCC GGGCAACCAG CAACCCACCA TTATCAACAG CGCCAACGGC ACGCCACAGG TCAATATCCA GGCCCCCAGC AGCGGCGGGG TTTCACGTAA CGTTTACAGC CAGTTTGATG TTGATGGCCG CGGCGTGATC CTCAATAACG GCCACGGCGT CAACCAGACC GAACTCGGTG GTTTTATCGA CGGCAACCCG TGGCTGGCGC GGGGTGAGGC CAGCATTATC CTCAACGAAG TCAACAGCCG TGACCCCAGT AAACTTAACG GGTATATCGA GGTGGCGGGG CGCAAGGCAC AGGTGGTTAT CGCTAACTCG GCGGGCATTA CCTGCGAGGG CTGCGGTTTT ATCAACGCCA ACCGTGTGAC CCTGACGACC GGTCAGGCGC AGCTCAATAA CGGCCAGCTC ACCGGTTACG ATGTGGAGCG GGGGGACATT GTTATTCAGG GCACCGGCAT GGACAGCAGC CGTCAAGATC ATACCGACCT GATTGCCCGC TCCGTCAAAG TGAATGCCGG GATCTGGGCC AACGAACTGA GTGTCACCAC CGGGCGCAAT CAGGTGGATG CCGCCCATCA AAACATTAAC GCCAAAGCCG CCGATGGCAG CCCGCGGCCA ACCGTGGCGG TGGATGTCGC CCATTTGGGG GGCATGTACG CCGGTAAAAT TCGCCTGATT GGCACTGAAA GCGGTGTCGG TGTGCACAAT GCGGGCGAGA TAGGGGCGTC AGCGGGCGAT ATTACGATAA CGGCCGATGG CATGCTGATG AACAGCGGCC AGATCAACAG CAGCCAACAG TTGGTGGTCA ATACCGCTGC GGATATAGAG AACACCGGTG TGCTCTATGC ACAGGGCAAT ACCCAACTCA CTACGGCGGG TACACTCAGC AACAGCGGCA CCCTCGCGGC GGGGGGGGAT ACCTCTGTCC GTGCGGCGGA GGTGAACAGC ACCCGCAATT CTGTTCTGGG GGCGGGTGTG AAGTCCGACA ACAGCGCCAT TACCAGTGGC ACACTGAGCG TTGAAGCTAG CGGGAAGATC ACCGCCCAGG GAAAAAATAT CAGCGGCACG GCGCAGCGTT TCACCGCACA CCGTCTTGAG CTGAGTGGCA GCCAGACGCA AAGCCGTGAT ATCACGCTCA TTGCACAAGG GGGTGAGATC GATCTGACCG GCGCTGAACT GTTAGCCAGT GATCGCCTGT CGGCTGCAAC CACCGCGTTA TTGCGCACCG ATAACGCCAG CCTGATCGCC GAACAGATTA CGCTCGACGC GCAGGCACTC TCCAATGTCG GGGGCCTGAT AGCCCACACC GGGACGACAG ATTTTAATCT GAATCTGCCG GGCGATGTCG ATAACCGGGG CGGCACCCTC CTCTCAAGCG GCACCCTCTC GCTACAGGCG GAAAGCCTGA ACAGCAACGG CAACAGCCTG CTGGGGGCGG GAGTACAAAG TGATGGCCGC CTGACGGAAA TCGGTGACCT GAGGGTGACC ACCCGTCAGG ACCTGATCGC ACACGGGCAG ACCCTTGCCG CGGGTGCCAT GGCGTTAACC GGCAGCCGGA TTGATCTGGC CGACAGTTAC ACGCAAGCCC GTGAGATGAC CCTCACCGCC AACCGTGGCG ATATCAGCAC CCAGCGCGCC ACTGTACTCG CCCTTGACAC ACTGAGCATC AACACCGCTC AAACCCTCAA TAATCAGGGG GGCACGCTGG CGGGCAACAC GCTCGCGTTG GATCTGGGCC AGTTTGATAA CCAAGGCGGC CAGGTGACGG CCAGCCAGGA TCTGACCATC GATTTACAGC GTGATTTCAG CCACCAGGCG GGCTCGACCC TTCAGGCCGG GCGTGATTTA ACCTTGACAT CCCTGGGCGC GGTCACCAAT GACGGGCAAC TGGTGGCGGG GGGCACACTC AGTACCCACT CGGACAGCTT ACTGAATAGC GGCAACCTGA TCGCGACGCA AGCAGAGCTC AACGCCACCG GCGCATTGAT TAACCATGGC GAGATATTGA CCCTCGGTGG GCTTGATACC GACTCCAACA CCCTGTTCAA CACGGGCAGT ATTATTAGCG CCGAAGCCAC ACTTAACGCA CGGGAGCGTA TTACCAACTC CGGCCCTGAC GCCCTGATCG GTGCTACCGA TGAAAACGGC ACCCTAGCCC TGCTGGCCCC GGTGATTGAA AACAGCGATA CCGTCACCCA CACTGACACC GCGCCGACCA CCACGATTTT AGGCATGGGC ACGGTTATTC TGGCCGGCGG GCACGCGCGT GATGGCCATT ACGCCTCTGC GGCTCAGGTG CTCAACCTTT CCGGCCTGAT CGAATCCGGC AAAGACATGT TGATTTACGC CACGACGCTG ACCAACAGCC GCCATATTTT GACCGCCAAC ACCGACTTTA TCGTGGCCGA TACGGTGACA GGCACGGCTG TCTGGACGGC AGAAAACCCC GATATTCCAG GCGGGCGCTA TGCTGAACCG CCGGATGGCG GTGCCGATAA CAGCGATTAT ATCGGCACAG AGTATACGTC GGTTATCGCC TATAACGGCA TCGATCAGAT CAGCCCGGAA GCGCAACTGC TGGCGGGGGG AAACCTGACA CCGCAGGTGG GCACGCTGGA GAATTTCTGG AGCAAAGTGA GTGCACAGGG GGAGATTGAT CTCACCGGCG TCACCCTGCA ACAGGATGGC TGGGGTGACC AGCAACGCCT GATGGAGCAG ACCACCTCCA GCGGTGTCTG GCGCTACCGA ACCTACAAAG GCGGTTTGTG GACGCGTGAG TGGGGACCTG AAGTCAGTGA GCGCGCCACC AGTGAATATG CCTCAAGTTT TACCGCAAAA ACACTCAGCG GCAGTGGCAC GACCATTAAC AATGGGGCCA ACCCCGGTGC CATCGCACCG CCTGCCGATC GCGATAATAG CGGCAAAGAT CTGGCGGTCG AATTTAACGG GATCTCTCTG ACACAGCCGA ATGGCGGGCT GTATCAGTTC ACAACCGACC ACACCGTCGG CGGTGGCGGT TATCTGATCG AAACCCACCC GGCGTTTGCC AACCTGAATA ACTGGCGCGG GTCAGATTAC GTGCTCCAGC AGTTGAATAA TGACCCGGAC GTGATATTCA AACGTCTGGG GGATAACGCC TATGAACAGC GGCTGGTGCG GGATCAGGTG CTGGCGTTGA CCGGCCAGGC GGTGGCCAGT GATTACCGCA GTGCACAAGA GCAGTTCGAG GCACTGTTTG CGGCGGGCCT TGAGTACAGC AAGGCGTTCA ATATTGCCCT CGGCACCCAC CTCAGTGCGG AGCAGATGGC GGCCCTGACC CACAATATCG TGCTGATGGA AACCCGTGAC GTCGCCGGGC AAACCGTATT AGTCCCCGTG GTCTATCTGG CGGGGGTTAA ACCGGGCGAT CTGCAGGCCA ACGGGGCATT GATCGCGGCA GAGAATATCA GCCTGACCGA GGTTCAGGGG TTCACCAATG CGGGGGCGAT AACCGCCACG AATGACCTGA AAATCAGCAT GGCGCAAGAT ATCACGCTGA ATAACCGTGG TGGCTTGCTT CAGGCGGGCG GCGATATGCA GCTCAGCACA CTGAACAGCG ATATCGACCT GACCAGCGCG CGGATCAATG CCACCAACCT GCAACTGGAC AGCGGCCGCG ATGTGATATT GCGCACCGAC AGTGCGCAGC TCAGTAGCGA CAATGGCGCA GTCTCGCGGG ATCAAACGAT CTTGGGGCCG CTGGCCAGCA TCAATGTCAG CAATAATGCG ACTATTAATA CCGGGCGTGA TTTTATCATG CAAGGCGCAA GCCTCAATGT CGGTCAGGAT CTGCAGGTCA CGACTGGCGG CGACTGGCAA CTGGAGACGG TACAAACACG CGACCAGATA AGCACCCATG ATGGCCGTGG CAGTGCGACC AGTGAGCATA TTCGCCATCT GGGCAGTGAA GTGAATGTCG GCGGCGCGCT GACGGCCAAC GTCGACAATC TGACGGCGGT AGGGGCCAAC ATTAATGCCG CTACCCTTGA GGTGCAGGCG CAGAACATCA GCCTCAGCGC GGCCACCGAC AGCCTGCACG TTACCGGCGA ATCGTCGAGC AAGCGGCATA CCAGCTCGGT GAACCTCTAT GATGAAACCC TGCTTGGCAG CCAGTTGAAT GCCACGGGCG ATATCAATTT GCAGGCGGCG CAAGACATCA CCCTGCGAGC CAGTGCGGTA CAAACCGATG GCGCGCTGAC ACTGGCAGCG GGCGGGGATG TGCTCCTGAC CACCCAGACT GAGCAACATG ACGAACAGCG CAATCATACC GGTCTCAGCA AAGGGATTGC ATCCAGCACC CTGACACGCA CCGAAGACAG TCTTAGCCAG ACACTGGCGG TGGGCTCGAT GCTCTCGGCG GGATCTATTG ATGTCAGCGG TAAAAATATC GCGGTGATGG GCAGCAACGT GGTGGCCGAT CAGGATATCA GCCTGCGTGC GCAGGAGAAC ATTACCGTCG GCACGGCGCA GCAGAGCGAG AGCGAATCGC ACCTGTTCGA ACAGAAAAAA TCGGGCCTGA TGAGCACCGG CGGTATCGGT GTCACGGTGG GCAGCAGTAC CAAAATGACC GATTCTGGTC AATCGATTTC CAGCGTGGGC AGCACGGTGG GCAGCGTACT GGGCAATGTC AGCATGACCG CCGGTGAAGA CCTGAGGGTG CAAGGTGCCG AGGTGTTGGC CGGTAAAGAC ATCAATCTGA CAGGTAAAAA TGTCAGTATT CTGGCGGCGG AGAATCAGCT TACCCAGAGC CACACCGTCG AGCAAAAACA GAGCGGCCTG ACACTGGCAC TGTCCGGTGC GGTGGGCAGT GCCGTCAATA CCGCGGTGAC CACCGCGAAA GCGGCCAGCG AAGAGAGCAG TGGCCGCTTG GGGGCATTGC AGGGGGTTAA AGCGGCGCTC AATGGCGTGC AGGCGGTGCA GGCCGGGCAG TTGGTGCAGG CGGAGGGGGG CGATGCCGCC AGCATGTTCG GCATCAGTGC GTCCTTGGGC TCACAAAAAT CGTCCTCGGA GCAACATCAG GAACAGACCC ACGTGACGGG CTCGACGCTG ACGGCAGGCA ACAATCTGAC CATCAATGCC ACCGGAGAGG GGAATGCGGC AAACAGCGGC GATATTGTGG TGCAAGGCAG CCAGCTCCAG GCCGGTGGCG ATACCACGCT GGATGCGGCG CGTGATGTGC TGCTACTCGG CGCTGCTAAC ACACAAAAAA CCGACGGCAG CAACAGCAGC AGTGGCGGCA GTGTTGGTGT CAGTCTGGGC ATCAGTGGGG CCAGCAGTGG TCTGAGTATT TTTGCCAACG CCAATAAAGG TCAGGGAAGT GAGCACGGCG ACGGTACCTC CTGGACTGAA ACGACCCTTG ACAGCGGCGG CACGCTGTCG CTGTACAGTG GCCGCGATAC CTCACTGGTC GGTGCGCAGG TCAGCGGCGA AACGGTGAAG GTGGAGGTGG GCCGCGACCT GTTGCTGCAA AGTCAGCAGG ACAGCGATAA CTATGACGCC AAACAACAAA ATAGCAGCGT TGGCGGCAGT TTCAGCCCTG GATCCATGAC GGGCAGTATC AGTATCAATG GCAGTCAGGA CAAGCTGCAC AGCAACTTTG ACTCGGTGCA GGAGCAGACG GGTATCTTTG CCGGCTCGGG TGGCTTTGAT ATCACGGTGG GTGGACATAC CCAGCTTGAC GGTGCGGTGA TTGGCAGCAC GGCGACAGCC GATAAAAACA CGCTGGATAC CGGGACACTG GGCTTCAGTG ATATCGATAA TCAAGCCGAT TTCAAGGTTG AACATCAAAG TGTGGGTATC AGCACCGGGG GGAATATCGG CAGTCAGTTT GTTGGCAATA TGGCCAACGG CTTGCTGGTC GGGGCCAATA ACGAAGGCCA CGCCGACAGC ACCACCCATG CGGCCGTTTC TGAAGGTACG ATCACGGTGC GCGACACGGA TAACCAGCAG CAAAATGTTG ATGACCTGAG CCGTGACGTG GAGCAGGCCA ACAATGCCCT TTCCCCTATC TTTGATAAAG AGAAAGAACA AAACCGGCTG AAGGAAGCGC AGCTTATCGG CGAGATAGGC AGTCAGGTGG GGGATGTGTT CCGCACACAG GGGCAGATTA TCGCCACTCA GGCGGCGACT GAAAAAATGC AGGAGGTGAG TGAGGCTGAT CGTGAGGCGG CGAAAGCCAA CTGGGAAAAA GCCAATCCGG GTCAGATTGC AACGGCTGAA GATATCAACG GTCAGGTTTA TAAAACGGCC TATGATCAGG CATTCAATGC ATCGGGTTAC GGCACCGGGG GTAAATTCCA GCAGGCGGTA CAAGCGGCGA CAGCGGCCCT CCAGGGGCTG GCGGGCGGAG ATATAGCCAA AGCGATAGCG GGAGGCAGTG CGCCGTATCT GGCGGAAGTG ATTAAGCAAA GCACGGGTGA TAACGAAGAA GCGCGACTGG CGGCACATGC GGTGGTCGGT TCTGTTCTGG CACATCTACA GGGCAATAGC GCGGTTGCGG GAGGCGCAGG TGCCTTGACG GGTGAGATAG CGGCTGATTT AATCATGCAG CAGTTGTACC CGGGAAAAAT GGTCAGTGAA CTCAGCGAGA CAGAAAAACA GACCATCAGC GCGTTAAGTA CATTAGCAGC AGGGCTGGCG GGGGGTTTGA CGGGAGACAG CAGCGCCGAC GCGGTTGCGG GTGCACAGGC TGGGAAAAAT GCGGTAGAGA ATAATGCGCT GGGTAGCCTT GGAGATATAT TTGGTAGCCA AGGTGCGAAA TATTTTGAAG GGGCGGGTTC GTTAGAAAGG GAGCTTTCAA CCGATAACAC GCTGACTGTG CAAGAAAAAC AAGCTATTCG GGATCATTAT TTGAAAGGTG ATTTGCCAGA AGATGTAGTT AAAGCAATTC TGGAAAATAC CCCTGCATCA GATACTGTGA TGGCGTTGCT TCAAGCAGAA TCGACTAAAG ACTATGCTCT GGCGTTATTG AGTTCTTTAC CTTTAGAACG TGCAATTGCT GTTTTGGGTA AAACTGCGAA TACATTAATA AAGTCTTCGG TTGTTGATAA AATTCTTGAC GCTCAACGCG TTGGTAGTGG TCTTAAGCCA GATCCTAGCC ATCGTGGTGC TAGCTATTTA AACCGAGAAC AATTAATGGC TGGTGAGGTA TTCAATATTA CTGGTGGGGA TGGTGTGAAG CGTTCACTAT TGCAGACTAA AGGAACCTTT AATGGCAAGG ATGGAGTTTT TGAGTATATC TATGATAAGA CAGGAAATGT GACGCACCAA CGCTTTATTG AAGGTGTTGG CATAACCGGA ATACCAAACC AAAAAGCACC GAAGGTGAAA TAA
|
Protein sequence | MNKNLYRIIF NKVRGMMIVV ADIAASGRAS SSPSSGLGHT QHRRISALST LSFSLLLALG CVSLSVQAAI VADASAPGNQ QPTIINSANG TPQVNIQAPS SGGVSRNVYS QFDVDGRGVI LNNGHGVNQT ELGGFIDGNP WLARGEASII LNEVNSRDPS KLNGYIEVAG RKAQVVIANS AGITCEGCGF INANRVTLTT GQAQLNNGQL TGYDVERGDI VIQGTGMDSS RQDHTDLIAR SVKVNAGIWA NELSVTTGRN QVDAAHQNIN AKAADGSPRP TVAVDVAHLG GMYAGKIRLI GTESGVGVHN AGEIGASAGD ITITADGMLM NSGQINSSQQ LVVNTAADIE NTGVLYAQGN TQLTTAGTLS NSGTLAAGGD TSVRAAEVNS TRNSVLGAGV KSDNSAITSG TLSVEASGKI TAQGKNISGT AQRFTAHRLE LSGSQTQSRD ITLIAQGGEI DLTGAELLAS DRLSAATTAL LRTDNASLIA EQITLDAQAL SNVGGLIAHT GTTDFNLNLP GDVDNRGGTL LSSGTLSLQA ESLNSNGNSL LGAGVQSDGR LTEIGDLRVT TRQDLIAHGQ TLAAGAMALT GSRIDLADSY TQAREMTLTA NRGDISTQRA TVLALDTLSI NTAQTLNNQG GTLAGNTLAL DLGQFDNQGG QVTASQDLTI DLQRDFSHQA GSTLQAGRDL TLTSLGAVTN DGQLVAGGTL STHSDSLLNS GNLIATQAEL NATGALINHG EILTLGGLDT DSNTLFNTGS IISAEATLNA RERITNSGPD ALIGATDENG TLALLAPVIE NSDTVTHTDT APTTTILGMG TVILAGGHAR DGHYASAAQV LNLSGLIESG KDMLIYATTL TNSRHILTAN TDFIVADTVT GTAVWTAENP DIPGGRYAEP PDGGADNSDY IGTEYTSVIA YNGIDQISPE AQLLAGGNLT PQVGTLENFW SKVSAQGEID LTGVTLQQDG WGDQQRLMEQ TTSSGVWRYR TYKGGLWTRE WGPEVSERAT SEYASSFTAK TLSGSGTTIN NGANPGAIAP PADRDNSGKD LAVEFNGISL TQPNGGLYQF TTDHTVGGGG YLIETHPAFA NLNNWRGSDY VLQQLNNDPD VIFKRLGDNA YEQRLVRDQV LALTGQAVAS DYRSAQEQFE ALFAAGLEYS KAFNIALGTH LSAEQMAALT HNIVLMETRD VAGQTVLVPV VYLAGVKPGD LQANGALIAA ENISLTEVQG FTNAGAITAT NDLKISMAQD ITLNNRGGLL QAGGDMQLST LNSDIDLTSA RINATNLQLD SGRDVILRTD SAQLSSDNGA VSRDQTILGP LASINVSNNA TINTGRDFIM QGASLNVGQD LQVTTGGDWQ LETVQTRDQI STHDGRGSAT SEHIRHLGSE VNVGGALTAN VDNLTAVGAN INAATLEVQA QNISLSAATD SLHVTGESSS KRHTSSVNLY DETLLGSQLN ATGDINLQAA QDITLRASAV QTDGALTLAA GGDVLLTTQT EQHDEQRNHT GLSKGIASST LTRTEDSLSQ TLAVGSMLSA GSIDVSGKNI AVMGSNVVAD QDISLRAQEN ITVGTAQQSE SESHLFEQKK SGLMSTGGIG VTVGSSTKMT DSGQSISSVG STVGSVLGNV SMTAGEDLRV QGAEVLAGKD INLTGKNVSI LAAENQLTQS HTVEQKQSGL TLALSGAVGS AVNTAVTTAK AASEESSGRL GALQGVKAAL NGVQAVQAGQ LVQAEGGDAA SMFGISASLG SQKSSSEQHQ EQTHVTGSTL TAGNNLTINA TGEGNAANSG DIVVQGSQLQ AGGDTTLDAA RDVLLLGAAN TQKTDGSNSS SGGSVGVSLG ISGASSGLSI FANANKGQGS EHGDGTSWTE TTLDSGGTLS LYSGRDTSLV GAQVSGETVK VEVGRDLLLQ SQQDSDNYDA KQQNSSVGGS FSPGSMTGSI SINGSQDKLH SNFDSVQEQT GIFAGSGGFD ITVGGHTQLD GAVIGSTATA DKNTLDTGTL GFSDIDNQAD FKVEHQSVGI STGGNIGSQF VGNMANGLLV GANNEGHADS TTHAAVSEGT ITVRDTDNQQ QNVDDLSRDV EQANNALSPI FDKEKEQNRL KEAQLIGEIG SQVGDVFRTQ GQIIATQAAT EKMQEVSEAD REAAKANWEK ANPGQIATAE DINGQVYKTA YDQAFNASGY GTGGKFQQAV QAATAALQGL AGGDIAKAIA GGSAPYLAEV IKQSTGDNEE ARLAAHAVVG SVLAHLQGNS AVAGGAGALT GEIAADLIMQ QLYPGKMVSE LSETEKQTIS ALSTLAAGLA GGLTGDSSAD AVAGAQAGKN AVENNALGSL GDIFGSQGAK YFEGAGSLER ELSTDNTLTV QEKQAIRDHY LKGDLPEDVV KAILENTPAS DTVMALLQAE STKDYALALL SSLPLERAIA VLGKTANTLI KSSVVDKILD AQRVGSGLKP DPSHRGASYL NREQLMAGEV FNITGGDGVK RSLLQTKGTF NGKDGVFEYI YDKTGNVTHQ RFIEGVGITG IPNQKAPKVK
|
| |