Gene YpAngola_A1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1750 
Symbol 
ID5800221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1804164 
End bp1811756 
Gene Length7593 bp 
Protein Length2530 aa 
Translation table11 
GC content57% 
IMG OID641339685 
Producthemagglutination activity domain-containing protein 
Protein accessionYP_001606240 
Protein GI162420846 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000776384 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATAAGA ACCTGTACCG TATTATTTTT AACAAAGTGC GGGGGATGAT GATCGTCGTC 
GCAGATATTG CTGCTTCTGG TCGGGCGTCC TCTTCGCCTT CATCAGGATT AGGGCACACG
CAACACCGCC GTATCAGTGC CTTGTCCACG CTAAGTTTCA GCCTGTTACT GGCGCTGGGC
TGTGTCTCGC TCTCCGTTCA GGCGGCGATT GTCGCCGATG CCAGTGCGCC GGGCAACCAG
CAACCCACCA TTATCAACAG CGCCAACGGC ACGCCACAGG TCAATATCCA GGCCCCCAGC
AGCGGCGGGG TTTCACGTAA CGTTTACAGC CAGTTTGATG TTGATGGCCG CGGCGTGATC
CTCAATAACG GCCACGGCGT CAACCAGACC GAACTCGGTG GTTTTATCGA CGGCAACCCG
TGGCTGGCGC GGGGTGAGGC CAGCATTATC CTCAACGAAG TCAACAGCCG TGACCCCAGT
AAACTTAACG GGTATATCGA GGTGGCGGGG CGCAAGGCAC AGGTGGTTAT CGCTAACTCG
GCGGGCATTA CCTGCGAGGG CTGCGGTTTT ATCAACGCCA ACCGTGTGAC CCTGACGACC
GGTCAGGCGC AGCTCAATAA CGGCCAGCTC ACCGGTTACG ATGTGGAGCG GGGGGACATT
GTTATTCAGG GCACCGGCAT GGACAGCAGC CGTCAAGATC ATACCGACCT GATTGCCCGC
TCCGTCAAAG TGAATGCCGG GATCTGGGCC AACGAACTGA GTGTCACCAC CGGGCGCAAT
CAGGTGGATG CCGCCCATCA AAACATTAAC GCCAAAGCCG CCGATGGCAG CCCGCGGCCA
ACCGTGGCGG TGGATGTCGC CCATTTGGGG GGCATGTACG CCGGTAAAAT TCGCCTGATT
GGCACTGAAA GCGGTGTCGG TGTGCACAAT GCGGGCGAGA TAGGGGCGTC AGCGGGCGAT
ATTACGATAA CGGCCGATGG CATGCTGATG AACAGCGGCC AGATCAACAG CAGCCAACAG
TTGGTGGTCA ATACCGCTGC GGATATAGAG AACACCGGTG TGCTCTATGC ACAGGGCAAT
ACCCAACTCA CTACGGCGGG TACACTCAGC AACAGCGGCA CCCTCGCGGC GGGGGGGGAT
ACCTCTGTCC GTGCGGCGGA GGTGAACAGC ACCCGCAATT CTGTTCTGGG GGCGGGTGTG
AAGTCCGACA ACAGCGCCAT TACCAGTGGC ACACTGAGCG TTGAAGCTAG CGGGAAGATC
ACCGCCCAGG GAAAAAATAT CAGCGGCACG GCGCAGCGTT TCACCGCACA CCGTCTTGAG
CTGAGTGGCA GCCAGACGCA AAGCCGTGAT ATCACGCTCA TTGCACAAGG GGGTGAGATC
GATCTGACCG GCGCTGAACT GTTAGCCAGT GATCGCCTGT CGGCTGCAAC CACCGCGTTA
TTGCGCACCG ATAACGCCAG CCTGATCGCC GAACAGATTA CGCTCGACGC GCAGGCACTC
TCCAATGTCG GGGGCCTGAT AGCCCACACC GGGACGACAG ATTTTAATCT GAATCTGCCG
GGCGATGTCG ATAACCGGGG CGGCACCCTC CTCTCAAGCG GCACCCTCTC GCTACAGGCG
GAAAGCCTGA ACAGCAACGG CAACAGCCTG CTGGGGGCGG GAGTACAAAG TGATGGCCGC
CTGACGGAAA TCGGTGACCT GAGGGTGACC ACCCGTCAGG ACCTGATCGC ACACGGGCAG
ACCCTTGCCG CGGGTGCCAT GGCGTTAACC GGCAGCCGGA TTGATCTGGC CGACAGTTAC
ACGCAAGCCC GTGAGATGAC CCTCACCGCC AACCGTGGCG ATATCAGCAC CCAGCGCGCC
ACTGTACTCG CCCTTGACAC ACTGAGCATC AACACCGCTC AAACCCTCAA TAATCAGGGG
GGCACGCTGG CGGGCAACAC GCTCGCGTTG GATCTGGGCC AGTTTGATAA CCAAGGCGGC
CAGGTGACGG CCAGCCAGGA TCTGACCATC GATTTACAGC GTGATTTCAG CCACCAGGCG
GGCTCGACCC TTCAGGCCGG GCGTGATTTA ACCTTGACAT CCCTGGGCGC GGTCACCAAT
GACGGGCAAC TGGTGGCGGG GGGCACACTC AGTACCCACT CGGACAGCTT ACTGAATAGC
GGCAACCTGA TCGCGACGCA AGCAGAGCTC AACGCCACCG GCGCATTGAT TAACCATGGC
GAGATATTGA CCCTCGGTGG GCTTGATACC GACTCCAACA CCCTGTTCAA CACGGGCAGT
ATTATTAGCG CCGAAGCCAC ACTTAACGCA CGGGAGCGTA TTACCAACTC CGGCCCTGAC
GCCCTGATCG GTGCTACCGA TGAAAACGGC ACCCTAGCCC TGCTGGCCCC GGTGATTGAA
AACAGCGATA CCGTCACCCA CACTGACACC GCGCCGACCA CCACGATTTT AGGCATGGGC
ACGGTTATTC TGGCCGGCGG GCACGCGCGT GATGGCCATT ACGCCTCTGC GGCTCAGGTG
CTCAACCTTT CCGGCCTGAT CGAATCCGGC AAAGACATGT TGATTTACGC CACGACGCTG
ACCAACAGCC GCCATATTTT GACCGCCAAC ACCGACTTTA TCGTGGCCGA TACGGTGACA
GGCACGGCTG TCTGGACGGC AGAAAACCCC GATATTCCAG GCGGGCGCTA TGCTGAACCG
CCGGATGGCG GTGCCGATAA CAGCGATTAT ATCGGCACAG AGTATACGTC GGTTATCGCC
TATAACGGCA TCGATCAGAT CAGCCCGGAA GCGCAACTGC TGGCGGGGGG AAACCTGACA
CCGCAGGTGG GCACGCTGGA GAATTTCTGG AGCAAAGTGA GTGCACAGGG GGAGATTGAT
CTCACCGGCG TCACCCTGCA ACAGGATGGC TGGGGTGACC AGCAACGCCT GATGGAGCAG
ACCACCTCCA GCGGTGTCTG GCGCTACCGA ACCTACAAAG GCGGTTTGTG GACGCGTGAG
TGGGGACCTG AAGTCAGTGA GCGCGCCACC AGTGAATATG CCTCAAGTTT TACCGCAAAA
ACACTCAGCG GCAGTGGCAC GACCATTAAC AATGGGGCCA ACCCCGGTGC CATCGCACCG
CCTGCCGATC GCGATAATAG CGGCAAAGAT CTGGCGGTCG AATTTAACGG GATCTCTCTG
ACACAGCCGA ATGGCGGGCT GTATCAGTTC ACAACCGACC ACACCGTCGG CGGTGGCGGT
TATCTGATCG AAACCCACCC GGCGTTTGCC AACCTGAATA ACTGGCGCGG GTCAGATTAC
GTGCTCCAGC AGTTGAATAA TGACCCGGAC GTGATATTCA AACGTCTGGG GGATAACGCC
TATGAACAGC GGCTGGTGCG GGATCAGGTG CTGGCGTTGA CCGGCCAGGC GGTGGCCAGT
GATTACCGCA GTGCACAAGA GCAGTTCGAG GCACTGTTTG CGGCGGGCCT TGAGTACAGC
AAGGCGTTCA ATATTGCCCT CGGCACCCAC CTCAGTGCGG AGCAGATGGC GGCCCTGACC
CACAATATCG TGCTGATGGA AACCCGTGAC GTCGCCGGGC AAACCGTATT AGTCCCCGTG
GTCTATCTGG CGGGGGTTAA ACCGGGCGAT CTGCAGGCCA ACGGGGCATT GATCGCGGCA
GAGAATATCA GCCTGACCGA GGTTCAGGGG TTCACCAATG CGGGGGCGAT AACCGCCACG
AATGACCTGA AAATCAGCAT GGCGCAAGAT ATCACGCTGA ATAACCGTGG TGGCTTGCTT
CAGGCGGGCG GCGATATGCA GCTCAGCACA CTGAACAGCG ATATCGACCT GACCAGCGCG
CGGATCAATG CCACCAACCT GCAACTGGAC AGCGGCCGCG ATGTGATATT GCGCACCGAC
AGTGCGCAGC TCAGTAGCGA CAATGGCGCA GTCTCGCGGG ATCAAACGAT CTTGGGGCCG
CTGGCCAGCA TCAATGTCAG CAATAATGCG ACTATTAATA CCGGGCGTGA TTTTATCATG
CAAGGCGCAA GCCTCAATGT CGGTCAGGAT CTGCAGGTCA CGACTGGCGG CGACTGGCAA
CTGGAGACGG TACAAACACG CGACCAGATA AGCACCCATG ATGGCCGTGG CAGTGCGACC
AGTGAGCATA TTCGCCATCT GGGCAGTGAA GTGAATGTCG GCGGCGCGCT GACGGCCAAC
GTCGACAATC TGACGGCGGT AGGGGCCAAC ATTAATGCCG CTACCCTTGA GGTGCAGGCG
CAGAACATCA GCCTCAGCGC GGCCACCGAC AGCCTGCACG TTACCGGCGA ATCGTCGAGC
AAGCGGCATA CCAGCTCGGT GAACCTCTAT GATGAAACCC TGCTTGGCAG CCAGTTGAAT
GCCACGGGCG ATATCAATTT GCAGGCGGCG CAAGACATCA CCCTGCGAGC CAGTGCGGTA
CAAACCGATG GCGCGCTGAC ACTGGCAGCG GGCGGGGATG TGCTCCTGAC CACCCAGACT
GAGCAACATG ACGAACAGCG CAATCATACC GGTCTCAGCA AAGGGATTGC ATCCAGCACC
CTGACACGCA CCGAAGACAG TCTTAGCCAG ACACTGGCGG TGGGCTCGAT GCTCTCGGCG
GGATCTATTG ATGTCAGCGG TAAAAATATC GCGGTGATGG GCAGCAACGT GGTGGCCGAT
CAGGATATCA GCCTGCGTGC GCAGGAGAAC ATTACCGTCG GCACGGCGCA GCAGAGCGAG
AGCGAATCGC ACCTGTTCGA ACAGAAAAAA TCGGGCCTGA TGAGCACCGG CGGTATCGGT
GTCACGGTGG GCAGCAGTAC CAAAATGACC GATTCTGGTC AATCGATTTC CAGCGTGGGC
AGCACGGTGG GCAGCGTACT GGGCAATGTC AGCATGACCG CCGGTGAAGA CCTGAGGGTG
CAAGGTGCCG AGGTGTTGGC CGGTAAAGAC ATCAATCTGA CAGGTAAAAA TGTCAGTATT
CTGGCGGCGG AGAATCAGCT TACCCAGAGC CACACCGTCG AGCAAAAACA GAGCGGCCTG
ACACTGGCAC TGTCCGGTGC GGTGGGCAGT GCCGTCAATA CCGCGGTGAC CACCGCGAAA
GCGGCCAGCG AAGAGAGCAG TGGCCGCTTG GGGGCATTGC AGGGGGTTAA AGCGGCGCTC
AATGGCGTGC AGGCGGTGCA GGCCGGGCAG TTGGTGCAGG CGGAGGGGGG CGATGCCGCC
AGCATGTTCG GCATCAGTGC GTCCTTGGGC TCACAAAAAT CGTCCTCGGA GCAACATCAG
GAACAGACCC ACGTGACGGG CTCGACGCTG ACGGCAGGCA ACAATCTGAC CATCAATGCC
ACCGGAGAGG GGAATGCGGC AAACAGCGGC GATATTGTGG TGCAAGGCAG CCAGCTCCAG
GCCGGTGGCG ATACCACGCT GGATGCGGCG CGTGATGTGC TGCTACTCGG CGCTGCTAAC
ACACAAAAAA CCGACGGCAG CAACAGCAGC AGTGGCGGCA GTGTTGGTGT CAGTCTGGGC
ATCAGTGGGG CCAGCAGTGG TCTGAGTATT TTTGCCAACG CCAATAAAGG TCAGGGAAGT
GAGCACGGCG ACGGTACCTC CTGGACTGAA ACGACCCTTG ACAGCGGCGG CACGCTGTCG
CTGTACAGTG GCCGCGATAC CTCACTGGTC GGTGCGCAGG TCAGCGGCGA AACGGTGAAG
GTGGAGGTGG GCCGCGACCT GTTGCTGCAA AGTCAGCAGG ACAGCGATAA CTATGACGCC
AAACAACAAA ATAGCAGCGT TGGCGGCAGT TTCAGCCCTG GATCCATGAC GGGCAGTATC
AGTATCAATG GCAGTCAGGA CAAGCTGCAC AGCAACTTTG ACTCGGTGCA GGAGCAGACG
GGTATCTTTG CCGGCTCGGG TGGCTTTGAT ATCACGGTGG GTGGACATAC CCAGCTTGAC
GGTGCGGTGA TTGGCAGCAC GGCGACAGCC GATAAAAACA CGCTGGATAC CGGGACACTG
GGCTTCAGTG ATATCGATAA TCAAGCCGAT TTCAAGGTTG AACATCAAAG TGTGGGTATC
AGCACCGGGG GGAATATCGG CAGTCAGTTT GTTGGCAATA TGGCCAACGG CTTGCTGGTC
GGGGCCAATA ACGAAGGCCA CGCCGACAGC ACCACCCATG CGGCCGTTTC TGAAGGTACG
ATCACGGTGC GCGACACGGA TAACCAGCAG CAAAATGTTG ATGACCTGAG CCGTGACGTG
GAGCAGGCCA ACAATGCCCT TTCCCCTATC TTTGATAAAG AGAAAGAACA AAACCGGCTG
AAGGAAGCGC AGCTTATCGG CGAGATAGGC AGTCAGGTGG GGGATGTGTT CCGCACACAG
GGGCAGATTA TCGCCACTCA GGCGGCGACT GAAAAAATGC AGGAGGTGAG TGAGGCTGAT
CGTGAGGCGG CGAAAGCCAA CTGGGAAAAA GCCAATCCGG GTCAGATTGC AACGGCTGAA
GATATCAACG GTCAGGTTTA TAAAACGGCC TATGATCAGG CATTCAATGC ATCGGGTTAC
GGCACCGGGG GTAAATTCCA GCAGGCGGTA CAAGCGGCGA CAGCGGCCCT CCAGGGGCTG
GCGGGCGGAG ATATAGCCAA AGCGATAGCG GGAGGCAGTG CGCCGTATCT GGCGGAAGTG
ATTAAGCAAA GCACGGGTGA TAACGAAGAA GCGCGACTGG CGGCACATGC GGTGGTCGGT
TCTGTTCTGG CACATCTACA GGGCAATAGC GCGGTTGCGG GAGGCGCAGG TGCCTTGACG
GGTGAGATAG CGGCTGATTT AATCATGCAG CAGTTGTACC CGGGAAAAAT GGTCAGTGAA
CTCAGCGAGA CAGAAAAACA GACCATCAGC GCGTTAAGTA CATTAGCAGC AGGGCTGGCG
GGGGGTTTGA CGGGAGACAG CAGCGCCGAC GCGGTTGCGG GTGCACAGGC TGGGAAAAAT
GCGGTAGAGA ATAATGCGCT GGGTAGCCTT GGAGATATAT TTGGTAGCCA AGGTGCGAAA
TATTTTGAAG GGGCGGGTTC GTTAGAAAGG GAGCTTTCAA CCGATAACAC GCTGACTGTG
CAAGAAAAAC AAGCTATTCG GGATCATTAT TTGAAAGGTG ATTTGCCAGA AGATGTAGTT
AAAGCAATTC TGGAAAATAC CCCTGCATCA GATACTGTGA TGGCGTTGCT TCAAGCAGAA
TCGACTAAAG ACTATGCTCT GGCGTTATTG AGTTCTTTAC CTTTAGAACG TGCAATTGCT
GTTTTGGGTA AAACTGCGAA TACATTAATA AAGTCTTCGG TTGTTGATAA AATTCTTGAC
GCTCAACGCG TTGGTAGTGG TCTTAAGCCA GATCCTAGCC ATCGTGGTGC TAGCTATTTA
AACCGAGAAC AATTAATGGC TGGTGAGGTA TTCAATATTA CTGGTGGGGA TGGTGTGAAG
CGTTCACTAT TGCAGACTAA AGGAACCTTT AATGGCAAGG ATGGAGTTTT TGAGTATATC
TATGATAAGA CAGGAAATGT GACGCACCAA CGCTTTATTG AAGGTGTTGG CATAACCGGA
ATACCAAACC AAAAAGCACC GAAGGTGAAA TAA
 
Protein sequence
MNKNLYRIIF NKVRGMMIVV ADIAASGRAS SSPSSGLGHT QHRRISALST LSFSLLLALG 
CVSLSVQAAI VADASAPGNQ QPTIINSANG TPQVNIQAPS SGGVSRNVYS QFDVDGRGVI
LNNGHGVNQT ELGGFIDGNP WLARGEASII LNEVNSRDPS KLNGYIEVAG RKAQVVIANS
AGITCEGCGF INANRVTLTT GQAQLNNGQL TGYDVERGDI VIQGTGMDSS RQDHTDLIAR
SVKVNAGIWA NELSVTTGRN QVDAAHQNIN AKAADGSPRP TVAVDVAHLG GMYAGKIRLI
GTESGVGVHN AGEIGASAGD ITITADGMLM NSGQINSSQQ LVVNTAADIE NTGVLYAQGN
TQLTTAGTLS NSGTLAAGGD TSVRAAEVNS TRNSVLGAGV KSDNSAITSG TLSVEASGKI
TAQGKNISGT AQRFTAHRLE LSGSQTQSRD ITLIAQGGEI DLTGAELLAS DRLSAATTAL
LRTDNASLIA EQITLDAQAL SNVGGLIAHT GTTDFNLNLP GDVDNRGGTL LSSGTLSLQA
ESLNSNGNSL LGAGVQSDGR LTEIGDLRVT TRQDLIAHGQ TLAAGAMALT GSRIDLADSY
TQAREMTLTA NRGDISTQRA TVLALDTLSI NTAQTLNNQG GTLAGNTLAL DLGQFDNQGG
QVTASQDLTI DLQRDFSHQA GSTLQAGRDL TLTSLGAVTN DGQLVAGGTL STHSDSLLNS
GNLIATQAEL NATGALINHG EILTLGGLDT DSNTLFNTGS IISAEATLNA RERITNSGPD
ALIGATDENG TLALLAPVIE NSDTVTHTDT APTTTILGMG TVILAGGHAR DGHYASAAQV
LNLSGLIESG KDMLIYATTL TNSRHILTAN TDFIVADTVT GTAVWTAENP DIPGGRYAEP
PDGGADNSDY IGTEYTSVIA YNGIDQISPE AQLLAGGNLT PQVGTLENFW SKVSAQGEID
LTGVTLQQDG WGDQQRLMEQ TTSSGVWRYR TYKGGLWTRE WGPEVSERAT SEYASSFTAK
TLSGSGTTIN NGANPGAIAP PADRDNSGKD LAVEFNGISL TQPNGGLYQF TTDHTVGGGG
YLIETHPAFA NLNNWRGSDY VLQQLNNDPD VIFKRLGDNA YEQRLVRDQV LALTGQAVAS
DYRSAQEQFE ALFAAGLEYS KAFNIALGTH LSAEQMAALT HNIVLMETRD VAGQTVLVPV
VYLAGVKPGD LQANGALIAA ENISLTEVQG FTNAGAITAT NDLKISMAQD ITLNNRGGLL
QAGGDMQLST LNSDIDLTSA RINATNLQLD SGRDVILRTD SAQLSSDNGA VSRDQTILGP
LASINVSNNA TINTGRDFIM QGASLNVGQD LQVTTGGDWQ LETVQTRDQI STHDGRGSAT
SEHIRHLGSE VNVGGALTAN VDNLTAVGAN INAATLEVQA QNISLSAATD SLHVTGESSS
KRHTSSVNLY DETLLGSQLN ATGDINLQAA QDITLRASAV QTDGALTLAA GGDVLLTTQT
EQHDEQRNHT GLSKGIASST LTRTEDSLSQ TLAVGSMLSA GSIDVSGKNI AVMGSNVVAD
QDISLRAQEN ITVGTAQQSE SESHLFEQKK SGLMSTGGIG VTVGSSTKMT DSGQSISSVG
STVGSVLGNV SMTAGEDLRV QGAEVLAGKD INLTGKNVSI LAAENQLTQS HTVEQKQSGL
TLALSGAVGS AVNTAVTTAK AASEESSGRL GALQGVKAAL NGVQAVQAGQ LVQAEGGDAA
SMFGISASLG SQKSSSEQHQ EQTHVTGSTL TAGNNLTINA TGEGNAANSG DIVVQGSQLQ
AGGDTTLDAA RDVLLLGAAN TQKTDGSNSS SGGSVGVSLG ISGASSGLSI FANANKGQGS
EHGDGTSWTE TTLDSGGTLS LYSGRDTSLV GAQVSGETVK VEVGRDLLLQ SQQDSDNYDA
KQQNSSVGGS FSPGSMTGSI SINGSQDKLH SNFDSVQEQT GIFAGSGGFD ITVGGHTQLD
GAVIGSTATA DKNTLDTGTL GFSDIDNQAD FKVEHQSVGI STGGNIGSQF VGNMANGLLV
GANNEGHADS TTHAAVSEGT ITVRDTDNQQ QNVDDLSRDV EQANNALSPI FDKEKEQNRL
KEAQLIGEIG SQVGDVFRTQ GQIIATQAAT EKMQEVSEAD REAAKANWEK ANPGQIATAE
DINGQVYKTA YDQAFNASGY GTGGKFQQAV QAATAALQGL AGGDIAKAIA GGSAPYLAEV
IKQSTGDNEE ARLAAHAVVG SVLAHLQGNS AVAGGAGALT GEIAADLIMQ QLYPGKMVSE
LSETEKQTIS ALSTLAAGLA GGLTGDSSAD AVAGAQAGKN AVENNALGSL GDIFGSQGAK
YFEGAGSLER ELSTDNTLTV QEKQAIRDHY LKGDLPEDVV KAILENTPAS DTVMALLQAE
STKDYALALL SSLPLERAIA VLGKTANTLI KSSVVDKILD AQRVGSGLKP DPSHRGASYL
NREQLMAGEV FNITGGDGVK RSLLQTKGTF NGKDGVFEYI YDKTGNVTHQ RFIEGVGITG
IPNQKAPKVK