Gene Information |
Locus tag | YpAngola_A3922 |
Symbol | |
ID | 5802400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 4165886 |
End bp | 4170793 |
Gene Length | 4908 bp |
Protein Length | 1635 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641341711 |
Product | hemagglutination activity domain-containing protein |
Protein accession | YP_001608221 |
Protein GI | 162418196 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
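The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that encodes no residue. The variable names are illustrative and are not fields of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 4165886
end_bp = 4170793
gene_length_bp = 4908
protein_length_aa = 1635

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so residues = codons - 1.
codons = span // 3

print(span == gene_length_bp)           # True
print(span % 3 == 0)                    # True
print(codons - 1 == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 4908 bp is 1636 codons, one of which is the stop codon, leaving 1635 amino acids.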
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAC ACACCTTTAA GCTCTCACCG GCAGGGAAAT TGGCGGCGGC GGTAACCATT ATTTCGGTTT CTGTTGCCAC CTGTTATGCC GCCGGTATTG TCGGAGCAGG CGATTCTGCA CATAAACCGG ATGTTAGCTC GGTAAATGGC ACATCGGTAA TTAATATCGT GCAACCGTCA GCCTCCGGCT TATCACATAA CCAATTTCAG GATTTTAATG TTGGCGAAAA GGGGGCGGTA CTGAATAACG CCACCAGTGC AGGTAACTCA ATACTTGCCG GGCAGTTAGC CGCTAACCAA AATTTAAACG GTCAAGCGGC CAGTATTATT CTTAATGAAG TCATCAGCCG TAATCCTTCT CTGTTATTGG GCCAACAAGA AATTTTTGGT ATGACGGCGG ATTATATTCT GGCCAACCCA AATGGCATCA CCTGTAATGG CTGTGGTTTT ATGAATACCA ACCGCGAGTC ACTGGTCGTG GGTAATCCCT TGATTGAACA AGGATCGCTG AAGGGTTTTG AAACTTTCAA TAACAATAAT TGGTTGATAA TTCAAGATAA GGGGCTAACT GCTAATAAAA TATTGGATCT GATCGCACCC AGAATTGAGG TCACAGGTAT GGTCACCTCA ACGGAAGCCA TTAATGCCTT ATCAGGTAAT AACCAAATTT CAACTGATGG GCAAATATTA GAGTCTCGCC AAGAAGATCA TCCAAATCGT CCAACCAGTT TGGGGGGATG GTTTAGCAGT TTGTTTTCAA GTGAAAGTGA AGAATCTATT GATGGTAAAT ATCTTGGCAG TATGCAATCA GGCCGGATTA ATTTAGTCAG CACTCGGGAA GGCAGTGGGG TAAAAATTGC CGGGAGTCTA AACGGTAGCG AAGAAATAAA CGCCACTATC AAAGGAAAGC TGCAATTAGA AGCCGCCAAA TTAGGGGGTA ATGATATCAA CATCAACGCC AATAGCATCC AGGCATTCGG TAATCTTCAT AAAAATGAAG ACAATGGGGG TGTTACGCAG AGTCTGGAAC GCACTCAGTT AAAAGGCAAA AATATTGCTA TCGTTGCCAA TAAAAAGAAT CAGCTTTCCG CAGTAAAAAT CGATGCTGAC AATGTGACGT TGAGAGGAGG TGAATTGGTA TTGGATGAGA GAATCCTCAC CAAGACAGAA CAAGAATCTT CATCAGACGG TGGAAATGGG GTGCTGGGCC TTGGTAAGTG GAATTGGTCT GAAAATAAAG AGGAGAAACA ACAAACCAGT ATTGGCACCA CCATTACAGC GAAAAATAAT GCCTCTCTTA AATCAACTCA AGATGATATT AAACTGTCGG CTGCCACAAT AACCGCGGGT AAAAATCTTG CCATCAAAGC CAAAAAAGAT TTACACATAG ACGGTGCTAT TGAAGAGAAT TCAATTCACG ACTACGGCCA TAATTATAAA CATATGGTCA AAGATGAATC TTGGAATAAT AAAACCACCA AACAAACACT CAATAAAACC ACACTTGAGG CCGGTAAGAA TTTGGGCCTC ACCGCAGAAA ATAAAATCAC GACTCAGGGA ATCAAAGCTT CTGCGGGGGG TGACGTCGTA ATCGATGCCA ACGACGTTAA GATTGGGGTA CAAAAAACCA GTAATCAGGA AACAACCGAC GGTAAACATG AAAGAAACTT GGGACTCGGT GGTGTCGATC ACAATAATAA CGATAAGTAT GCGGAAACCA GCCATAGCTC AGAAATAACC GCTGACGGTA ATATACTCAT CAGCGTGAAA GATGACGTTG CTATTACTGG CAGTAAAGTA 
AAAGCAACTA AAGATGGTTT TGTTCAAGCT AAAGAGGGTG GGATCAAAAT CGATAACGCC ATCAGTACCA CAACCAATAA AGTTGATGAG CGTACGGGGG TAGCATTTGA TATTACCGGC AGTTCCAAAA AAGCCAATAA CAGTGAAGAA AAATCTACCG GTAGCGAAGT CGTCTCTGAA GCAAACCTGA AAATCATCAG TAAAAAAGAT GTCGATGTGA TTGGCAGCTT GGTAAAAAGC GCCGGTGAAT TAGGCATTGA GACCTTAGGC GATATCAATG TCGCCGCTGC GCAGGAAAAA CAAAAGATCG ATGAACAAAA AACCCTACTG ACAATTGATG GTTTTACCAG TGATGACGGT AAAAATCAGT ATCAAGCGGG GCTAAAAGTG GAACACACCA GCGAGAGTGA AAAAACGGAG AAAGTGATAA ATCACGGCTC TACTCTCGAG GGGGGCACCG TCAAACTGGA GGCAGACAAA GATGTGACAT TCACTGGGTC ACAACTCAAT ACCACCAAAG GCGATGCGGA TATCACTGCT GAAAACGTCT CTTTTGTCGC TGCACAAGAC ACCACCACCA GCAATAAAGA AAAAGAGACC GTGGGTGTGA ATGCTCACTA TACCGGTGGG ATGGATAAAG CGGGTAGCGG TGCGGGGGTG AGTTATGAAG AGACGAAGAC GGACAGCGAG AAATCAACTG CGGTCGTTTC CCATACTGAC GTGAAAGGCG ATCTGAATCT CAACGCCAAA CAAGAGATAA CTAACCAAGG CACCGATCAT AACGTCGAAG GCAGTTACGC GGCGAATGCA ACTAACGTTA ACAATTTAGC CGCAGAAAAC AGTGAAACCA CCACCACAAC CACCAACACT GTCGATGTTA AATATGGTGG TAATATCGCT TATGACGGTG TTACTCGTCC TGTCGAAAAG ACGATAGAAA GTGGTAAAAA ATTGGATGTT GGTGGCGTTA TTGAAAACGT AGGAAATGTC GCGCCAGATT CAGTCAATGC CGGCGTAGAC CTCTCCGTTA ATGTTGGCGA AAAAGAGAAT AAATCCAGTA ATTCTCAAGC GGTAGTGACA TCAATAAAAA GTGGTGATAT CAGCATCACA GCAAAAGAGG ATGTCAAGGA TCAAGGAACT CACTACCAAG CTGACAAAGG TGGAATAAAG ATTGATGCTG CCAACCACAC TTTTGAATCT GCGGTTAATC GTGCTGAAGA ATCAGAAAAA GTCGTATCGG GCGGCGTAGA TATGCGAATT TATACCACTA CGGGCGAAGA CATCAATGTT GATGCAAAAG GTAAGGGTGA AAATAAACAG CAAGAGGTTA AGGCCGAGCA GGCTCAAACT GGCAGCATGA AAGCTGTTGG CGATATTATT ATCAACGTAC AAGAAAACGC CCGTTATCAA GGGACTGATC TTGATTCTGG TGATGGAAAA ATCGCTATCA CAGCAGGCGA AGAGATTAAA TTTGAACAAG CAACCAGCCA CCTCAGTGAA AACCACAATA AGATTGATGC AAACGCCAAA GCAAACTTCG GTACTAAGCC GAACAGTAAA GACTTTGGCG GCGGCTTGGG CGGCGGCCAT AGTCAAGGCA GCACCAGTGC CGATATTGCC CAAGTCAGCC ATCTACACGG TAAGCAAGGC ATCGAGCTGA ACGCTGGAAA AGATTTAACA CTGCAAGGTA CCGAGTTTGG CAGTAAAAAT GCCGCTACCG GTGATGTTCA ACTTACCGCC GGTGGAAAAG TAGACTTCCA AGCGGCACAG TCACAAAGTT CTAAGCAAGA TATGACCTGG TCTGCCAGCG 
TCAAGGCAGG AAAAAGCAAA AGTAACAGCA CCAGTGAAAA TGATGATGGC ACAAAAACGC ATACTAACAA CAAAGGGTTC AGCGCAGGAG CAGAAGCCAA CGTTAAAAAC ACTGATGAGA CCAGTCTTAC ACATCAGGGC GGCATTATTA ACAGTAATGG CGCGGTGACC ATTAAGGCAG ACGGTAAAGA TCAGCAAGCC ATCCATCTGC AAAATACCAA CATCATCAGT CAAGAGACCA TTCTGAACGC GGATAATGGC GGGATCGTAA TGGAATCGGC TCAGGATAAA GAGCACAAAA ATAACTGGAA TGTTAACACC AACCTCAATG GCAGCCAGAA AAACACCATT AAAAGCGATG ATCAAGGTGT TGTAGATAAA GATAGCGCCA AGAAAATCCA TAATGCGGCC ATCAAACTCG ATGGCGGTGT TGATAAACTG GACTCTGTGA CGCAACAAAA CACCCATATC AACAGTGATA AGGTGACCCT GAACAGCGCT AAAAATACCG AATTGGCAGG CGCAGTACTC CAGGCTAACC AGATTAGTGG GCAAATCGGT GGGGATCTAA ACGTCATTAG CCGCGAAAAC AGACTCAATA AAGTCAATGT TTCTGCCGCA CTTGGCGGCA GCCACAGTAA TGCAAAACAA GACAGTTTGA TCAGCCAGGT AGCGAATGCC AGCCCAATTA TGTCTGATAA AATCAAGAAT AAACTGGAAG AGAAATCTAC CAAGATCTTC GACAAAGTCG AAAATAAATT CAACACCTTA GGAAAAGAAA AGGACGATAG CGTCCAGACC ATCAGCTATA CCAAAGACGG TCAAACAGTG AAGATCAGCG AAGCCGATGA GAAGAAGGAA ACCAAAGATA AATGGTGGCA GAAAGGGGCT AAATCTGTCG GTAAGAAAAT CAAAAGCGCG GTACAGGATG AACAAGTCGT CGGGGGTAAC GGCAGCGTCA AAGCGAACGT AGAGGTTGTC GAGAGCCAAG GGGTTGAGGA GCAATCAGCT ATCCGGGGTA CACAGAATGT GGATCTGACC GTTAAGGGTA AAACTGACCT GGTTGGCGGG AAAATATCCA GTAAAAATTC GGATGTTAAG CTAAAAACCA ATGGGTTAGA GACGCAGGAT ATCAACGGGA AATATACCGA AGGGGGTGCA CGATTGAACG CTTCTTCATC GGTTATGGAT ATGATTAGCG ATGGCGCTAA GGATGTAATG GATGGCAAGG CACCGCTGGT CAGTGGACAT GGTAAATCTG AACAGAAAAA TGCAACGGGT GGTATCACCA GAGAATGA
|
Protein sequence | MKKHTFKLSP AGKLAAAVTI ISVSVATCYA AGIVGAGDSA HKPDVSSVNG TSVINIVQPS ASGLSHNQFQ DFNVGEKGAV LNNATSAGNS ILAGQLAANQ NLNGQAASII LNEVISRNPS LLLGQQEIFG MTADYILANP NGITCNGCGF MNTNRESLVV GNPLIEQGSL KGFETFNNNN WLIIQDKGLT ANKILDLIAP RIEVTGMVTS TEAINALSGN NQISTDGQIL ESRQEDHPNR PTSLGGWFSS LFSSESEESI DGKYLGSMQS GRINLVSTRE GSGVKIAGSL NGSEEINATI KGKLQLEAAK LGGNDININA NSIQAFGNLH KNEDNGGVTQ SLERTQLKGK NIAIVANKKN QLSAVKIDAD NVTLRGGELV LDERILTKTE QESSSDGGNG VLGLGKWNWS ENKEEKQQTS IGTTITAKNN ASLKSTQDDI KLSAATITAG KNLAIKAKKD LHIDGAIEEN SIHDYGHNYK HMVKDESWNN KTTKQTLNKT TLEAGKNLGL TAENKITTQG IKASAGGDVV IDANDVKIGV QKTSNQETTD GKHERNLGLG GVDHNNNDKY AETSHSSEIT ADGNILISVK DDVAITGSKV KATKDGFVQA KEGGIKIDNA ISTTTNKVDE RTGVAFDITG SSKKANNSEE KSTGSEVVSE ANLKIISKKD VDVIGSLVKS AGELGIETLG DINVAAAQEK QKIDEQKTLL TIDGFTSDDG KNQYQAGLKV EHTSESEKTE KVINHGSTLE GGTVKLEADK DVTFTGSQLN TTKGDADITA ENVSFVAAQD TTTSNKEKET VGVNAHYTGG MDKAGSGAGV SYEETKTDSE KSTAVVSHTD VKGDLNLNAK QEITNQGTDH NVEGSYAANA TNVNNLAAEN SETTTTTTNT VDVKYGGNIA YDGVTRPVEK TIESGKKLDV GGVIENVGNV APDSVNAGVD LSVNVGEKEN KSSNSQAVVT SIKSGDISIT AKEDVKDQGT HYQADKGGIK IDAANHTFES AVNRAEESEK VVSGGVDMRI YTTTGEDINV DAKGKGENKQ QEVKAEQAQT GSMKAVGDII INVQENARYQ GTDLDSGDGK IAITAGEEIK FEQATSHLSE NHNKIDANAK ANFGTKPNSK DFGGGLGGGH SQGSTSADIA QVSHLHGKQG IELNAGKDLT LQGTEFGSKN AATGDVQLTA GGKVDFQAAQ SQSSKQDMTW SASVKAGKSK SNSTSENDDG TKTHTNNKGF SAGAEANVKN TDETSLTHQG GIINSNGAVT IKADGKDQQA IHLQNTNIIS QETILNADNG GIVMESAQDK EHKNNWNVNT NLNGSQKNTI KSDDQGVVDK DSAKKIHNAA IKLDGGVDKL DSVTQQNTHI NSDKVTLNSA KNTELAGAVL QANQISGQIG GDLNVISREN RLNKVNVSAA LGGSHSNAKQ DSLISQVANA SPIMSDKIKN KLEEKSTKIF DKVENKFNTL GKEKDDSVQT ISYTKDGQTV KISEADEKKE TKDKWWQKGA KSVGKKIKSA VQDEQVVGGN GSVKANVEVV ESQGVEEQSA IRGTQNVDLT VKGKTDLVGG KISSKNSDVK LKTNGLETQD INGKYTEGGA RLNASSSVMD MISDGAKDVM DGKAPLVSGH GKSEQKNATG GITRE
|
| |
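The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The sketch below shows one way to strip the spacing, compute the GC fraction, and translate the CDS. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are allowed), and it does not handle ambiguous bases such as N.

```python
from itertools import product

# Codon table in the conventional TCAG order. Assumption: table 11 shares
# these amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
head = "ATGAAGAAAC ACACCTTTAA GCTCTCACCG"
print(translate(head))  # MKKHTFKLSP
```

Running `translate` on the full gene sequence should reproduce the protein sequence listed above once its spacing is removed, and `gc_content` on the same input can be compared with the reported GC content of 45%.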