Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A3524 |
Symbol | |
ID | 5802000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 3746071 |
End bp | 3748707 |
Gene Length | 2637 bp |
Protein Length | 878 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641341340 |
Product | hemagglutination repeat-containing protein |
Protein accession | YP_001607853 |
Protein GI | 162418354 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.20838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGTA TCCAAAAGTG TGACTGTAAT TATCTGGTCA GATTTGCAGT TTCAGATTCC TTTATCAGAG ACAATTCTTT TCGCAGCTTG CTTGCTATTT TTATGGTGAC GATATTTGTG CCAAATACTG CATTTTCTCA ACTCTATGTG AACAATGCCA ATGATCCTGG CTGTTATGTC ATTGCTGATG GGGGGTTTAC GGGTGTTACT AATAGTGATC AAAATTGTTA TACGTTGTCT TTGAATAATA TTAATACCAA TGGGGGGCAG CTATTTGTTG GTGGAAAAGG TGGGATCGCT GGGAAGCCTA GCTTTATCGC CACCCCTTGG ACAGGGACAT TCACGACGGC CGTTGGCACC AGTAATGTCG CCACAGCATA CGGCTTTGTC GTTCAAAGCA ATGGTGCATT TATCAACGGT GATACCTACG TGAAGGGGGG GTTATTTTTG AACGGTAGGA AAGCCACCAA TCTTGCCCCC GCAACGATTT CATCGACCTC CACCGATGCG GTGGTTGGTA GCCAGCTTTA TACGGTGATC CAAGATGGAA CCCGCTATTT CCACGCCAAC TCAGTGAACC CGCAAGACTC CGTGCCTGCG GGTCAGGATG CTATTGCTGT CGGGCCGGCG ACGGTGGTCA ATGGTAATAA CGGGATTGGT ATTGGTAGTA GCGCCGTTGT TGGGCCGAGT GCTGTCGGGG GCATTGCAAT TGGCCCCAAC ACTCAGGCGA CCGGTATCGC CAGCACGGCC CTTGGTGCCG GGTCGCAAGC GCATGGATCA CAGTCTTTGG CATTGGGGGC GGGAGCAACT GCCAGCCAGG CAAACAGTAT CGCGTTAGGG GCGTCGTCGG TCACCACGGT CGGTGCTGAG AGCGACTACA GTGCGTACGG ACTGACGGCT CCCCAAACGT CGGTGGGCGA GGTGGGGATG GGCACGGCAC AGGGGAATCG CAAGATCACC GGTGTGGCAG CCGGTTCGGC TGATTATGAT GTGGTCAATG TCGCGCAATT GACCGCTGTT GGTGACAAGG TCGAGCAGAA TACCGCCGAC ATTACCAGTT TGGGTGGCCG GGTCACCAAT GTTGAGGGGG GGATGACCCG TATCACCAAC GGGGGCGGTA TAAAGTACTT CCACACTCAC TCCACCGAGC CTGATTCGGT GGCCAGCGGC AGTGATTCGG TGGCGATCGG ACCGAATGCG CAGGCGTCCG GTACCACGTC GATAGCCATG GGGGCCGGGT CGACAGCGCA GGGAGCACAG TCTCTGGCAT TGGGGGCGGG AGCGGCTGCC AGCCAGGCAA ACAGTATTGC ATTAGGGGCG TCGTCGGTCA CCACGGTCGG TGCTGAGAGC GACTACAGTG CGTACGGACT GACAGCTCCC CAAACGTCGG TGGGCGAGGT GGGGGTGGGC ACGGCACAGG GGAATCGCAA GATCACCGGT GTGGCAGCCG GTTCGGCTGA TTATGATGCG GTCAATGTCG CGCAATTGAC CGCTGTTGGT GACAAGGTCG ATCAGAATAC CGCTGACATC ACCAGCTTAG ACGGCCGGGT CACCAATGTT GAGGGGGAGA TGGCCAGCAT CACCAACGGG GGCGGCGTGA AATACTTCCA CACCCACTCC ACCGAGTCTG ACTCGGTGGC CAGCGGCAGT GATTCGGTGG CGATCGGACC GAATGCGCAG GCGTCAGGTA CGGCTTCGGT GGCCTCCGGC AAGGGTACGC TGGCCTCCGG TAACGGTGCG GTGGCGATAG GTGATGCAGC AAGCGTCAGC GCAGAGGGCA GTGTTGCCCT GGGGCAGGGT TCCGCTGACA ACGGGCGCGG TGCAGAGAGC TACACCGGCA AGTACTCCAC TACGGATAAC ACCACCTCAG GTACGGTGTC GGTGGGCAAT GCGGCAACCG GAGAGACCCG GACGGTCAGC AACGTTGCCG ACGGGCGAGA GGCCATGGAT GCAGTCAATC TGCGGCAACT CGATGGTGCA ATGGCGGCGG TGGGTGACAC CGTATCAGGG TTGCAGAACG GCACTGACGG GATGTTCCAG GTGAACAACA ACAGCGGTCA GGCCAAGCCT TCGGTCACCG GAACTGATGC GATGGCGGGG GGGGGGGGGG CAGGCTCCGT GGCGTCTGGC AGCCACAGTA CCGCGATGGG TACGGGCAGC AAGGCGACGG CGGCAAACAG CACCGCGCTG GGGGCCAACT CAGTGGCGGA TCGTGAAAAC AGTGTCTCGG TGGGGTCAGT GGGTAATGAA CGGCAGCTCA CTAATATTGC TGTGGGGACT CAGGGCACTG ATGCGGTGAA TCTGGATCAA CTCAACCATA GCATGTCGAA TGTCACCAAC GACGCCAATG CTTATACAGA CCAGCGCTAT TCTGCACTTA AAGAAGATCT GAAAAAACAG GATAGTACGT TAAGTGCGGG GATCGCCGGT GCCATGGCGA TGGCGAGCCT GACTCAACCC TATACGCCGG GTGCCAGCAT GGCGACCATT GGTGCGGCCA GCTATCGGGG CCAGTCGGCG CTGTCGGTGG GGGTGTCGAG TATTTCTGAC AGTGGGCGAT GGGTCAGCAA ATTGCAGGCC TCCTCTAATA CACAAGGCGA TATGGGGGTT GGTGTCGGCG TCGGTTATCA ATGGTAA
|
Protein sequence | MKSIQKCDCN YLVRFAVSDS FIRDNSFRSL LAIFMVTIFV PNTAFSQLYV NNANDPGCYV IADGGFTGVT NSDQNCYTLS LNNINTNGGQ LFVGGKGGIA GKPSFIATPW TGTFTTAVGT SNVATAYGFV VQSNGAFING DTYVKGGLFL NGRKATNLAP ATISSTSTDA VVGSQLYTVI QDGTRYFHAN SVNPQDSVPA GQDAIAVGPA TVVNGNNGIG IGSSAVVGPS AVGGIAIGPN TQATGIASTA LGAGSQAHGS QSLALGAGAT ASQANSIALG ASSVTTVGAE SDYSAYGLTA PQTSVGEVGM GTAQGNRKIT GVAAGSADYD VVNVAQLTAV GDKVEQNTAD ITSLGGRVTN VEGGMTRITN GGGIKYFHTH STEPDSVASG SDSVAIGPNA QASGTTSIAM GAGSTAQGAQ SLALGAGAAA SQANSIALGA SSVTTVGAES DYSAYGLTAP QTSVGEVGVG TAQGNRKITG VAAGSADYDA VNVAQLTAVG DKVDQNTADI TSLDGRVTNV EGEMASITNG GGVKYFHTHS TESDSVASGS DSVAIGPNAQ ASGTASVASG KGTLASGNGA VAIGDAASVS AEGSVALGQG SADNGRGAES YTGKYSTTDN TTSGTVSVGN AATGETRTVS NVADGREAMD AVNLRQLDGA MAAVGDTVSG LQNGTDGMFQ VNNNSGQAKP SVTGTDAMAG GGGAGSVASG SHSTAMGTGS KATAANSTAL GANSVADREN SVSVGSVGNE RQLTNIAVGT QGTDAVNLDQ LNHSMSNVTN DANAYTDQRY SALKEDLKKQ DSTLSAGIAG AMAMASLTQP YTPGASMATI GAASYRGQSA LSVGVSSISD SGRWVSKLQA SSNTQGDMGV GVGVGYQW
|
| |