Gene Information |
Locus tag | YpAngola_A1189 |
Symbol | |
ID | 5799654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 1232691 |
End bp | 1235549 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641339161 |
Product | RHS repeat family protein |
Protein accession | YP_001605731 |
Protein GI | 162419579 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | |
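The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, although it reproduces the listed 2859 bp) and that the protein length excludes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for YpAngola_A1189.
length = gene_length_bp(1232691, 1235549)
print(length)                     # 2859
print(protein_length_aa(length))  # 952
```

Both results match the Gene Length and Protein Length rows, so the record's coordinates, gene length, and protein length agree under these assumptions.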
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.667138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0000780044 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCACAA GTTTACCAAC CCAGCTTTGC GCCAACACCC CCGCCCTCAC CATCCACGAT AACCGGGGGT TAGCTATTCG TACACTGGCT TATAACCGCC GCGATCATAA TGAAACCGTT GACGAACTGA TCAGCCGCAA CCGCTATAAC GCCTCCGGTC AGCTAATCGC CAGCCGTGAT CCGCGCCTTG AGGTGGATAA TTTCCGCTAT CAATACAGCC TCAGCGGTGT TCCACTGCGC ACCGACAGCG TCGATAGCGG CAGTACACTG CAACTGGCAG ATAGTGCTGG CCGCACGGTG CTCACGCTCG ATGCACACCA CACCCGCCGC TGGGTGGAGT ATGAGACCGG TGAACACAGT TTAGGCCGCC CGCTAAGTTA CCACGAGCAA GCCAAAGGCG GCCTGAAAAC GGTTACCGAC CGCTTTTTCT ATGCCACAAA CAGCGAGCAG GATAAAAACT GCAACCTGAA CGGCCAGTGT GTACGCCATT ACGACAGCGC TGGTTTGCAG GCACTGATTA GCCAGTCGAT TATTGGCGTA CCACTGCAAC AACAGCGCCG TCTACTGACG AATCCCAAAG GCCCAGTTGA CTGGTTTGGC GAAAAGGAAA ACTGGGGCGC TCGCCTGAGC GAACAGCCGT TTGTTAGCCA TAGCACCACC GATGCTCTCG GCCAGTTACT CACGCAAACC GATGCCAAAG GCCATATCCA GCGCATGGCC TATAACCGCG CCGGACAACT TATCGGTTCG TGGCTAACAA TAAAAAATAG CGCTGAACAG GTGATCCTCC GTTCACTCAC CTATTCGGCT GCTGGGCAAA AACTGCGCGA AGAGAGCGGC AACGGGGTGA TTACCGAATA CCGTTATGAA CCCCAGACTC AGCGCTTAAT CGGCATTAAA ACCACCCGTC CGGCGAAGAA AGACCGCCCG ACCCGGTTAC AAGACCTGCG TTACGATTAT GACCCGGTCG GGAATATTCT CGCCATCCAT AATGACGCCG AAACCACCCG CTTCTACCGT AATCAGAAAA TCGTGCCGGA AACCACTTAC CGCTACGATG CACTGTATCA GCTTATCGAG GCCACTGGCC GTGAAGCGGA TACCAACGGC ATACAAAACA GCCAGTTACC CGCGTTGGCG TCACTGAACG ACAGCAACCA GTTCGTCAAC TACACCCGCA GCTACCACTA TGACCGCGCC GGTAACCTGC TAAAAATTCA GCATACCGGT GCCAGCCAAT ACAGTACCCA TATCACGGTG TCCGATTCGT CCAATCACGG CATTCAGCAA CAAGATGGCA TCATCGCCCG TGATATTCGC TCCCAGTTTG ATGCGGCGGG TAATCAGCAA CAACTGCAAC CCGGTCAGCC CCTGCGCTGG AACAGCCGCA ATCAGTTACA GCAGGTGGAA CCTGTGCCCC GCAACGATGG CATCAGTGAC AGCGAAAGTT ATCTCTATGA TGGCAGCGGT AGGCGGGTGG CCAAAATCAG TCTCCATAAA ACCCATAACG CCATCCAAAC CCGTTCGGTC ATTTATTTAG CGGGACTGGA ACTGCGTGGC CAACATAATG ACAATAATCT GACAGAAAGT TTTCAGGTGA TAACCGTGGG TGCTGCGGGC CGTGCTCAGG TACGGGTATT ACACTGGGAG AGCGGCCAAC CCGTTGATAT CGTCAATGAC CAACTGCGTT ACAGTTTCGA TAATCACCTT GGCTCGGCGT TAATCGAATT AGACAGCGAT GGCGATATTA TCAGCCAGGA AGAATATTAC CCATTTGGCG GTACCGCGGT GTTAGCCTCC 
CGTAATACCG TGGAAGCCAA ATATAAAACC GTTCGTTATT CCGGTAAAGA GCGCGATGCC ACCGGGCTGT ATTATTATGG TTACCGTTAT TACCAACCGT GGCTGGGCCG ATGGTTAAGC GCCGACCCCG CAGGCACTAT AGACGGACTG AATTTATATC GGATGGTGAG GAATAACCCA ATCAGATGGC GTGATAACAA TGGGCTATTA ACCGAAGAGC AAATTAATAT GTACGTTAAT TTGTTTAGTA ATATTGGATT AAAAAATGAT GATGAATTAA AGAGTGAATT ATTAAAATAT GGTTTAAGTG AAGAAGAGCA AAACCAGATA TACCTTAATA TGTTAAGACC TATGCAGTCT GGATCATCAA GCTCATTATT CTCCTTCCCT TCTGAAAGTA GTTCAAGTTC TGGGAGTACG CAAAGTGTTG ATTCAGGTTA TCTCAGTCCA GTAAGAAACT ATCATTTTTT TGAAGATATT AAGTTAGCAA CAATGCACCG TCCCTATCCA AAAAAACAAG CCTCTAGTGA CACAATAACA TATTCAGCAG AAGATTTAAC AGAAGCTAGC CCTATAAAAA TTCTCATTGG TTTGGATTTG ACCAGTAAAA ACACCCAACC ATATAAGTCA GCGCTTGCCG AAAAAGGAAT TAAGTATATC ACTAAAGAAA AATATGAAAT AACAGACTTT TTTGAAGAAG GAGGATTATC GACTGAACAA ATAGATTTAA CAGTAAATAA AATATTAAAA TTACAAAAAA AGGATCTTGT AGGAATTCAT TGTGGGGCAG GTAATGGAAG AAGTGGAGTT ATTGCATCAG CATTATCCAT TAATAAACAG TATACAACAG ATAAAATAAA TAGTTTTGAC GTAACTCATT CATTAAGAGG GTCAATACTT AAAGACACAC AAACATACCA AGTGGATACG GTAACCGCCA AGGCGGTTGG GATTATCAGA GAAATAAATC CTAAAGCAGT GGAACGTAAT CAGGATGTTA TTTCCCTATA TAGATATTCT CATTTTTTAT ATACAAGAAA ACACACTACA TCATTATAA
|
Protein sequence | MSTSLPTQLC ANTPALTIHD NRGLAIRTLA YNRRDHNETV DELISRNRYN ASGQLIASRD PRLEVDNFRY QYSLSGVPLR TDSVDSGSTL QLADSAGRTV LTLDAHHTRR WVEYETGEHS LGRPLSYHEQ AKGGLKTVTD RFFYATNSEQ DKNCNLNGQC VRHYDSAGLQ ALISQSIIGV PLQQQRRLLT NPKGPVDWFG EKENWGARLS EQPFVSHSTT DALGQLLTQT DAKGHIQRMA YNRAGQLIGS WLTIKNSAEQ VILRSLTYSA AGQKLREESG NGVITEYRYE PQTQRLIGIK TTRPAKKDRP TRLQDLRYDY DPVGNILAIH NDAETTRFYR NQKIVPETTY RYDALYQLIE ATGREADTNG IQNSQLPALA SLNDSNQFVN YTRSYHYDRA GNLLKIQHTG ASQYSTHITV SDSSNHGIQQ QDGIIARDIR SQFDAAGNQQ QLQPGQPLRW NSRNQLQQVE PVPRNDGISD SESYLYDGSG RRVAKISLHK THNAIQTRSV IYLAGLELRG QHNDNNLTES FQVITVGAAG RAQVRVLHWE SGQPVDIVND QLRYSFDNHL GSALIELDSD GDIISQEEYY PFGGTAVLAS RNTVEAKYKT VRYSGKERDA TGLYYYGYRY YQPWLGRWLS ADPAGTIDGL NLYRMVRNNP IRWRDNNGLL TEEQINMYVN LFSNIGLKND DELKSELLKY GLSEEEQNQI YLNMLRPMQS GSSSSLFSFP SESSSSSGST QSVDSGYLSP VRNYHFFEDI KLATMHRPYP KKQASSDTIT YSAEDLTEAS PIKILIGLDL TSKNTQPYKS ALAEKGIKYI TKEKYEITDF FEEGGLSTEQ IDLTVNKILK LQKKDLVGIH CGAGNGRSGV IASALSINKQ YTTDKINSFD VTHSLRGSIL KDTQTYQVDT VTAKAVGIIR EINPKAVERN QDVISLYRYS HFLYTRKHTT SL
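The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction calculation of the kind that could be compared against the GC content row. The helpers `clean_sequence` and `gc_fraction` are hypothetical names introduced for illustration, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence, used as a small example.
example = "ATGTCCACAA GTTTACCAAC CCAGCTTTGC"
seq = clean_sequence(example)
print(len(seq))                    # 30
print(round(gc_fraction(seq), 3))  # 0.467
```

Applying the same two functions to the complete gene sequence should give a value close to the 47% listed in the record, provided the page's figure was computed over the same span.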