Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A1120 |
Symbol | |
ID | 5799584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 1156054 |
End bp | 1158123 |
Gene Length | 2070 bp |
Protein Length | 689 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641339095 |
Product | hypothetical protein |
Protein accession | YP_001605666 |
Protein GI | 162421317 |
COG category | [R] General function prediction only |
COG ID | [COG3107] Putative lipoprotein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAAT TACTGCTCGA TCGTGTTGAT TCATTGTTTC ATCCGAATTA CCGATTTAAT ATTGAGCATC TTGAAAAAAA CATCACTGGA TACAGTATGC TTTCCTCAAC GTTCGTTCGT TCTAAAGCAG GGCTTGTTCC TGTCATTCTG GCTGCTCTTA TTTTAGCGGC CTGTACAGGC GATGCGCCAC AAACGCCGCC CCCCGTAAAT ATACAGGACG AAGCAAGCGC TAACTCTGAT TATTATCTGC AACAGTTGCA GCAGAGCAGT GATGATAACA AGGCTGACTG GCAATTACTT GCCATTCGTG CCCTATTACG TGAGGCAAAA GTGCCTCAGG CCGCCGAACA ACTCAGCACT CTCCCTGCAA ACCTGAGCGA TACACAGCGC CAGGAACAGC AATTGCTGGC GGCTGAACTG TTGATCGCGC AGAAAAATAC GCCAGCGGCG GCTGATATTC TTGCCAAATT AGAGGCAACT CAACTCTCAG CTAACCAAAA AGTACGCTAC TATCAGGCCC AAATTGCCGC CAATCAGGAT AAAGCCACCC TGCCATTGAT TCGTGCATTT ATCGCTCAGG AACCATTACT GACAGATAAA GCCCATCAAG ATAATATTGA TGGCACTTGG CAGTCACTGT CCCAACTGAC ACCACAAGAA TTAAATACCA TGGTGATCAA CGCAGACGAA AATGTGCTGC AAGGCTGGCT GGATTTACTG CGTGTTTATC AAGATAACAA GCAAGACCCA AAGCTACTGA AAGCCGGGAT TAAAGACTGG CAAACCCGTT ACCCACAAAA CCCGGCAGCG AAAAATCTGC CAACTGCATT AACTCAGATC AGTAATTTCA GCCAGGCATC CACCGCCAAG ATTGCTCTGC TGCTGCCATT AAGTGGCCCG GCACAAGTAT TCGCCGATGC CATCCAGCAA GGTTTTACTG CCGCCCAAAA TGGCTCAGCG GTAACAGCTT CAGTACCAGT AACGCCAAAT GTGACGGAAA GCAGCCCAAC GGATACTGCT GCGGTTGTTT CGGATGATAC CCCGGCCACC CTTCCGGCCC CAGTGCCCCC CCCCGTCGTC ACCAACGCCC AAGTGAAAAT CTACGATACC AACACTCAAC CACTGGCAGC GCTATTGGCT CAAGCCCAGC AAGATGGTGC AACACTGGTC GTTGGCCCTC TGCTAAAACC CGAAGTTGAG CAACTCAGTG CCACCCCAAG CACATTGAAT ATTCTGGCGT TGAACCAACC AGAAGCCAGT AATAACAGCC CAAACATCTG TTACTTTGCC CTATCGCCAG AAGATGAAGC CCGTGATGCA GCGCATCACC TGTGGGAACA GCAAAAAAGA ATGCCGCTGT TGCTGGTGCC TCGTGGTGCC CTTGGTGAAC GCATTGCCAA AGCCTTCGCT GACGAGTGGC AAAAACAAGG TGGGCAAACG GTATTACAAC AGAACTTCGG TTCAACCACT GAGTTGAAGC AATCCATCAA CAGTGGTGCC GGTATCCGCC TGACCGGTAC CCCCGTTAGC GTTTCTAATG TAGCCGCCGC CCCGGCCTCC GTCACTATTG CGGGCCTGAC CATTCCAGCA CCGCCAATCG ATGCACCGGT AGTGTCAACG TCTTCGAGCG GTAACATTGA TGCGGTCTAT ATCATTGCGA CGCCATCTGA ATTAACCCTG ATTAAGCCAA TGATTGATAT GGCAACCAGT TCACGCAGTA AACCTGCGCT GTTTGCCAGT TCACGTAGCT ACCAGGCTGG CGCTGGCCCA GATTACCGTC TGGAAATGGA AGGTATACAG TTTAGTGATA TTCCGCTGAT GGCCGGCTCT AACCCCGCTT TGCTGCAACA AGCATCGGCT AAATACGCTA ACGATTATTC TCTGGTACGC TTATACGCCA TGGGGATTGA TGCCTGGGCA TTGGCAAATC ATTTTTCTGA AATGCGCCAA ATCCCTGGCT TCCAAGTCAA AGGGGTCACC GGTGATTTAA CTGCATCATC AGATTGTGTT ATCACCCGCA AGCTACCTTG GTTACAATAT CGCCAGGGAA TGGTGGTGCC ACTCGCATAA
|
Protein sequence | MQKLLLDRVD SLFHPNYRFN IEHLEKNITG YSMLSSTFVR SKAGLVPVIL AALILAACTG DAPQTPPPVN IQDEASANSD YYLQQLQQSS DDNKADWQLL AIRALLREAK VPQAAEQLST LPANLSDTQR QEQQLLAAEL LIAQKNTPAA ADILAKLEAT QLSANQKVRY YQAQIAANQD KATLPLIRAF IAQEPLLTDK AHQDNIDGTW QSLSQLTPQE LNTMVINADE NVLQGWLDLL RVYQDNKQDP KLLKAGIKDW QTRYPQNPAA KNLPTALTQI SNFSQASTAK IALLLPLSGP AQVFADAIQQ GFTAAQNGSA VTASVPVTPN VTESSPTDTA AVVSDDTPAT LPAPVPPPVV TNAQVKIYDT NTQPLAALLA QAQQDGATLV VGPLLKPEVE QLSATPSTLN ILALNQPEAS NNSPNICYFA LSPEDEARDA AHHLWEQQKR MPLLLVPRGA LGERIAKAFA DEWQKQGGQT VLQQNFGSTT ELKQSINSGA GIRLTGTPVS VSNVAAAPAS VTIAGLTIPA PPIDAPVVST SSSGNIDAVY IIATPSELTL IKPMIDMATS SRSKPALFAS SRSYQAGAGP DYRLEMEGIQ FSDIPLMAGS NPALLQQASA KYANDYSLVR LYAMGIDAWA LANHFSEMRQ IPGFQVKGVT GDLTASSDCV ITRKLPWLQY RQGMVVPLA
|
| |