Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A1994 |
Symbol | |
ID | 5800464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 2081872 |
End bp | 2083944 |
Gene Length | 2073 bp |
Protein Length | 690 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641339917 |
Product | invasin domain-containing protein |
Protein accession | YP_001606467 |
Protein GI | 162419445 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAGG ATATTTATTT TGGTAAAGAT AATCTGCAAC GCAACCCTTA TGCCGTGACT GCCGGGATCA ATTACACCCC CGTGCCTCTA CTCACTGTCG GGGTAGATCA GCGTATGGGG AAAAGCAGTA AGCATGAAAC ACAGTGGAAC CTCCAAATGA ACTATCGCCT GGGCGAGAGT TTTCAGTCGC AACTTAGCCC TTCAGCGGTG GCGGGAACAC GTCTACTGGC TGAGAGCCGC TATAACCTTG TCGATCGTAA CAATAATATC GTGTTGGAGT ATCAGAAACA GCAGGTGGTT AAACTGACAT TATCGCCAGC AACTATCTCC GGCCTGCCGG GTCAGGTTTA TCAGGTGAAC GCACAAGTAC AAGGGGCATC TGCTGTAAGG GAAATTGTCT GGAGTGATGC CGAACTGATT GCCGCTGGCG GCACATTAAC ACCACTGAGT ACCACACAAT TCAACTTGGT TTTACCGCCT TATAAACGCA CAGCACAAGT GAGTCGGGTA ACGGACGACC TGACAGCCAA CTTTTATTCG CTTAGTGCGC TCGCGGTTGA TCACCAAGGA AACAGATCTA ACTCATTCAC ATTGAGCGTC ACCGTTCAGC AGCCTCAGTT GACATTAACG GCGGCCGTCA TTGGTGATGG CGCACCGGCT AGTGGGAAAA CTGCAATCAC CGTTGAGTTC ACCGTTGCTG ATTTTGAGGG GAAACCCTTA GCCGGGCAGG AGGTGGTGAT AACCACCAAT AATGGTGCGC TACCGAATAA AATCACGGAA AAGACAGATG CAAATGGCGT CGCGCGCATT GCATTAACCA ATACGACAGA TGGCGTGACG GTAGTCACAG CAGAAGTGGA GGGGCAACGG CAAAGTGTTG ATACCCACTT TGTTAAGGGT ACTATCGCGG CGGATAAATC CACTCTGGCT GCGGTACCGA CATCTATCAT CGCTGATGGT CTAATGGCTT CAACCATCAC GTTGGAGTTG AAGGATACCT ATGGGGACCC GCAGGCTGGC GCGAATGTGG CTTTTGACAC AACCTTAGGC AATATGGGCG TTATCACGGA TCACAATGAC GGCACTTATA GCGCACCATT GACCAGTACC ACGTTGGGGG TAGCAACAGT AACGGTGAAA GTGGATGGGG CTGCGTTCAG TGTGCCGAGT GTGACGGTTA ATTTCACGGC AGATCCTATT CCAGATGCTG GCCGCTCCAG TTTCACCGTC TCCACACCGG ATATCTTGGC TGATGGCACG ATGAGTTCCA CATTATCCTT TGTCCCTGTC GATAAGAATG GCCATTTTAT CAGTGGGATG CAGGGCTTGA GTTTTACTCA AAACGGTGTG CCGGTGAGTA TTAGCCCCAT TACCGAGCAG CCAGATAGCT ATACCGCGAC GGTGGTTGGG AATACCGCCG GTGATGTCAC AATCACGCCT CAGGTTGATA CCCTGATACT GAGTACATTG CAGAAAAAAA TATCCCTATT CCCGGTACCT ACGCTGACCG GTATTCTGGT TAACGGGCAA AATTTCGCTA CGGATAAAGG GTTCCCGAAA ACGATCTTTA AAAACGCCAC ATTCCAGTTA CAGATGGATA ACGATGTTGC TAATAATACT CAGTATGAGT GGTCGTCGTC ATTCACACCC AATGTATCGG TTAACGATCA GGGTCAGGTG ACGATTACCT ACCAAACCTA TAGCGAAGTG GCTGTGACGG CGAAAAGTAA AAAATTCCCA AGTTATTCGG TGAGTTATCG GTTCTACCCA AATCGGTGGA TATACGATGG CGGCACTTCG CTGGTATCGA GTATCGAGGC CAGCAGACAA TGCCAAGGTT CAGATATGTC TGCGGTTCTT GAATCCTCAC GTGCAACCAA CGGAACGCGT GCGCCTGACG GGACATTGTG GGGCGAGTGG GGGAGCTTGA CCGCGTATAG TTCTGATTGG CAATCTGGCG AATATTGGGT CAAAAGGACC AGCACGGATT TTGAAACCAT GAATATGAAC ACTGGCGTGC TGCAACCAGG GCCTGCATAC TTGGCGTTCC CGCTCTGTGC GCTGTCAATA TAA
|
Protein sequence | MPEDIYFGKD NLQRNPYAVT AGINYTPVPL LTVGVDQRMG KSSKHETQWN LQMNYRLGES FQSQLSPSAV AGTRLLAESR YNLVDRNNNI VLEYQKQQVV KLTLSPATIS GLPGQVYQVN AQVQGASAVR EIVWSDAELI AAGGTLTPLS TTQFNLVLPP YKRTAQVSRV TDDLTANFYS LSALAVDHQG NRSNSFTLSV TVQQPQLTLT AAVIGDGAPA SGKTAITVEF TVADFEGKPL AGQEVVITTN NGALPNKITE KTDANGVARI ALTNTTDGVT VVTAEVEGQR QSVDTHFVKG TIAADKSTLA AVPTSIIADG LMASTITLEL KDTYGDPQAG ANVAFDTTLG NMGVITDHND GTYSAPLTST TLGVATVTVK VDGAAFSVPS VTVNFTADPI PDAGRSSFTV STPDILADGT MSSTLSFVPV DKNGHFISGM QGLSFTQNGV PVSISPITEQ PDSYTATVVG NTAGDVTITP QVDTLILSTL QKKISLFPVP TLTGILVNGQ NFATDKGFPK TIFKNATFQL QMDNDVANNT QYEWSSSFTP NVSVNDQGQV TITYQTYSEV AVTAKSKKFP SYSVSYRFYP NRWIYDGGTS LVSSIEASRQ CQGSDMSAVL ESSRATNGTR APDGTLWGEW GSLTAYSSDW QSGEYWVKRT STDFETMNMN TGVLQPGPAY LAFPLCALSI
|
| |