Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_0094 |
Symbol | |
ID | 5798462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010158 |
Strand | + |
Start bp | 67588 |
End bp | 72219 |
Gene Length | 4632 bp |
Protein Length | 1543 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641337989 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001604606 |
Protein GI | 162417852 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 225 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 218 |
Fosmid unclonability p-value | 0.000020422 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAACAGT TCAAGAAGAA GAGACTGCCT CTCCTGATTG CAGGTGCTGG CGGTAAGAAA AGCAGTGGCT CCAGCCGCAC ACCGGTTGAA GCCGACGATA CCGTAAACTC GCGTGCTATG GCGTCCATCC TCGACCTGCT CGGGGAAGGT GTCATTGGCG GGCTGGTGGA TGGTGCAAAG TCGATCTTCG TTGATGATCT GCCAATCCTT AACGAAGACG GGTCTTCAAA CTTTAGCGGT ATCACCTGGG ACTTCCGCGA TGGTTCACAA GACCAGACGC CGATGGCTGG GTTCGATTTC GTTGAAACGC CGAAGTCAGT CAACATCCAG TTGAAAAGAA TGCACGACGT TACGATTGCC ATCGATAACG ATGAGGCAGA CCGTGTCCGC GTCATTCTGA AGTTCCCGTC TCTGCGTAGC ATCGACAAAA AGACCGGTGA TACCAACGGT ACGACCGTGA AGTACAAATT CCAGATTGCC AATGGCGATA ATGCCTTCAA GGACGCCATC GCAGAAGGGG AGAGTGCTTC CGAAATTGCG CTGACGGCAA AAAAGACAGG CGTCTACTAC CGCAGCTATG AGCTAAAACT GCCCAAGCCA GGTCGTGCCT ACAAGGTTCG CGTGCTGCGT CTGACCGATG ACAGCAATAC TCAGTACATC TTTAACGATA CGTGGGTGGA CTCTATCGGT GAGATCGTCG ATACGCCGAT GAACTATCCG AACTCCGCGC TGGTTGGCCT CAAGGTCAAC TCAGAGCAGT TCGGCAGCTC GATGCCGTCT CGTTCGTATC TGGTGCGTGG CCTGAAGATC CGCGTACCGT CCAACTACGA TGAACACACA AACACCTATA TCGGCGTATG GGATGGCACA TTCAAGCTGT TGTCATCTTC CAACCCTGCC TGGATTCTCT TCGACCTGCT TACCAACGCT CGTTATGGCC TGGGGCAGTA CGTTTCTGAG TCCATGATTG ACCTCGGGCA GATCTACCAG ATTGGTCGCT ACTGTGACGA AGAAATTGAC AATGGATTCG GGGGCAAAGA GAAGCGCTTC GCTATCAACA CCCAGATCAC TAGCCGTCAG GACGCGTACC GACTGATTCA GGATATCGCT GGCGCCTTCC GCGGTATGGT CTTCTGGGCT GGTGGCATGG TTAACGTCAT GCAGGATAGT CCGTCAGATC CGGTCATGAT GTTCACCAAC GCGAACGTCA AAGACGGCAT GTTCAGCTAC AAGGGATCTG CGCGTAAAGA CCGTCCGTCA GTAGCTCTTG TGACCTACAA CAACAAGGAA GACGGCTACA AGCAGAACAT CGAGTACGTC GAAGACCAGG AGGCGATGCG TCGTTATGGC GAGCGTAAAA CCGAAGTGGT TGCGTTCGGC TGTACAAGCC GTGGCCAGGC GCATCGTGTC GGTCTGTGGC TGCTGTATAC CGCACGCATG GAGTCGGACG TTATCAGCTT TACGGCAGGG CTTGATGCTT CCTTCCTTAT GCCGGGTGAA ACGGTGCTGA TTCAGAACAA ATACCGTGCT GGTAAACGCA ACTCTGGCCG CATTGTGGCG TTCACAAAGA ACAGCATCAC TCTCGACGCA CCGGTTACGC TGAATAAAGC CGGTAGCTAC ATCCGGATCT TGAATCAGGA AGGCGAAATC GTTGAGCGCG ATATTCTTGA GACCGGGGAA GACATTACCA AAGTGACCTT CTCCAAAGCG CTCAATTCCG GTGATATGCC GGTGATGAAT GGCGTCTGGA CGATTACCGA GCCAGATCTG GAGCCAATGC GCGTGCGTGT TATCAACGTT GCCCAGGGCG AGGCTCAGGG GACGTTTAAC GTTACGGTTG TCCAGAATAA TGCATCGAAG TACGAAGCCA TCGACAACGG TGCGACGCTG ATCCCCGAGA ACAACACAGT TCTCGACCCG ACTTATTCGA AGCCGACTAA CCTGCAGGTG ACGGAAGGGA CGTATATCTC CAGTCCGGGT AACCTCTCAA TCAAGCTCGT AGCCACCTGG GAGGGTAAGT CTGCGGAATA TTGGATCAGC TGGCGTCGTT CCGATGAAAA CAACGTTTCT AACTGGCAGT CCGCACGCGT TACCGAAGAG CAGTTCGAGA TCCTCAATAT TGCTGAGAAT GGTCAATACG ACATTCAGCT CTATGCGGTT TCGTTCAGCG GCAAGAAAAC GGACATCATC AGCACCGTTT ATCAGGTGAA AGGTACGATG ACGCCGCCAG GCTCTCCTAC CTCTCTGACG GCCGTTGGTG ACTACCGCAA CGTGATTCTG AATTGGGTCA ACCCGGACTC AATCGACCTT GATCACATCA ACGTGTATGC CTCCCAGACC AACGATCTGG AAACGGCGAA GTTGGTTGCA GAGGCCGCCA GCACCACGTT CACTCATGCC GGTCTGGGAG ATAGTGAGAC CTGGTACTAT TGGGTTCGCG CGGTGAACAA GCGTGGCATG TTAAGTCCGC CGAACTCCAA TCTGGGTACG GAAGCGATGA CGCGAGACGT CCTCTCGTTC CTTACCGGGA AGATCACCTC TTCCGAGCTG GGGCAGGAGC TGCTGGAGGA AATCGACGCT AAAGCCTCTC AGGATGCGGT CGACGCCATC AACAAACAGA TGGAAGAGAG TCTGAAAGAG CTTGATCAGT CCGTTGCCGA TCTGGACAGC AAACTGGAAG ACACCAGCGG TCGGCTTGAG CAGGTGCAGA ACGACCTCAA AAATGAAGTC TCTGGCACGC TGGACAAGGT CAACGACGCG CTGCAACAAG TTGAGGACTC TAATGCGGCT CTGGTCGAGT TGCAGGAAAC CGTTTCCGAG CAGGGCAAAG CCATAGCTGG CGCTGTGGAA GCGGCGCACG CTGCGCTCGA CAACGCCTCC GCGCTGATTG CTGAAGAGCG TGAAGCCCGT GTCGAAGGTG ATAAGGCAAA TGCCAAACAG ATTGAGGCAA TGAAATCCTC CGTCGATGAC AGTGTTGCCG CCGTCGAAGA GATGAAAAAG ACCGTTGCCG AAGTCGAACG CGCCAGCGCG GAAGCGTCGA CCAATATCGA GGCTCTGGCC AAAACCAATA TTGACCTCGC TCTGCGTCAG GATGAAGACC AGCACAAGCA GATGGTCAAT AATGCGAAGA TCGCAACCAC TCAGAAGACG TTTGCCGACG ATATGTCTGC AATGGCCTCA AAAGTGGAAG AAATCCGCGC AGAAATTGGT GAGGACATCC GGGCGTCGAT TCTGGAAGAG ACAACGGCTC GCGTAGAGGC TGACAAGACA ATTGCGACGC ATATCTCCAA GCTGGAAGCC CAGCTCAACG ACGATATTTC AGCGGCAATC GTTTCCGAGC AAGAGGCGCG TGCGACTGCG GATGAAACGC TTTCTCGTCA GATCACCACG TTGCAGGCGA AAGTTGAAGG TGATATCAGC GCTGCACTTA CTGAAGAGCA GATTGCCCGA GCCACAGCGG ATGAGGCGCT ATCGAAGCAA ATTACCCAAC TGAAGGCACA GAATGGTGAG GATATCAAAG CCGCCGTTGC AGAAGAGACC CAGGCTCGAA CCGATGCAGA TGGTGCTCTG GCTTCGCAGA TCAGCTCGCT GAAGGCTCAG ACGGCAGAGG ACATCAAGGC CGCTGTCGAC ACAGAGACGA AAGCGCGTAC CGATGCCGAC TCTGCTCTGG CCGGGCAGAT CACCAATCTC CAGGCTCAGA CCGGCAAAGA TATCAACGCT GCTATCACAT CCGAAGCCAC CGCGCGTGCA AACGCTGACG GTGCTCTCGG TAAGAGAATT GATACGGTTA AGGCTGAAGT TGATGGCAAC TCGGCTCTCA TTCAGCAGCA AGCGAAGGCG ATTGCCGATA CCGATAAGAA GGTTTCTGCT GCCTGGACGC TGAAGATGGA AACATCTACC AGCGGCGGGC AGAAGTACGT TGCAGGTATC GCGCTGGGTA TCGACAGTAC CGGTTTATCA CAATTTTTGG TGCAGGCAGA CCGTTTTGGC CTGGTCAACT CCGTAAACGG GAAGATCACT ACGCCATTTG TCATCGAAAA CAGCGTGGCG TATATGAACG GCGCTTATAT CAAAGACGGC ACAATTACGA ACGCCAAAAT TGGTAATGTC ATTCAGTCGA ACGATTACGC CGCAGGCAGT AGAGGCTGGA TTATCCCCAA AGATGGTAGC CCTGAGTTCA ACAACGGTAC GTTCAGGGGA AATATTGCTG CAAACTCCGG CACGCTGAAT AACGTCACCA TCGCGCAGAA CTGCCAGATT CTGGGGAAAC TGCACGCGAA CCAGATTGAT GGCGATATTG TTAAAGCCTA CATGGTTAAT GGCAGCAGTA TTTATATTGC ACCTCAAACA TTCGCCAGAA TTATCTATGT GGTAAATGGT TACTACTATA ACAAGCCATC GGAGGATATT AACACCTACT CATGGTCAAG AATTACCGAG TATACGGTAA ATGGAGTGAA GCAGCAGATA TATGGAATGA GAGAAGGCTC TAAAAATCAA TCTGGCTTGT TTGGTTATTA CAATTTGCCG GCAGGTCAGT CTGCGACTGT TGATGTTTAT ACCTGGCATA GACAGCGTAA ATACGATCAC CGCGTGAATG AACCTTATCT CATTCTGGTG TTTAAGGCTT AA
|
Protein sequence | MEQFKKKRLP LLIAGAGGKK SSGSSRTPVE ADDTVNSRAM ASILDLLGEG VIGGLVDGAK SIFVDDLPIL NEDGSSNFSG ITWDFRDGSQ DQTPMAGFDF VETPKSVNIQ LKRMHDVTIA IDNDEADRVR VILKFPSLRS IDKKTGDTNG TTVKYKFQIA NGDNAFKDAI AEGESASEIA LTAKKTGVYY RSYELKLPKP GRAYKVRVLR LTDDSNTQYI FNDTWVDSIG EIVDTPMNYP NSALVGLKVN SEQFGSSMPS RSYLVRGLKI RVPSNYDEHT NTYIGVWDGT FKLLSSSNPA WILFDLLTNA RYGLGQYVSE SMIDLGQIYQ IGRYCDEEID NGFGGKEKRF AINTQITSRQ DAYRLIQDIA GAFRGMVFWA GGMVNVMQDS PSDPVMMFTN ANVKDGMFSY KGSARKDRPS VALVTYNNKE DGYKQNIEYV EDQEAMRRYG ERKTEVVAFG CTSRGQAHRV GLWLLYTARM ESDVISFTAG LDASFLMPGE TVLIQNKYRA GKRNSGRIVA FTKNSITLDA PVTLNKAGSY IRILNQEGEI VERDILETGE DITKVTFSKA LNSGDMPVMN GVWTITEPDL EPMRVRVINV AQGEAQGTFN VTVVQNNASK YEAIDNGATL IPENNTVLDP TYSKPTNLQV TEGTYISSPG NLSIKLVATW EGKSAEYWIS WRRSDENNVS NWQSARVTEE QFEILNIAEN GQYDIQLYAV SFSGKKTDII STVYQVKGTM TPPGSPTSLT AVGDYRNVIL NWVNPDSIDL DHINVYASQT NDLETAKLVA EAASTTFTHA GLGDSETWYY WVRAVNKRGM LSPPNSNLGT EAMTRDVLSF LTGKITSSEL GQELLEEIDA KASQDAVDAI NKQMEESLKE LDQSVADLDS KLEDTSGRLE QVQNDLKNEV SGTLDKVNDA LQQVEDSNAA LVELQETVSE QGKAIAGAVE AAHAALDNAS ALIAEEREAR VEGDKANAKQ IEAMKSSVDD SVAAVEEMKK TVAEVERASA EASTNIEALA KTNIDLALRQ DEDQHKQMVN NAKIATTQKT FADDMSAMAS KVEEIRAEIG EDIRASILEE TTARVEADKT IATHISKLEA QLNDDISAAI VSEQEARATA DETLSRQITT LQAKVEGDIS AALTEEQIAR ATADEALSKQ ITQLKAQNGE DIKAAVAEET QARTDADGAL ASQISSLKAQ TAEDIKAAVD TETKARTDAD SALAGQITNL QAQTGKDINA AITSEATARA NADGALGKRI DTVKAEVDGN SALIQQQAKA IADTDKKVSA AWTLKMETST SGGQKYVAGI ALGIDSTGLS QFLVQADRFG LVNSVNGKIT TPFVIENSVA YMNGAYIKDG TITNAKIGNV IQSNDYAAGS RGWIIPKDGS PEFNNGTFRG NIAANSGTLN NVTIAQNCQI LGKLHANQID GDIVKAYMVN GSSIYIAPQT FARIIYVVNG YYYNKPSEDI NTYSWSRITE YTVNGVKQQI YGMREGSKNQ SGLFGYYNLP AGQSATVDVY TWHRQRKYDH RVNEPYLILV FKA
|
| |