Gene YpAngola_0094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_0094 
Symbol 
ID5798462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010158 
Strand
Start bp67588 
End bp72219 
Gene Length4632 bp 
Protein Length1543 aa 
Translation table11 
GC content53% 
IMG OID641337989 
Productfibronectin type III domain-containing protein 
Protein accessionYP_001604606 
Protein GI162417852 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones225 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones218 
Fosmid unclonability p-value0.000020422 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACAGT TCAAGAAGAA GAGACTGCCT CTCCTGATTG CAGGTGCTGG CGGTAAGAAA 
AGCAGTGGCT CCAGCCGCAC ACCGGTTGAA GCCGACGATA CCGTAAACTC GCGTGCTATG
GCGTCCATCC TCGACCTGCT CGGGGAAGGT GTCATTGGCG GGCTGGTGGA TGGTGCAAAG
TCGATCTTCG TTGATGATCT GCCAATCCTT AACGAAGACG GGTCTTCAAA CTTTAGCGGT
ATCACCTGGG ACTTCCGCGA TGGTTCACAA GACCAGACGC CGATGGCTGG GTTCGATTTC
GTTGAAACGC CGAAGTCAGT CAACATCCAG TTGAAAAGAA TGCACGACGT TACGATTGCC
ATCGATAACG ATGAGGCAGA CCGTGTCCGC GTCATTCTGA AGTTCCCGTC TCTGCGTAGC
ATCGACAAAA AGACCGGTGA TACCAACGGT ACGACCGTGA AGTACAAATT CCAGATTGCC
AATGGCGATA ATGCCTTCAA GGACGCCATC GCAGAAGGGG AGAGTGCTTC CGAAATTGCG
CTGACGGCAA AAAAGACAGG CGTCTACTAC CGCAGCTATG AGCTAAAACT GCCCAAGCCA
GGTCGTGCCT ACAAGGTTCG CGTGCTGCGT CTGACCGATG ACAGCAATAC TCAGTACATC
TTTAACGATA CGTGGGTGGA CTCTATCGGT GAGATCGTCG ATACGCCGAT GAACTATCCG
AACTCCGCGC TGGTTGGCCT CAAGGTCAAC TCAGAGCAGT TCGGCAGCTC GATGCCGTCT
CGTTCGTATC TGGTGCGTGG CCTGAAGATC CGCGTACCGT CCAACTACGA TGAACACACA
AACACCTATA TCGGCGTATG GGATGGCACA TTCAAGCTGT TGTCATCTTC CAACCCTGCC
TGGATTCTCT TCGACCTGCT TACCAACGCT CGTTATGGCC TGGGGCAGTA CGTTTCTGAG
TCCATGATTG ACCTCGGGCA GATCTACCAG ATTGGTCGCT ACTGTGACGA AGAAATTGAC
AATGGATTCG GGGGCAAAGA GAAGCGCTTC GCTATCAACA CCCAGATCAC TAGCCGTCAG
GACGCGTACC GACTGATTCA GGATATCGCT GGCGCCTTCC GCGGTATGGT CTTCTGGGCT
GGTGGCATGG TTAACGTCAT GCAGGATAGT CCGTCAGATC CGGTCATGAT GTTCACCAAC
GCGAACGTCA AAGACGGCAT GTTCAGCTAC AAGGGATCTG CGCGTAAAGA CCGTCCGTCA
GTAGCTCTTG TGACCTACAA CAACAAGGAA GACGGCTACA AGCAGAACAT CGAGTACGTC
GAAGACCAGG AGGCGATGCG TCGTTATGGC GAGCGTAAAA CCGAAGTGGT TGCGTTCGGC
TGTACAAGCC GTGGCCAGGC GCATCGTGTC GGTCTGTGGC TGCTGTATAC CGCACGCATG
GAGTCGGACG TTATCAGCTT TACGGCAGGG CTTGATGCTT CCTTCCTTAT GCCGGGTGAA
ACGGTGCTGA TTCAGAACAA ATACCGTGCT GGTAAACGCA ACTCTGGCCG CATTGTGGCG
TTCACAAAGA ACAGCATCAC TCTCGACGCA CCGGTTACGC TGAATAAAGC CGGTAGCTAC
ATCCGGATCT TGAATCAGGA AGGCGAAATC GTTGAGCGCG ATATTCTTGA GACCGGGGAA
GACATTACCA AAGTGACCTT CTCCAAAGCG CTCAATTCCG GTGATATGCC GGTGATGAAT
GGCGTCTGGA CGATTACCGA GCCAGATCTG GAGCCAATGC GCGTGCGTGT TATCAACGTT
GCCCAGGGCG AGGCTCAGGG GACGTTTAAC GTTACGGTTG TCCAGAATAA TGCATCGAAG
TACGAAGCCA TCGACAACGG TGCGACGCTG ATCCCCGAGA ACAACACAGT TCTCGACCCG
ACTTATTCGA AGCCGACTAA CCTGCAGGTG ACGGAAGGGA CGTATATCTC CAGTCCGGGT
AACCTCTCAA TCAAGCTCGT AGCCACCTGG GAGGGTAAGT CTGCGGAATA TTGGATCAGC
TGGCGTCGTT CCGATGAAAA CAACGTTTCT AACTGGCAGT CCGCACGCGT TACCGAAGAG
CAGTTCGAGA TCCTCAATAT TGCTGAGAAT GGTCAATACG ACATTCAGCT CTATGCGGTT
TCGTTCAGCG GCAAGAAAAC GGACATCATC AGCACCGTTT ATCAGGTGAA AGGTACGATG
ACGCCGCCAG GCTCTCCTAC CTCTCTGACG GCCGTTGGTG ACTACCGCAA CGTGATTCTG
AATTGGGTCA ACCCGGACTC AATCGACCTT GATCACATCA ACGTGTATGC CTCCCAGACC
AACGATCTGG AAACGGCGAA GTTGGTTGCA GAGGCCGCCA GCACCACGTT CACTCATGCC
GGTCTGGGAG ATAGTGAGAC CTGGTACTAT TGGGTTCGCG CGGTGAACAA GCGTGGCATG
TTAAGTCCGC CGAACTCCAA TCTGGGTACG GAAGCGATGA CGCGAGACGT CCTCTCGTTC
CTTACCGGGA AGATCACCTC TTCCGAGCTG GGGCAGGAGC TGCTGGAGGA AATCGACGCT
AAAGCCTCTC AGGATGCGGT CGACGCCATC AACAAACAGA TGGAAGAGAG TCTGAAAGAG
CTTGATCAGT CCGTTGCCGA TCTGGACAGC AAACTGGAAG ACACCAGCGG TCGGCTTGAG
CAGGTGCAGA ACGACCTCAA AAATGAAGTC TCTGGCACGC TGGACAAGGT CAACGACGCG
CTGCAACAAG TTGAGGACTC TAATGCGGCT CTGGTCGAGT TGCAGGAAAC CGTTTCCGAG
CAGGGCAAAG CCATAGCTGG CGCTGTGGAA GCGGCGCACG CTGCGCTCGA CAACGCCTCC
GCGCTGATTG CTGAAGAGCG TGAAGCCCGT GTCGAAGGTG ATAAGGCAAA TGCCAAACAG
ATTGAGGCAA TGAAATCCTC CGTCGATGAC AGTGTTGCCG CCGTCGAAGA GATGAAAAAG
ACCGTTGCCG AAGTCGAACG CGCCAGCGCG GAAGCGTCGA CCAATATCGA GGCTCTGGCC
AAAACCAATA TTGACCTCGC TCTGCGTCAG GATGAAGACC AGCACAAGCA GATGGTCAAT
AATGCGAAGA TCGCAACCAC TCAGAAGACG TTTGCCGACG ATATGTCTGC AATGGCCTCA
AAAGTGGAAG AAATCCGCGC AGAAATTGGT GAGGACATCC GGGCGTCGAT TCTGGAAGAG
ACAACGGCTC GCGTAGAGGC TGACAAGACA ATTGCGACGC ATATCTCCAA GCTGGAAGCC
CAGCTCAACG ACGATATTTC AGCGGCAATC GTTTCCGAGC AAGAGGCGCG TGCGACTGCG
GATGAAACGC TTTCTCGTCA GATCACCACG TTGCAGGCGA AAGTTGAAGG TGATATCAGC
GCTGCACTTA CTGAAGAGCA GATTGCCCGA GCCACAGCGG ATGAGGCGCT ATCGAAGCAA
ATTACCCAAC TGAAGGCACA GAATGGTGAG GATATCAAAG CCGCCGTTGC AGAAGAGACC
CAGGCTCGAA CCGATGCAGA TGGTGCTCTG GCTTCGCAGA TCAGCTCGCT GAAGGCTCAG
ACGGCAGAGG ACATCAAGGC CGCTGTCGAC ACAGAGACGA AAGCGCGTAC CGATGCCGAC
TCTGCTCTGG CCGGGCAGAT CACCAATCTC CAGGCTCAGA CCGGCAAAGA TATCAACGCT
GCTATCACAT CCGAAGCCAC CGCGCGTGCA AACGCTGACG GTGCTCTCGG TAAGAGAATT
GATACGGTTA AGGCTGAAGT TGATGGCAAC TCGGCTCTCA TTCAGCAGCA AGCGAAGGCG
ATTGCCGATA CCGATAAGAA GGTTTCTGCT GCCTGGACGC TGAAGATGGA AACATCTACC
AGCGGCGGGC AGAAGTACGT TGCAGGTATC GCGCTGGGTA TCGACAGTAC CGGTTTATCA
CAATTTTTGG TGCAGGCAGA CCGTTTTGGC CTGGTCAACT CCGTAAACGG GAAGATCACT
ACGCCATTTG TCATCGAAAA CAGCGTGGCG TATATGAACG GCGCTTATAT CAAAGACGGC
ACAATTACGA ACGCCAAAAT TGGTAATGTC ATTCAGTCGA ACGATTACGC CGCAGGCAGT
AGAGGCTGGA TTATCCCCAA AGATGGTAGC CCTGAGTTCA ACAACGGTAC GTTCAGGGGA
AATATTGCTG CAAACTCCGG CACGCTGAAT AACGTCACCA TCGCGCAGAA CTGCCAGATT
CTGGGGAAAC TGCACGCGAA CCAGATTGAT GGCGATATTG TTAAAGCCTA CATGGTTAAT
GGCAGCAGTA TTTATATTGC ACCTCAAACA TTCGCCAGAA TTATCTATGT GGTAAATGGT
TACTACTATA ACAAGCCATC GGAGGATATT AACACCTACT CATGGTCAAG AATTACCGAG
TATACGGTAA ATGGAGTGAA GCAGCAGATA TATGGAATGA GAGAAGGCTC TAAAAATCAA
TCTGGCTTGT TTGGTTATTA CAATTTGCCG GCAGGTCAGT CTGCGACTGT TGATGTTTAT
ACCTGGCATA GACAGCGTAA ATACGATCAC CGCGTGAATG AACCTTATCT CATTCTGGTG
TTTAAGGCTT AA
 
Protein sequence
MEQFKKKRLP LLIAGAGGKK SSGSSRTPVE ADDTVNSRAM ASILDLLGEG VIGGLVDGAK 
SIFVDDLPIL NEDGSSNFSG ITWDFRDGSQ DQTPMAGFDF VETPKSVNIQ LKRMHDVTIA
IDNDEADRVR VILKFPSLRS IDKKTGDTNG TTVKYKFQIA NGDNAFKDAI AEGESASEIA
LTAKKTGVYY RSYELKLPKP GRAYKVRVLR LTDDSNTQYI FNDTWVDSIG EIVDTPMNYP
NSALVGLKVN SEQFGSSMPS RSYLVRGLKI RVPSNYDEHT NTYIGVWDGT FKLLSSSNPA
WILFDLLTNA RYGLGQYVSE SMIDLGQIYQ IGRYCDEEID NGFGGKEKRF AINTQITSRQ
DAYRLIQDIA GAFRGMVFWA GGMVNVMQDS PSDPVMMFTN ANVKDGMFSY KGSARKDRPS
VALVTYNNKE DGYKQNIEYV EDQEAMRRYG ERKTEVVAFG CTSRGQAHRV GLWLLYTARM
ESDVISFTAG LDASFLMPGE TVLIQNKYRA GKRNSGRIVA FTKNSITLDA PVTLNKAGSY
IRILNQEGEI VERDILETGE DITKVTFSKA LNSGDMPVMN GVWTITEPDL EPMRVRVINV
AQGEAQGTFN VTVVQNNASK YEAIDNGATL IPENNTVLDP TYSKPTNLQV TEGTYISSPG
NLSIKLVATW EGKSAEYWIS WRRSDENNVS NWQSARVTEE QFEILNIAEN GQYDIQLYAV
SFSGKKTDII STVYQVKGTM TPPGSPTSLT AVGDYRNVIL NWVNPDSIDL DHINVYASQT
NDLETAKLVA EAASTTFTHA GLGDSETWYY WVRAVNKRGM LSPPNSNLGT EAMTRDVLSF
LTGKITSSEL GQELLEEIDA KASQDAVDAI NKQMEESLKE LDQSVADLDS KLEDTSGRLE
QVQNDLKNEV SGTLDKVNDA LQQVEDSNAA LVELQETVSE QGKAIAGAVE AAHAALDNAS
ALIAEEREAR VEGDKANAKQ IEAMKSSVDD SVAAVEEMKK TVAEVERASA EASTNIEALA
KTNIDLALRQ DEDQHKQMVN NAKIATTQKT FADDMSAMAS KVEEIRAEIG EDIRASILEE
TTARVEADKT IATHISKLEA QLNDDISAAI VSEQEARATA DETLSRQITT LQAKVEGDIS
AALTEEQIAR ATADEALSKQ ITQLKAQNGE DIKAAVAEET QARTDADGAL ASQISSLKAQ
TAEDIKAAVD TETKARTDAD SALAGQITNL QAQTGKDINA AITSEATARA NADGALGKRI
DTVKAEVDGN SALIQQQAKA IADTDKKVSA AWTLKMETST SGGQKYVAGI ALGIDSTGLS
QFLVQADRFG LVNSVNGKIT TPFVIENSVA YMNGAYIKDG TITNAKIGNV IQSNDYAAGS
RGWIIPKDGS PEFNNGTFRG NIAANSGTLN NVTIAQNCQI LGKLHANQID GDIVKAYMVN
GSSIYIAPQT FARIIYVVNG YYYNKPSEDI NTYSWSRITE YTVNGVKQQI YGMREGSKNQ
SGLFGYYNLP AGQSATVDVY TWHRQRKYDH RVNEPYLILV FKA