Gene YpAngola_0096 details


Gene Information

Locus tag: YpAngola_0096
Symbol:
ID: 5798339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010158
Strand:
Start bp: 72935
End bp: 75823
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 50%
IMG OID: 641337990
Product: putative phage tail protein
Protein accession: YP_001604607
Protein GI: 162417871
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID:
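
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the stated gene length) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 72935
end_bp = 75823
gene_length_bp = 2889
protein_length_aa = 962

# Assumed convention: inclusive coordinates, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons; the last codon is
# taken to be the stop codon, which does not add a residue to the protein.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, residues, remainder)
```

If the fields are consistent, the span equals the gene length, the remainder is zero, and the residue count equals the protein length.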


Plasmid Coverage information

Num covering plasmid clones: 185
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 131
Fosmid unclonability p-value: 2.12465e-28
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTGGTACA GGGAAGGTAC TATCACATTT ACACAGGGAG GTGACGCACT TTCTGGCACT 
GGCACGTACT GGAATGTGAC CGCCAACGGC GTTCTGCCGG GCATGATCGT CATCGGCCCT
GACAACAAGT TGTACGAAAT TAAGCGCGTA ATTAGCGACA CAAGTCTGAT TCTCGCAGAG
CCGTACACAG GGGAGACCCA GAAGGAAGTT CCGTGCCGCA TCATCACAAC CTATGAGGGC
GACTTAACGC AGTTCAGCGC ACGTTTTACC GCGCTTATGA CCCGTATGTC AGCCGACTCG
AAGACGATGC GCAGCTGGTT GACGGCAGTT GATGAGGTAA CGCTTGAGCG TGAAGACGGT
ACGGAAGTGA CCGTGAAGTC GCTGACGCAG ATCGTCGATG AGCACAACGC AAACCAGAAA
TGGTATACGG ATAATGCAGA CGCTATTAAT GCGGCAGGCG AGAAGGCTCG TGAGGCCGCT
GAGCGCGCAT TAGCTGCGGC GCAAAGCTCT TCAGAAGCAA GAGCAAAAGC AGATGAAGCC
GCTCAAAGTT CAGCTTCAGC ATCTGAGTAT AAAACTGCGG CAGAGCTAAG TGCCGCTGCA
TCAAAAGCAT CGGAGCACGG CGCGGCAGAA AGCGCAGCTT CATCGAAGGC AAGTGCCTCT
GCGGCTAAAA CATCTGAAGA TAATTCTGCC GCGTCAGAGA CCAATGCGGC TGAAAGCAAG
GCTGCTGCGG CATTAAGTGC GTCTTCTTCG GCAAATAGCG CCTCAGAAGC ATTGCAATAC
GCGGAGTCAG CCAAGACCTC TAAAGAGGCT GCTGCTGCTT CAGAAGCAGC GGCAGCCAAT
AGTGAAAATG AAGCCAGAAC CTCAAAAGAT ACCGCTGTAG CGGCTGCGGC AGAGGCATCA
GCTAATGCCA CATCAGCTGA TGCCTCCAGA CATGATGTCG ATACCAATAA AGCCGAAGTA
TCGAGAATGA AAGATGAAGT TTTCGCTGCT AGGGACTCAA CGATTCAGTA TAGCGAAGAG
GCTAAAACAG CGGCTGATAC AGCGGCAAGA GAAGCAGCCA CGAAAACATC TGATCAGCTC
CTGTCGGCGG TTAAATCAGA GGCGGAAAAG GCAAACAGTG CTAGCGCAAG TGCTCAAGGT
TTTGCCGATG ACGCCAAGCG ATTTAGAGAC GAAGCTCAGG AAATAGCTGA AGGCAGCAAG
GTAAACGATG CAACAACCTC ACAGCAGGGG GTTGTTCAGT TGAGTAGTGC AACTGATAGC
GAGAGTGAAA CTCTTGCCTC TACACCAAAA GCTGTGAAAA CAGTCATGGA TGCCGTTGCT
CTAAAGGCTC CTATAGATAG CCCCGCGCTA TCTGGCGCAC CAACAGCTCC AACTCCGGCA
ATTACTGCTG CTGGACGTGA GATCGCGACA GCCGCGTTTG TGGCTTCAAA AGTAGCACAA
CTTGTTGGCT CAGCGCCTGA AGCACTGGAT ACGTTAAATG AGCTGGCTGC TGCGTTAGGC
AATGATCCAA ACTTTGCTAC GACTATTACG AACATGTTGG CTAGAAAGCA GCCTTTGGAT
GGAACACTAA CTGCGCTTTC TGGTCGTTCA CCTCAAGGGG TAATTGATTA TCTTGGCTTG
TTGAATACGG TTAACCTGGC GGCTGGCTCA ATTCAGAAAT CCCAGAATGG GGCAGATATT
CCTGACAAAA GATTATTTGT GAAAAATATA GGTGCAGTTA GCTCCGCCAG AATTTCGTTT
GTTAAGGAAT CCGGGTGGTA TAAGTTAGCG ACAGTAACAA TGCCTCAAGG AGCTTCAACC
GCTTTAATTA CTCTTATTGG AGGTGCTGGA TACAACGCGG GGCTTTATGA CCAGGCAGCG
ATAAGCGAAA TAGTGTTGCG ATCAGGGAAC TGGAATCCTG TTGGCATCAC AGCAACATTA
TGGCAACGCT CACCAGCAGG TGCTCAAGGG GTGGCGTGGA TAAATACATC AGGAGATGTT
TACGATATTT ATGTAAACGT TGGACAGTAC TCTATTGATG TTATTGCTCT GAGCGATTGT
ACAAATAATG CAAGCATAGT GTTGTTCGGC ACACCAGAGT ATGTGGCGAC CAAACCTGCA
AGTTCCACGA ACGGCGCAAA TTATATATTG TACAGTAGTG TTCTACCACC GCCTGAGTCA
TATCCAGTAG GTGCCCCTAT TCCGTGGCCG AACGATGTGG CCCCGTCTGG TTTCGCCATC
ATGCAAGGGC AGACGTTTGA CAAGAGTGTG TATCCGAAGC TGGCGGCCGC ATACCCATCA
GGTGTGTTAC CTGACATGCG TGGATGGATG ATTAAGGGTA AACCAACTTC TCGTGCAGTG
TTGTCACTGG AGCAAGATGG AATTAAGTCA CATGCGCACA ATGCAGCCGC TTCCAGTACA
GATCTTGGTA CAAAACCAAC CACAACATTT GATTACGGGA CAAAAACGTC CAGTGGCTTC
GATTATGGAA CGAAATCGTC TAACAGCACT GGTGCTCATG CACACTCGCT GTCTGGCTCT
ACATCGAGTT CAGGTGCCCA TGCGCATACG GTAACTGCTC ATACTCAGTA TCCAAGATCT
ACAGATTCGA GGAACCAGAA TGCTGTCGGT AAGCAATACA ACACACAGCA GACTACAGCC
AATGCTTTCA ATGTCTGGAC AAGTAGTGCA GGTGATCATG CCCACTCAAT CTCCGGTACT
GCTGTCAGTG CCGGTGCTCA TGCCCATACC GTTGGTATTG GCGCACATGC TCACTCATTG
AGTATTGGAT CACACTCGCA TTCAGTGGCA ATTGGGGCGC ACTCACACAC TATCACTATT
GCCGCTTGTG GTAATGCGGA GAACACCGTG AAAAACATTG CATATAACTA CATAGTGAGA
CTCGCATGA
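
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs cleaning before its length or GC content can be computed. The following is a minimal sketch; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the sequence are used here as sample input. How the database rounds its reported GC content is not stated in the record, so the sketch returns the raw fraction.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
head = clean_sequence(
    "ATGTGGTACA GGGAAGGTAC TATCACATTT ACACAGGGAG GTGACGCACT TTCTGGCACT"
)
tail = clean_sequence("CTCGCATGA")

print(len(head), len(tail), gc_fraction(tail))
```

Passing the full sequence text to `clean_sequence` in the same way allows the result to be compared with the Gene Length and GC content fields.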
 
Protein sequence
MWYREGTITF TQGGDALSGT GTYWNVTANG VLPGMIVIGP DNKLYEIKRV ISDTSLILAE 
PYTGETQKEV PCRIITTYEG DLTQFSARFT ALMTRMSADS KTMRSWLTAV DEVTLEREDG
TEVTVKSLTQ IVDEHNANQK WYTDNADAIN AAGEKAREAA ERALAAAQSS SEARAKADEA
AQSSASASEY KTAAELSAAA SKASEHGAAE SAASSKASAS AAKTSEDNSA ASETNAAESK
AAAALSASSS ANSASEALQY AESAKTSKEA AAASEAAAAN SENEARTSKD TAVAAAAEAS
ANATSADASR HDVDTNKAEV SRMKDEVFAA RDSTIQYSEE AKTAADTAAR EAATKTSDQL
LSAVKSEAEK ANSASASAQG FADDAKRFRD EAQEIAEGSK VNDATTSQQG VVQLSSATDS
ESETLASTPK AVKTVMDAVA LKAPIDSPAL SGAPTAPTPA ITAAGREIAT AAFVASKVAQ
LVGSAPEALD TLNELAAALG NDPNFATTIT NMLARKQPLD GTLTALSGRS PQGVIDYLGL
LNTVNLAAGS IQKSQNGADI PDKRLFVKNI GAVSSARISF VKESGWYKLA TVTMPQGAST
ALITLIGGAG YNAGLYDQAA ISEIVLRSGN WNPVGITATL WQRSPAGAQG VAWINTSGDV
YDIYVNVGQY SIDVIALSDC TNNASIVLFG TPEYVATKPA SSTNGANYIL YSSVLPPPES
YPVGAPIPWP NDVAPSGFAI MQGQTFDKSV YPKLAAAYPS GVLPDMRGWM IKGKPTSRAV
LSLEQDGIKS HAHNAAASST DLGTKPTTTF DYGTKTSSGF DYGTKSSNST GAHAHSLSGS
TSSSGAHAHT VTAHTQYPRS TDSRNQNAVG KQYNTQQTTA NAFNVWTSSA GDHAHSISGT
AVSAGAHAHT VGIGAHAHSL SIGSHSHSVA IGAHSHTITI AACGNAENTV KNIAYNYIVR
LA
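
The protein sequence can be checked against the gene sequence by translating codons with translation table 11, as listed in the record. The sketch below builds the codon table by hand rather than using a library; table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons. The function name `translate` is illustrative, and only the first three blocks and the last line of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
# with the third base varying fastest; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Start of the gene sequence (first three blocks) and its final line.
start = translate("ATGTGGTACA GGGAAGGTAC TATCACATTT")
end = translate("CTCGCATGA")

print(start, end)
```

The translated start matches the first block of the protein sequence, and the final line translates to the last two residues followed by a stop codon.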