Gene YpAngola_A0185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0185 
Symbol 
ID5798649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp196310 
End bp197650 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content50% 
IMG OID641338206 
Producthypothetical protein 
Protein accessionYP_001604812 
Protein GI162419886 
COG category[R] General function prediction only 
COG ID[COG4099] Predicted peptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000162284 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.000128495 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTAACAC GCCGGAAATT TTTGATGATG AGTGCCGGAG CAGGTCTTTT ACTCTCTGTA 
CCCATGTTGG CTCGAACCGG TGTTCAGCCT GCACAAGCTG CCACCGCCAT TACGCAAGTT
TTTGGTGACG GGATCCGGCT AACCGCTGTT GCGGTCGAAT ATCCGACAGA AGTCAGCGCT
GAAGGGCTAA ACCCCGCCGA CTTTCATGTT GAAGGGCGAA CAGTAACCGG TGTATGGACC
AGCACTTCTA CTAATCCGGC AGATATAGCG CCCTCAGGAC GCTATATGAT TATAGCGCTA
TCACCTGATG ACAAGAATGC AACGCTGGCC GAACAGGTGC AGCCAAATAG TAAAAACAAC
AGCAACAAAT CTGCCAATGG AAGAGGCGGC CCCGGTAATG CAGGCGATAT TCCTGCCTAT
GATACGGTTT ACCGGACAGC TCAAGCCACG GTACTGCGCC TTCCGTCAGT TCATACCGCC
AGTGGTGATA CGCTTCCCGC TAGCGAGAAA GCGTTGACAA CCCAATATGT GGAAAACTTG
ATCGTTGATG ATTTTCAGCA GCTTGAGTTT TATGATGAAA AAACAGGTAA AAAGCTGAAA
TACAACCTTT TCATCCCCAA AGACTATAGC CCTGATAAGG CTTGGCCGCT GGTGTTATTC
ATGCATGATG CTGGCGCCAC CAGCGATGTT ACACGCACCA CCCTGTATCA AGGCTTAGGC
GCTATTGCTT GGGCAAGCCC AGAAGATCAG GCACAGCGCC CCTGCTTTGT TCTTGCACCT
CAGTATGAAG AAATCATTGC CGATGATGAC TCAAAAACAT CTGACATGCT GGACACCACC
ATTGATCTTA TCAATGTACT TTCAGAGCAG TACAACATTG ATAAGAGCCG TATCTATGCC
ACAGGGCAGT CGGGTGGATG CATGATGACG ATAGCGATGA ACATCAAGTA TCCGGATTTC
TTCGCGGCCT CTTTTTTGGT TGCGGGTCAG TGGGATCCCG CGTTGGTGAA ACCTCTTGCC
CAGCAAAAAC TCTGGATTCT GGTTTCTCAG GATGATAACA AAGCCTGGCC AGGTCAGAAT
GCCATCATTG ATGTTCTGGA AAAAGAGGGT GTCCAAATCA GCCGTGCAAT ATGGGACGGA
ACATGGAATG AAGAGCAATT TCGTCAGGCT TTTGAACAAA TAGAGGCAGA AAAAAGCCCG
ATTAACTATG TGGCATTTCG TGAAGGCACC GTGATTCCTG AGGGGCAATC CACCGAAGGT
GCCAGCGGGC ATCGCAATAC CTGGCGAATT GCCTATACCA TCTCCCCCAT ACGCGAATGG
ATTTTCAGGC AACAGCGCTA G
 
Protein sequence
MLTRRKFLMM SAGAGLLLSV PMLARTGVQP AQAATAITQV FGDGIRLTAV AVEYPTEVSA 
EGLNPADFHV EGRTVTGVWT STSTNPADIA PSGRYMIIAL SPDDKNATLA EQVQPNSKNN
SNKSANGRGG PGNAGDIPAY DTVYRTAQAT VLRLPSVHTA SGDTLPASEK ALTTQYVENL
IVDDFQQLEF YDEKTGKKLK YNLFIPKDYS PDKAWPLVLF MHDAGATSDV TRTTLYQGLG
AIAWASPEDQ AQRPCFVLAP QYEEIIADDD SKTSDMLDTT IDLINVLSEQ YNIDKSRIYA
TGQSGGCMMT IAMNIKYPDF FAASFLVAGQ WDPALVKPLA QQKLWILVSQ DDNKAWPGQN
AIIDVLEKEG VQISRAIWDG TWNEEQFRQA FEQIEAEKSP INYVAFREGT VIPEGQSTEG
ASGHRNTWRI AYTISPIREW IFRQQR