Gene YpAngola_A1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1021 
SymbolcueO 
ID5799484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1043052 
End bp1044653 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content53% 
IMG OID641339010 
Productmulticopper oxidase 
Protein accessionYP_001605582 
Protein GI162421062 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCGCC GTGATTTTCT TAAGTTAACG GCCGCTCTTG GAGCTGCCAC ATCACTGCCT 
TTATGGAGCC GAGCGGCATT GGCCGCAGAT TTTTCCCCGT TACCCATTCC CCCTCTGCTC
CAACCGGATG CCAACGGTAA AATTAATCTG AATATTCAGA CGGGGAGTGT GGTCTGGTTA
CCTTCCACTG CGACGCAAAC CTGGGGCTAT AACGGTAATT TATTGGGTCC AGCGATTCGT
TTGCAGCGGG GTAAAGCGGT AACCATTGAT ATCACTAATG CTTTACCGGA AGCGACCACA
GTACATTGGC ATGGTTTGGA GATCCCCGGC GAGGTTGATG GTGGCCCACA GGCGTTGATT
CAGCCAGGGG CAAAGCGTCA GGTTACCTTC GCGGTGGAGC AACCCGCCGC AACGTGTTGG
TTCCATCCGC ACACTCACAG TAAAACGGGT CACCAAGTGG CGATGGGGTT AGGCGGGTTA
GTCCTGATTG ATGACAGCGA CAGTGAGACG CTGCCGTTGC CAAAACAGTG GGGCGTGGAC
GATATTCCCG TAATTTTGCA GGATAAGTTA CTCGATAAAC ATGGGCAGGT TGACTATCAG
CTTGATGTGA TGACCGCCGC AGTCGGCTGG TTTGGTGACC GGATGCTGAC TAACGGCGTT
CCTTATCCGC AACAAATTAC GCCACGTGGC TGGGTGCGAT TACGGCTACT TAATGGCTGT
AATGCCCGTT CGCTGAATCT GGCGCTTAGC GATGGCCGGC CAATGTATGT GATTGCCAGC
GACGGCGGAT TATTAGCCGA ACCCGTTGTG GTGCGTGAGT TACCGATATT GATGGGCGAA
CGTTTCGAAG TGCTGGTGGA TACCCGCGAT GGTCAGTCTC TCGATTTGGT CACCCTGCCC
GTTACGCAGA TGGGCATGAC CTTGGCCCCG TTTGATCAGC CGCTGCCCGT GCTACGGATC
CAACCCTCAC TGGCGATCGG CAGTCAGGTT TTGCCCGAGT CTCTCGTGGT GATCCCGGAA
TTAGCCGATG TCACTGGTGT GCAGGAGCGC TGGTTCCAAC TGATGATGGA TCCAAAGCTC
GATATGCTGG GGATGCAGGC CTTAGTGGCG CGTTATGGCA TGAAAGCCAT GGCCGGTATG
AATATGAATC ATGGTGACAT GGGGGCGATG GACCATGGCA ATAGACCAGA TATGAGCCAG
GGCAAAATGA AAGGCATGGA TCATGGCACA ATGAACGGTG CGCCAGCCTT TAATTTCAGT
CACGCGAATA GGATTAACGG TAAAGCTTTC TCGATGACCG AACCCGCGTT TGACGCGAAG
CAGGGCAAAT ATGAGAAATG GACCATTTCA GGTGAAGGCG ACATGATGCT ACATCCATTC
CATGTTCACG GCACACAGTT CCGTATTTTA ACGGAGAACG GCAAACCGCC AGCAGAGCAT
CGCCGGGGAT GGAAAGACAT AGTACGTGTT GAAGGCGCAC GCAGTGAAAT ATTGGTGCGC
TTTAATTATC TCGCCCCTGC GAGTACGCCT TATATGGCTC ACTGCCACTT ATTGGAACAT
GAAGATACTG GCATGATGCT GGGCTTTACC GTCAGCGCCT GA
 
Protein sequence
MHRRDFLKLT AALGAATSLP LWSRAALAAD FSPLPIPPLL QPDANGKINL NIQTGSVVWL 
PSTATQTWGY NGNLLGPAIR LQRGKAVTID ITNALPEATT VHWHGLEIPG EVDGGPQALI
QPGAKRQVTF AVEQPAATCW FHPHTHSKTG HQVAMGLGGL VLIDDSDSET LPLPKQWGVD
DIPVILQDKL LDKHGQVDYQ LDVMTAAVGW FGDRMLTNGV PYPQQITPRG WVRLRLLNGC
NARSLNLALS DGRPMYVIAS DGGLLAEPVV VRELPILMGE RFEVLVDTRD GQSLDLVTLP
VTQMGMTLAP FDQPLPVLRI QPSLAIGSQV LPESLVVIPE LADVTGVQER WFQLMMDPKL
DMLGMQALVA RYGMKAMAGM NMNHGDMGAM DHGNRPDMSQ GKMKGMDHGT MNGAPAFNFS
HANRINGKAF SMTEPAFDAK QGKYEKWTIS GEGDMMLHPF HVHGTQFRIL TENGKPPAEH
RRGWKDIVRV EGARSEILVR FNYLAPASTP YMAHCHLLEH EDTGMMLGFT VSA