Gene YpAngola_A0874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0874 
Symbol 
ID5799336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp899058 
End bp900212 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content52% 
IMG OID641338870 
Producthypothetical protein 
Protein accessionYP_001605447 
Protein GI162421735 
COG category[S] Function unknown 
COG ID[COG4924] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAC CCATTGATAT TGGCAGGCAA TTGGCCAGAC AGTGGCATCA GTCTAAGGTG 
AGGTCTGAGC GCTTACTCAC ACCCGGCTGC TGGCCTTTGC AACTGCCAAT AGGTAAGCCG
TCGGCAAAGA TTTTTACTGC AAATACGCAG GCAGTGCAGC GCCATGTCGA GGTCTGGCGT
CAGGTCAGTA TCGGTCTTGT TGAATGGGAA TCCGTTAGCT ACCGTGCCAG CCTCACGCCT
GTTTCAATTC CCGTTCGATG GTGTTTGCGC CCCCCCTCTG AGTGGATCAA TGCAGCATCG
AACCGGCAAG TCAGTCAGGA ATTTCAACAT CTTGAGCAAC TGGTAGCGCA GGTTGATAAA
TCCTTTCATG CCTTGCTGAT TAACCAACGG GCGCTTTGGC TGCATAAAGA TCCACATGAG
GTGATTACCG CGGCTAATCT TGCCTGCAAA TTGACCCCCG GCTGTGCTCA GGGGCGACCT
TTACGCCTGC TGTCCGAGTC TGGAGTAGAC ACGAAGTTCT TCGAAAGAAA CTATTCGTTG
CTGACAAAAC TACTTGATGA GCGTTTTGAG GGGGAGGCCT CTGGGCAGGG TCTGGCAACC
TTTCTCGATG CCTTCGAAGA AAGCAGCCAC TGGGTATTGG TCACTCCGTT GGCTCCCGGC
ATACTCCCCT TCAACATGAG TAAGGTCACA ACAACAGAAC TGACGAGCGT AGATTTACCA
TGCTCACGCA TTCTTATCGT AGAAAATGAA CACTGTATTC ACCAGTTACC GCAATTGCCT
GACACCATTG CCATTCTGGG CTCAGGTCTG GACTTGCAGT GGCTGATTTC TCCTAATTTA
AAGAGCAAGT CGATAGGTTA TTGGGGGGAT ATGGATACTT GGGGGCTATT GATGTTGGCG
CGGGCAAGAG CGTTCCAGCC AGCGATTACC GCACTTTTGA TGGGCCGCTT GTTATTTGAA
CACTACGCAT CAGGCTCAGC GGTCGCAGAG CCGGTCAACG CTCAACAGGC GGTTCCCGAA
GGGCTTCATC ACCACGAAGC AGACTTTTAC CGATACTTAT TGTCACAACC GCGGGGGCGT
CTTGAGCAGG AGTATTTACC ACGGGACGTG GTCGAAATGG CGCTTACTGA ATGGGTCGGT
CACATTTCTC TTTAA
 
Protein sequence
MKSPIDIGRQ LARQWHQSKV RSERLLTPGC WPLQLPIGKP SAKIFTANTQ AVQRHVEVWR 
QVSIGLVEWE SVSYRASLTP VSIPVRWCLR PPSEWINAAS NRQVSQEFQH LEQLVAQVDK
SFHALLINQR ALWLHKDPHE VITAANLACK LTPGCAQGRP LRLLSESGVD TKFFERNYSL
LTKLLDERFE GEASGQGLAT FLDAFEESSH WVLVTPLAPG ILPFNMSKVT TTELTSVDLP
CSRILIVENE HCIHQLPQLP DTIAILGSGL DLQWLISPNL KSKSIGYWGD MDTWGLLMLA
RARAFQPAIT ALLMGRLLFE HYASGSAVAE PVNAQQAVPE GLHHHEADFY RYLLSQPRGR
LEQEYLPRDV VEMALTEWVG HISL