Gene YpAngola_A2546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2546 
Symbol 
ID5801017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2663442 
End bp2664722 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content51% 
IMG OID641340416 
Producthypothetical protein 
Protein accessionYP_001606958 
Protein GI162419298 
COG category[S] Function unknown 
COG ID[COG4950] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01926] uncharacterized peroxidase-related enzyme 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCAAT TTCGTAGGAG AAATAATGCC CATTGGTATC ATGAGACTCA GTGTAGCGGC 
AGCCTGGAGC ATTGTAGCGG TAGCCCAGTG AATATTTCCA CGACAGTGAA TGTTCCCACG
ACAGTGAATA ATGAAACCCC CCCTGTTGAT ATTCGGCCAC GTGATCCCAG CGAGAGTGAC
AACATGAGCA CTGGCAGTAA TATAACTGAA CAGAGCATCA TAACTGAACA GGGCATCTTT
CTACTTGGCG TGACAGAAAA CATCGCTCCT ACATTACAAG ACACCCTCTA CCATGAGCAG
CCTATTCTTA CTGCCTCCGA CGCCATGTAT CAGGCCCTGT TCCCAACGAT TATCGAGATC
AACCACACCA ATACCTTCTC ACTTTATGAT CGGTTAAGTA CTGCGCTGAC GGTCGCTCAG
GTTACCGGGA TTCAGCGGCT ATGTAGCCAC TATGCTCTCC GTCTCGCGCC GCTCCCCAGC
CCGGATGCCT CAAGGGAAAG CAATATTAGG CTAACGCAAA TTACGCAATA TGCCCGCCAA
TTGGCCAGCC AACCTACGTT GATCGATAGG CATGCTTTAG CGCAATTGCA TGACGTGGGT
TTAACTGATA GCGATATCAT TATTTTATCG CAAATTATTG GATATGTGGG ATATCAAGCC
CGAGTGGTCG CTGGCATCTC TGCACTGGCT GGTTACCCTA CCGTGATGCT CCCCGGTTTC
CCCCGCATGG AAGATGCCGC CCCCAGCCCA TTACCAGATG TCATGCCCAA TTGGCAAGGT
TGGCTACCGT CTCATGCGGC AAACGACGAT CAACCCGATA AAGAACCTGA CGAAACGGCC
AGCACACTGA CTGAACTGTT GGGCCATCAC CAGCAAAGTT TGCTCGCTTA TCACGCCATT
ACCACTCACC AGCCCAACTC ACCTCAATTG CAACGTGACT GGCTGGAACT GGTGGCATTG
GTCAGCGCAC GAATCAATGG CAGCCTCTAC TGCCAAGCCC GTCACAGGCA ACATTTACAG
CAACTGACGG AGCCGCCCCT GTTGGTCACT GAGCTGTTAA AAGGGATTGA TCACGCGTTA
TTCTTGTTAC CCGAACAACA AATACCCCAT CAGCTAATCA GTGTAACCGC CGAGCTCACT
CGCGCCCCGG AACGCTTTAA TCATCAGCAT GTTAAACGTC TACAGACCCT TGGCGTCAGT
GATACTCAAG TCATGCGAAT TATTTTCAGT ATCGCCATTA CTGGTTGGAC CAACCGCCTA
CGACATACGT TAGGAAAATA G
 
Protein sequence
MVQFRRRNNA HWYHETQCSG SLEHCSGSPV NISTTVNVPT TVNNETPPVD IRPRDPSESD 
NMSTGSNITE QSIITEQGIF LLGVTENIAP TLQDTLYHEQ PILTASDAMY QALFPTIIEI
NHTNTFSLYD RLSTALTVAQ VTGIQRLCSH YALRLAPLPS PDASRESNIR LTQITQYARQ
LASQPTLIDR HALAQLHDVG LTDSDIIILS QIIGYVGYQA RVVAGISALA GYPTVMLPGF
PRMEDAAPSP LPDVMPNWQG WLPSHAANDD QPDKEPDETA STLTELLGHH QQSLLAYHAI
TTHQPNSPQL QRDWLELVAL VSARINGSLY CQARHRQHLQ QLTEPPLLVT ELLKGIDHAL
FLLPEQQIPH QLISVTAELT RAPERFNHQH VKRLQTLGVS DTQVMRIIFS IAITGWTNRL
RHTLGK