Gene YpAngola_A3209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3209 
Symbol 
ID5801684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3401176 
End bp3402366 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content47% 
IMG OID641341038 
ProductPUA domain-containing protein 
Protein accessionYP_001607565 
Protein GI162421356 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0310372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00624197 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGTAC GCCTTATTCT TGCTAAAGGA CGTGAAAAAT CCTTACTTCG TCGCCACCCA 
TGGATTTTCT CTGGGGCGGT TCAACGCCTT GAAGGTGATG CACTTTCCGG TGAAACCATA
GATATCCTTG ATAGTCAGGG AAAATGGTTG GCCCGTGCCG CCTACTCGCC TGAATCACAA
ATTTTGGCAC GAGTCTGGAC GTTCCAACAA GATGAGGTTA TAGATTGCGC ATTCTTTATT
CGCCGCCTAC AACAGGCTCA GAATTGGCGA GATTGGCTGG CACAGCGTGA TGGCCTTAAT
GGCTACCGCC TGATTGCCGG TGAATCGGAT GGCCTGCCGG GCATTACCAT TGACCGTTTC
CAGAATTTCT TGGTGTTACA GTTACTTTCT GCCGGTGCCG AATATCAACG TGAAACGCTG
GTCAGTGCGT TACAACATTG CTATCCAGAA TGTTCTATTT ATGATCGTTC CGATGTTTCT
GTCCGTAAAA AAGAAGGTTT GCCACTGACT CAAGGCCTTA TCTGTGGCGA AATGCCACCA
GCCCTGCTGC CAATTAGCGA AAATGGCATG CAACTCTTCG TTGATATCCA GCAAGGTCAT
AAAACGGGTT TCTATTTAGA TCAACGCGAT AGCCGCTTAG CGGCCCGTAA TTATGCCAAT
GGCCGTCGTG TTTTGAACTG CTTCTCATAC ACGGGGGCTT TTGCTGTCGC CGCGTTGATG
GGCAATTGCC AACAGGTCAT TAGTGTTGAT ACATCGCAAT CCGTGTTAGA TATTGCGAAA
CAAAACATTG AACTGAACCA GTTGGATCTG AGCAAAACGG AGTTTGTCCG TGACGATGTA
TTCCAATTGT TGCGCAGCTA TCGCGCTCAA GGGGAAAAAT TCGACCTGAT CATTATGGAT
CCACCTAAAT TTGTTGAGAA TAAAAGCCAA TTAGCCAGTG CATGCCGTGG CTATAAAGAT
ATCAATATGT TGGCGATTCA ATTACTGCGC CCTGGCGGTA TCTTGCTCAG TTTCTCTTGT
TCAGGTTTGA TGCCGGTCGA TTTATTCCAG AAAATTTTGG CCGATGCCGC GTTAGATGCA
GGCCATGACA TACAGTTTAT AGAGCAGTTC CGCCAAGCTG CGGATCACCC AGTGATCGCG
GCTTATCCAG AAGGTTTGTA TCTAAAAGGG TTCGCGTGTC GGGTAATGTA A
 
Protein sequence
MTVRLILAKG REKSLLRRHP WIFSGAVQRL EGDALSGETI DILDSQGKWL ARAAYSPESQ 
ILARVWTFQQ DEVIDCAFFI RRLQQAQNWR DWLAQRDGLN GYRLIAGESD GLPGITIDRF
QNFLVLQLLS AGAEYQRETL VSALQHCYPE CSIYDRSDVS VRKKEGLPLT QGLICGEMPP
ALLPISENGM QLFVDIQQGH KTGFYLDQRD SRLAARNYAN GRRVLNCFSY TGAFAVAALM
GNCQQVISVD TSQSVLDIAK QNIELNQLDL SKTEFVRDDV FQLLRSYRAQ GEKFDLIIMD
PPKFVENKSQ LASACRGYKD INMLAIQLLR PGGILLSFSC SGLMPVDLFQ KILADAALDA
GHDIQFIEQF RQAADHPVIA AYPEGLYLKG FACRVM