Gene YpAngola_A0508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0508 
SymbolgppA 
ID5798971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp529773 
End bp531269 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content51% 
IMG OID641338513 
Productguanosine pentaphosphate phosphohydrolase 
Protein accessionYP_001605105 
Protein GI162418766 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.577686 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0821648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTAA GTTCCACCTC ACTTTATGCT GCCATCGATC TTGGCTCCAA TAGTTTTCAT 
ATGTTGGTAG TACGTGAGGT GGCTGGCAGT ATCCAAACGC TGGCACGTAT TAAGCGGAAG
GTCCGCCTGG CGGCTGGTCT GGATAACCAA AATCATCTAT CGCAGGAAGC GATGGAACGA
GGCTGGCAAT GCCTAAAACT TTTCTCAGAG CGTTTACAGG ATATTCCTCT GGATCAAATC
CGCGTGGTCG CAACGGCAAC CTTGCGCCTG GCCTCTAATG CCGACGAATT CTTGCGTACT
GCAACCGAGA TCCTCGGCTG CCCTATTCAA GTCATCAGTG GCGAAGAAGA AGCCCGTCTG
ATTTATCATG GCGTAGCGCA TACGACTGGC GGGCCAGAAC AGCGGTTGGT CGTCGATATT
GGGGGGGGCA GCACTGAGTT GGTTACAGGC AATGGGGCTC AGGCGAATAT TTTGGTCAGC
CTATCAATGG GTTGTGTTAC CTGGTTAGAA CGTTATTTTG GTGACCGCCA TCTGGCAAAG
GAAAATTTTG AACGCGCTGA ATTGGCCGCT CATGAGATGA TCAAGCCCGT CGCCCAACGT
TTTCGTGAAC ATGGCTGGCA AGTTTGTGTC GGCGCTTCAG GCACCGTTCA GGCACTACAA
GAGATCATGG TCGCTCAAGG TATGGACGAG CTGATCACCT TAGCCAAGCT GCAACAGCTC
AAACAAAGAG CGATTCAGTG TGGCAAATTA GAAGAGTTGG AAATCCCTGG TTTAACCTTG
GAACGAGCGC TGGTCTTCCC CAGTGGTCTG TCCATCTTAA TTGCGATATT CCAGGAGTTG
TCCATTGAAA GCATGACACT GGCAGGTGGC GCACTGCGCG AAGGGCTGGT CTATGGCATG
CTCCATTTAC CGGTCGAGCA AGATATTCGC CGCCGGACAC TACGTAATTT ACAGCGCCGC
TATTTACTGG ATACCGAGCA AGCTAAGCGC GTCAGTTGTT TGGCGGATAA CTTTTTCCTA
CAAGTGGAAA AAGAGTGGCA TCTCGATGGC CGATGTCGCG AATTTTTGCA AAACGCCTGT
TTGATCCATG AAATTGGCCT CAGTGTCGAT TTTAAACATG CTCCGCAACA TGCCGCTTAT
CTGATCCGTA ATCTGGATCT ACCCGGTTTT ACTCCTGCAC AAAAGCTGCT GCTTTCTGCT
CTGTTACAAA ACCAGAGTGA CACTATCGAT CTATCGCTCT TGAACCAGCA GAATGCATTA
CCTGCCGACA TGGCACAGCA TTTGTGTCGT CTACTGCGCT TGGCCATTAT TTTTTCCAGC
CGTCGCCGGG ATGATACCCT GCCAGCAGTC AGGTTGCGGG CCGATAATAA TGCGCTTTAT
GTGCTGGTCC CCCAAGGTTG GTTGGAACAG CACCCCTACC GCGCCGAAGC GTTAGAACAA
GAGAGTCACT GGCAAAGTTA TGTTCAATGG CCACTGCTAT TGGAAGAGCT TAGCTAA
 
Protein sequence
MMLSSTSLYA AIDLGSNSFH MLVVREVAGS IQTLARIKRK VRLAAGLDNQ NHLSQEAMER 
GWQCLKLFSE RLQDIPLDQI RVVATATLRL ASNADEFLRT ATEILGCPIQ VISGEEEARL
IYHGVAHTTG GPEQRLVVDI GGGSTELVTG NGAQANILVS LSMGCVTWLE RYFGDRHLAK
ENFERAELAA HEMIKPVAQR FREHGWQVCV GASGTVQALQ EIMVAQGMDE LITLAKLQQL
KQRAIQCGKL EELEIPGLTL ERALVFPSGL SILIAIFQEL SIESMTLAGG ALREGLVYGM
LHLPVEQDIR RRTLRNLQRR YLLDTEQAKR VSCLADNFFL QVEKEWHLDG RCREFLQNAC
LIHEIGLSVD FKHAPQHAAY LIRNLDLPGF TPAQKLLLSA LLQNQSDTID LSLLNQQNAL
PADMAQHLCR LLRLAIIFSS RRRDDTLPAV RLRADNNALY VLVPQGWLEQ HPYRAEALEQ
ESHWQSYVQW PLLLEELS