Gene YpAngola_A0988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0988 
Symboldgt 
ID5799451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1011460 
End bp1012980 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content43% 
IMG OID641338977 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_001605549 
Protein GI162418606 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00819808 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGGA TCGACTTTAA GCAGAAAATA AGTTTCCAGC GGCCTTTTAG TAAGCCCAGT 
TCAGCAGAAG ATGAATATGA AATAACAAGG GTATTTGAAA GTGATCGTGG GCGGATTGTT
AACTCTGCTG CTATCCGGCG TCTGCAACAA AAAACGCAAG TATTCCCGCT GGAACGCAAT
GCCGCCGTTC GTAGCCGATT AACCCATTCG TTGGAAGTGC AACAAGTCGG GCGTTATATC
GCGAAAGAGA TCCTGAACCG CTTTAAACAG GATAAAAAAA TCACGGCCTA CGGTTTGGAT
AAACTACTCG ACCCTTTTGA AAGTATTGTT GAAATGGCCT GTCTGATGCA TGACATTGGT
AACCCGCCAT TTGGTCATTT CGGTGAGTCA GCGATCAATG ATTGGTTTAC AAAACGGATG
GACCCTAACG GCGGCAGCGG TTCTGAACCA CAAAGCACAG ATCAATGTCA GGTAGATGTG
CTGAAGCTAT GTGAGGGAGA AACCGAACTT AATATTCTGC GCAGTAAAAT TCGTCATGAC
CTTAGCCAGT TTGAGGGCAA CGCTCAGGCT ATTCGTTTGG TTCACAGTTT ATTAAAACTG
AATCTGACCT ATGCTCAGGT GGGTTGTATT CTTAAATATA CTAAGCCCGC TTATTGGTCA
GCCCCTATTC CAGCGTCCCA TAACTATTTG ATGAAAAAAC CCGGCTTCTA TCTGGCAGAG
GAAAATTACG TCAAAGAACT GCGTCGCGAA CTCAATATGG AAGAGTTTGA CCGTTTTCCA
CTGACTTATA TTATGGAGGC CGCCGATGAT ATTTCTTACT GTATAGCCGA TTTAGAAGAT
GCAGTAGAAA AAAATATTTT CAGTGTCGAA CAACTCTATG ATCATATGAG CCAAGAGTGG
GGGGCCGTTA CACCGGGGGA TCTGTTTGAT AAAGTCGTGG GTGCCGCTTT TCGTCAATTA
GGCCGTGAGC AAGGCCGGCG TAGCTCAGAA GATCAATTCT TTATGTATCT ACGGGTAAAT
ACTGTGGGGA AATTAGTCCC TCATGCGGCA CAACGCTTTA TTGAAAATCT ACCGGCTGTT
TTTTCAGGCT CTTTTAACCA GGCATTGTTA GAAGATTCCA GTGCCGCTTG TAAGTTATTG
CAAATTTTCA AACGTGTCGC AGTAAAACAT GTATTTAACC ACCCAGAAGT TGAACAGCTT
GAATTACAAG GGTATAGAGT CATCAGTGGG CTGCTTGATA TTTATAGCCC GTTATTAGCA
ATGCCAGAGA CCGCCTTTAC ACAATTAGTT GCAGATGACC GCCACCGTAA GTATCCAATT
GAAACACGGT TATTTCATAA ATTATCGATT AAACATCGGT TAGCTTATGC TGAATCTGCA
GAAAGAATCC GTAATTTACC GTCCGAACAA TATGAGATAT ATGAATATTA TTATCGTGCG
CGGTTAATTC AGGATTATAT CAGTGGGATG ACCGATCTTT ATGCTTATGA TGAATACCGG
CGTTTAATGG CTGCGGAATA G
 
Protein sequence
MSGIDFKQKI SFQRPFSKPS SAEDEYEITR VFESDRGRIV NSAAIRRLQQ KTQVFPLERN 
AAVRSRLTHS LEVQQVGRYI AKEILNRFKQ DKKITAYGLD KLLDPFESIV EMACLMHDIG
NPPFGHFGES AINDWFTKRM DPNGGSGSEP QSTDQCQVDV LKLCEGETEL NILRSKIRHD
LSQFEGNAQA IRLVHSLLKL NLTYAQVGCI LKYTKPAYWS APIPASHNYL MKKPGFYLAE
ENYVKELRRE LNMEEFDRFP LTYIMEAADD ISYCIADLED AVEKNIFSVE QLYDHMSQEW
GAVTPGDLFD KVVGAAFRQL GREQGRRSSE DQFFMYLRVN TVGKLVPHAA QRFIENLPAV
FSGSFNQALL EDSSAACKLL QIFKRVAVKH VFNHPEVEQL ELQGYRVISG LLDIYSPLLA
MPETAFTQLV ADDRHRKYPI ETRLFHKLSI KHRLAYAESA ERIRNLPSEQ YEIYEYYYRA
RLIQDYISGM TDLYAYDEYR RLMAAE