Gene YpsIP31758_3324 details


Gene Information

Locus tag: YpsIP31758_3324
Symbol: dgt
ID: 5387009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3736370
End bp: 3737890
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 43%
IMG OID: 640866339
Product: deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_001402281
Protein GI: 153950205
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
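
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names span_length and expected_protein_length are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3736370
end_bp = 3737890
gene_length_bp = 1521
protein_length_aa = 506


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(gene_length_bp)

print(computed_gene_length == gene_length_bp)        # consistency of Start/End with Gene Length
print(computed_protein_length == protein_length_aa)  # consistency of Gene Length with Protein Length
```

Under these assumptions, the Start bp, End bp, Gene Length, and Protein Length fields agree with one another.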


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.065001
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGGGA TCGACTTTAA GCAGAAAATA AGTTTCCAGC GGCCTTTTAG TAAGCCCAGT 
TCAGCAGAAG ATGAATATGA AATAACAAGG GTATTTGAAA GTGATCGTGG GCGGATTGTT
AACTCTGCTG CTATCCGGCG TCTGCAACAA AAAACGCAAG TATTCCCGCT GGAACGCAAT
GCCGCCGTTC GTAGCCGATT AACCCATTCG TTGGAAGTGC AACAAGTCGG GCGTTATATC
GCGAAAGAGA TCCTGAACCG CTTTAAACAG GATAAAAAAA TCACGGCCTA CGGTTTGGAT
AAACTACTCG ACCCTTTTGA AAGTATTGTT GAAATGGCCT GCCTGATGCA TGACATTGGT
AACCCGCCAT TTGGTCATTT CGGTGAGTCA GCGATCAATG ATTGGTTTAC AAAACGGATG
GACCCTAACG GCGGCAGCGG TTCTGAACCA CAAAGCACAG ATCAATGTCA GGTAGAGGTG
CTGAAGCTAT GTGAGGGAGA AACCGAACTT AATATTCTGC GCAGTAAAAT TCGTCATGAC
CTTAGCCAGT TTGAGGGCAA CGCTCAGGCT ATTCGTTTGG TTCACAGTTT ATTAAAACTG
AATCTGACCT ATGCTCAGGT GGGTTGTATT CTTAAATATA CTAAGCCCGC TTATTGGTCA
GCCCCTATTC CAGCGTCCCA TAACTATTTG ATGAAAAAAC CCGGCTTCTA TCTGGCAGAG
GAAAATTACG TCAAAGAACT GCGTCGCGAA CTCAATATGG AAGAGTTTGA CCGTTTTCCA
CTGACTTATA TTATGGAGGC CGCCGATGAT ATTTCTTACT GTATAGCCGA TTTAGAAGAT
GCAGTAGAAA AAAATATTTT CAGTGTCGAA CAACTCTATG ATCATATGAG CCAAGAGTGG
GGGGCCGTTA CACCAGGGGA TCTGTTTGAT AAAGTCGTGG GTGCCGCTTT TCGTCAATTA
GGCCGTGAGC AAGGCCGACG TAGCTCAGAA GATCAATTCT TTATGTATCT ACGGGTAAAT
ACTGTGGGGA AATTAGTCCC TCATGCGGCA CAACGCTTTA TTGAAAATCT ACCGGCTGTT
TTTTCAGGCT CTTTTAACCA GGCATTGTTA GAAGATTCCA GTGCCGCTTG TAAGTTATTG
CAAATTTTCA AACGTGTCGC AGTAAAACAT GTATTTAACC ACCCAGAAGT TGAACAGCTT
GAATTACAAG GGTATAGAGT CATCAGTGGG CTGCTTGATA TTTATAGCCC GTTATTAGCA
ATGCCAGAGA CCGCCTTTAC ACAATTAGTT GCAGATGACC GCCACCGTAA GTATCCAATT
GAAACACGGT TATTTCATAA ATTATCGATT AAACATCGGT TAGCTTATGC TGAATCTGCA
GAAAGAATCC GTAATTTACC GTCCGAACAA TATGAGATAT ATGAATATTA TTATCGTGCG
CGGTTAATTC AGGATTATAT CAGTGGGATG ACCGATCTTT ATGCTTATGA TGAATACCGG
CGTTTAATGG CTGCGGAATA G
 
Protein sequence
MSGIDFKQKI SFQRPFSKPS SAEDEYEITR VFESDRGRIV NSAAIRRLQQ KTQVFPLERN 
AAVRSRLTHS LEVQQVGRYI AKEILNRFKQ DKKITAYGLD KLLDPFESIV EMACLMHDIG
NPPFGHFGES AINDWFTKRM DPNGGSGSEP QSTDQCQVEV LKLCEGETEL NILRSKIRHD
LSQFEGNAQA IRLVHSLLKL NLTYAQVGCI LKYTKPAYWS APIPASHNYL MKKPGFYLAE
ENYVKELRRE LNMEEFDRFP LTYIMEAADD ISYCIADLED AVEKNIFSVE QLYDHMSQEW
GAVTPGDLFD KVVGAAFRQL GREQGRRSSE DQFFMYLRVN TVGKLVPHAA QRFIENLPAV
FSGSFNQALL EDSSAACKLL QIFKRVAVKH VFNHPEVEQL ELQGYRVISG LLDIYSPLLA
MPETAFTQLV ADDRHRKYPI ETRLFHKLSI KHRLAYAESA ERIRNLPSEQ YEIYEYYYRA
RLIQDYISGM TDLYAYDEYR RLMAAE
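
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the tool used to produce this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted, which is not handled here). The helper name translate is hypothetical, and the two test fragments are the first 30 and last 21 nucleotides of the gene sequence above.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering
# (TTT, TTC, TTA, TTG, TCT, ...), with '*' marking stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides and last 21 nucleotides of the gene sequence.
first_fragment = "ATGTCCGGGA TCGACTTTAA GCAGAAAATA"
last_fragment = "CGTTTAATGG CTGCGGAATA G"

print(translate(first_fragment))  # compare with the start of the protein sequence
print(translate(last_fragment))   # compare with the end of the protein sequence
```

The first fragment corresponds to the leading residues MSGIDFKQKI and the last fragment to the trailing residues RLMAAE, followed by the TAG stop codon.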