Gene SbBS512_E0152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0152 
Symboldgt 
ID6271385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp166117 
End bp167634 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content48% 
IMG OID641724404 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_001878963 
Protein GI187733138 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAGA TTGATTTCCG AAAAAAAATA AACTGGCATC GTCGTTACCG TTCACCGCAG 
GGCGTTAAAA CCGAACATGA GATCCTGCGG ATCTTCGAGA GCGATCGCGG GCGTATCATC
AACTCTCCGG CAATTCGTCG TCTGCAACAA AAGACCCAGG TTTTTCCACT GGAGCGCAAT
GCCGCCGTGC GCACGCGTCT TACCCACTCG ATGGAAGTCC AGCAGGTGGG GCGCTACATC
GCCAAAGAAA TTCTAAGCCG TATGAAAGAG CTTAAATTAC TGGAAGCATA CGGCCTGGAT
GAACTGACCG GCCCCTTTGA AAGCATTGTT GAGATGTCAT GCCTGATGCA CGATATCGGC
AATCCGCCGT TTGGTCATTT TGGCGAAGCG GCGATAAATG ATTGGTTTCG CCAGCGTTTG
CACCCGGAAG ATGCCGAAAG CCAGCCTCTG ACTGACGATC GCTGCAGCGT GGCGGCACTA
CGTTTACGGG ACGGGGAAGA ACCGCTTAAC GAGCTGCGGC GCAAGATTCG TCAGGACTTA
TGTCATTTTG AGGGGAATGC ACAAGGCATT CGTCTGGTGC ATACATTGAT GAGGATGAAT
CTCACCTGGG CACAGGTTGG CGGTATTTTA AAATATACCC GTCCGGCGTG GTGGCGTGGC
GAAACGCCTG AGACACATCA CTATTTAATG AAAAAGCCGG GTTATTATCT TTCTGAAGAA
GCCTATATTG CCCGGTTGCG TAAAGAACTT AATTTGGCGC TTTACAGTCG TTTTCCATTA
ACGTGGATTA TGGAAGCAGC CGACGACATC TCCTATTGTG TGGCAGACCT TGAAGATGCG
GTAGAGAAAA GAATATTTAC CGTTGAGCAG CTTTATCATC ATTTGCACGA AGCGTGGGGC
CAGCATGAGA AAGGTTCGCT CTTTTCGCTG GTGGTTGAAA ATGCCTGGGA AAAATCACGC
TCAAATAGTT TAAGCCGCAG TACGGAAGAT CAGTTTTTTA TGTATTTACG GGTAAACACC
CTAAATAAAC TGGTACCCTA TGCGGCACAA CGATTTATTG ATAATCTGCC TGCGATTTTC
GCCGGAACGT TTAATCATGC ATTATTGGAA GATGCCAGCG AATGCAGCGA TCTTCTTAAG
CTATATAAAA ATGTCGCTGT AAAACATGTG TTTAGCCATC CAGATGTCGA GCAGCTTGAA
TTGCAGGGCT ATCGGGTCAT TAGCGGATTA TTAGAGATTT ATCGTCCTTT ATTAAGCCTG
TCGTTATCCG ACTTTACTGA ACTGGTAGAA AAAGAACGGG TGAAACGTTT CCCTATTGAA
TCGCGCTTAT TCCACAAACT CTCGACGCGC CATCGGCTGG CCTATGTCGA GGCTGTCAGT
AAATTACCGT CAGATTATCC TGAGTTTCCG CTATGGGAAT ATTATTACCG TTGCCGCCTG
CTGCAGGATT ATATCAGCGG TATGACCGAC CTCTATGCGT GGGATGAATA CCGACGTCTG
ATGGCCGTAG AACAATAA
 
Protein sequence
MAQIDFRKKI NWHRRYRSPQ GVKTEHEILR IFESDRGRII NSPAIRRLQQ KTQVFPLERN 
AAVRTRLTHS MEVQQVGRYI AKEILSRMKE LKLLEAYGLD ELTGPFESIV EMSCLMHDIG
NPPFGHFGEA AINDWFRQRL HPEDAESQPL TDDRCSVAAL RLRDGEEPLN ELRRKIRQDL
CHFEGNAQGI RLVHTLMRMN LTWAQVGGIL KYTRPAWWRG ETPETHHYLM KKPGYYLSEE
AYIARLRKEL NLALYSRFPL TWIMEAADDI SYCVADLEDA VEKRIFTVEQ LYHHLHEAWG
QHEKGSLFSL VVENAWEKSR SNSLSRSTED QFFMYLRVNT LNKLVPYAAQ RFIDNLPAIF
AGTFNHALLE DASECSDLLK LYKNVAVKHV FSHPDVEQLE LQGYRVISGL LEIYRPLLSL
SLSDFTELVE KERVKRFPIE SRLFHKLSTR HRLAYVEAVS KLPSDYPEFP LWEYYYRCRL
LQDYISGMTD LYAWDEYRRL MAVEQ