Gene SeD_A0228 details

Gene Information

Locus tag: SeD_A0228
Symbol: dgt
ID: 6871925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 240456
End bp: 242015
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 49%
IMG OID: 642783474
Product: deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_002214168
Protein GI: 198242011
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
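
The start, end, gene length, and protein length values in this record agree with one another, which a short Python sketch can confirm. It assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 240456
END_BP = 242015
GENE_LENGTH_BP = 1560
PROTEIN_LENGTH_AA = 519


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)
```

Both checks hold for the values listed here: 242015 - 240456 + 1 = 1560 bp, and 1560 / 3 - 1 = 519 aa.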


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGCTTAT GGGGAAGCGC ATTTCTCAGG CGGGGAGAGG ATATGGCATC GATCGATTTC 
CGAAATAAAA TTAACTGGCA TCGTCGTTAT CGTTCACCGC AGGGCGTAAA GACGGAACAT
GAGATCCTGC GGATTTTTGA AAGCGATCGC GGGCGGATTA TCAACTCTCC GGCTATACGC
CGTTTGCAGC AAAAAACGCA GGTTTTCCCG CTGGAGCGCA ATGCCGCGGT GCGTACTCGT
CTGACGCATT CGATGGAGGT GCAGCAGGTG GGGCGTTATA TCGCGAAAGA GATTTTAAGC
CGCCTGAAAG AGCAAAACCG ACTGGAGGAG TACGGTCTGG ATGCGCTGAC CGGTCCCTTT
GAAAGTATTG TGGAAATGGC CTGCCTGATG CACGACATCG GTAATCCGCC GTTCGGTCAT
TTTGGCGAGG CGGCGATCAA TGACTGGTTT CGTCAGCGGC TGCATCCGGA AGATGCGGAA
AGTCAGCCGC TCACGCATGA TCGCTGTGTG GTTTCCTCGC TACGGCTACA GGAAGGCGAA
GAAAATCTGA ACGATATTCG CCGCAAGGTA CGTCAGGATA TCTGCCATTT TGAAGGCAAT
GCACAGGGAA TTCGTCTGGT ACATACGCTC ATGCGGATGA ATCTTACCTG GGCGCAGGTT
GGCGGAATTT TAAAATATAC CCGTCCGGCA TGGTGGCGAG GGCCGGTGCC GGATTCCCAT
CGCTATTTAA TGAAGAAACC GGGCTATTAT CTTTCTGAAG AGAAGTATAT TGCGAGGTTA
CGTAAAGAAC TGCAGTTAGC GCCTTACAGT CGCTTTCCAT TAACGTGGAT TATGGAAGCC
GCAGATGATA TTTCTTATTG TGTCGCCGAT CTTGAAGACG CGGTAGAGAA AAGAATCTTT
AGCGTTGAGC AGCTTTATCA CCATTTATAT CACGCGTGGG GCCACCATGA GAAGGATTCG
CTGTTTGAGC TGGTGGTAGG AAATGCGTGG GAAAAATCAC GCGCCAATAC ATTAAGCCGC
AGTACCGAAG ATCAGTTTTT TATGTATTTA CGGGTAAATA CATTAAATAA ACTGGTGCCC
TATGCCGCTC AGCGTTTTAT TGATAATTTG CCGCAGATTT TTGCCGGTAC CTTCAATCAG
GCGCTGCTGG AAGATGCCAG CGGTTTTAGC CGCCTGCTTG AACTCTATAA GAATGTGGCG
GTTGAACATG TGTTTAGCCA TCCGGATGTA GAACAGCTTG AACTACAGGG ATACCGGGTG
ATCAGCGGGT TATTAGATAT CTATCAGCCG CTATTAAGCT TGTCGCTTAA CGACTTTCGC
GAGCTGGTGG AAAAAGAACG GTTGAAACGC TTCCCTATAG AATCGCGCTT ATTTCAGAAA
CTTTCTACGC GCCATCGTTT GGCCTACGTG GAAGTCGTCA GTAAATTACC CACGGATTCG
GCGGAGTACC CGGTACTGGA ATATTATTAT CGCTGTCGGT TGATTCAGGA TTATATCAGC
GGGATGACTG ACCTTTACGC ATGGGATGAA TATCGGCGTT TGATGGCGGT CGAACAGTAA
 
Protein sequence
MRLWGSAFLR RGEDMASIDF RNKINWHRRY RSPQGVKTEH EILRIFESDR GRIINSPAIR 
RLQQKTQVFP LERNAAVRTR LTHSMEVQQV GRYIAKEILS RLKEQNRLEE YGLDALTGPF
ESIVEMACLM HDIGNPPFGH FGEAAINDWF RQRLHPEDAE SQPLTHDRCV VSSLRLQEGE
ENLNDIRRKV RQDICHFEGN AQGIRLVHTL MRMNLTWAQV GGILKYTRPA WWRGPVPDSH
RYLMKKPGYY LSEEKYIARL RKELQLAPYS RFPLTWIMEA ADDISYCVAD LEDAVEKRIF
SVEQLYHHLY HAWGHHEKDS LFELVVGNAW EKSRANTLSR STEDQFFMYL RVNTLNKLVP
YAAQRFIDNL PQIFAGTFNQ ALLEDASGFS RLLELYKNVA VEHVFSHPDV EQLELQGYRV
ISGLLDIYQP LLSLSLNDFR ELVEKERLKR FPIESRLFQK LSTRHRLAYV EVVSKLPTDS
AEYPVLEYYY RCRLIQDYIS GMTDLYAWDE YRRLMAVEQ
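
The sequences above are laid out in blocks of 10 residues, 60 per line, so they need to be stripped of whitespace before use. The following Python sketch shows one way to do that, to compute a GC fraction, and to translate the DNA. It relies on the fact that the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs in which start codons it permits). The helper names are hypothetical, and only the first and last lines of the gene sequence are used as input; because 60 is a multiple of 3, each full line starts in frame.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(text: str) -> str:
    """Remove the spaces and line breaks of the 10-residue block layout."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an unambiguous DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence in this record.
first_line = "ATGCGCTTAT GGGGAAGCGC ATTTCTCAGG CGGGGAGAGG ATATGGCATC GATCGATTTC"
last_line = "GGGATGACTG ACCTTTACGC ATGGGATGAA TATCGGCGTT TGATGGCGGT CGAACAGTAA"

n_terminus = translate(strip_blocks(first_line))  # MRLWGSAFLRRGEDMASIDF
c_terminus = translate(strip_blocks(last_line))   # GMTDLYAWDEYRRLMAVEQ*
print(n_terminus, c_terminus)
```

The two translated fragments match the first 20 and last 19 residues of the protein sequence listed above, followed by the stop codon. Passing the full, whitespace-stripped gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.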