Gene SNSL254_A0229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0229 
Symboldgt 
ID6485036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp244962 
End bp246521 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content49% 
IMG OID642735666 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_002039448 
Protein GI194445486 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.889459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTAT GGGGAAGCGC ATTTCTCAGG CGGGGAGAGG ATATGGCATC GATCGATTTC 
CGAAATAAAA TTAACTGGCA TCGTCGTTAT CGTTCACCGC AGGGCGTAAA GACGGAACAT
GAGATCCTGC GGATTTTTGA AAGCGATCGC GGGCGGATTA TCAACTCTCC GGCTATACGC
CGTTTGCAGC AAAAAACGCA GGTTTTCCCG CTGGAGCGCA ATGCCGCGGT GCGTACTCGC
CTGACGCATT CAATGGAGGT GCAGCAGGTA GGGCGTTATA TCGCGAAAGA GATTTTAAGC
CGCCTGAAAG AGCAAAACCG GCTGGAGGAG TACGGTCTGG ATGCGCTGAC CGGTCCCTTT
GAAAGTATTG TAGAAATGGC CTGCCTGATG CACGACATCG GTAATCCGCC GTTTGGTCAT
TTTGGCGAGG CGGCGATCAA TGACTGGTTT CGTCAGCGGC TGCATCCGGA AGATGCGGAA
AGTCAGCCGC TCACGCATGA TCGCTGTGTG GTTTCCTCGC TACGGTTACA GGAAGGTGAA
GAAAATCTGA ACGATATTCG CCGCAAGGTA CGTCAGGATA TCTGCCATTT TGAAGGCAAT
GCACAGGGAA TTCGTCTGGT GCATACGCTC ATGCGGATGA ATCTTACCTG GGCGCAGGTT
GGCGGAATTT TAAAATATAC CCGTCCGGCA TGGTGGCGAG GGCCGGTGCC GGATTCCCAT
CGCTATTTAA TGAAGAAACC GGGCTATTAT CTTTCTGAAG AGAAGTATAT TGCGAGGTTA
CGTAAAGAAC TGCAGTTAGC GCCTTACAGT CGCTTTCCAT TAACGTGGAT TATGGAAGCC
GCAGATGATA TTTCTTATTG TGTCGCCGAT CTTGAAGACG CGGTAGAGAA AAGAATCTTT
AGCGTTGAGC AGCTTTATCA CCATTTATAT CACGCGTGGG GCCACCATGA GAAGGATTCG
CTGTTTGAGC TGGTGGTAGG AAATGCGTGG GAAAAATCAC GCGCCAATAC ATTAAGCCGC
AGTACCGAAG ATCAGTTTTT TATGTATTTA CGGGTAAATA CATTAAATAA ACTGGTGCCC
TATGCCGCTC AGCGTTTTAT TGATAATTTG CCGCAGATTT TTGCCGGTAC CTTCAATCAG
GCGCTGCTGG AAGATGCCAG CGGTTTTAGC CGCCTACTTG AACTCTATAA GAATGTGGCG
GTTGAGCATG TGTTTAGCCA TCCGGATGTA GAACAGCTTG AACTACAGGG ATACCGGGTG
ATCAGCGGGT TATTAGATAT CTATCAGCCG CTATTAAGCT TGTCGCTTAA CGACTTTCGC
GAGCTGGTGG AAAAAGAACG GTTGAAACGC TTCCCCATAG AATCGCGCTT ATTTCAGAAA
CTTTCTACGC GCCATCGTTT GGCCTACGTG GAAGTCGTCA GTAAATTACC CACGGATTCG
GCGGAGTACC CGGTACTGGA ATATTATTAT CGCTGTCGGT TGATTCAGGA TTATATCAGC
GGGATGACTG ACCTTTACGC ATGGGATGAA TATCGGCGTT TGATGGCGGT CGAACAGTAA
 
Protein sequence
MRLWGSAFLR RGEDMASIDF RNKINWHRRY RSPQGVKTEH EILRIFESDR GRIINSPAIR 
RLQQKTQVFP LERNAAVRTR LTHSMEVQQV GRYIAKEILS RLKEQNRLEE YGLDALTGPF
ESIVEMACLM HDIGNPPFGH FGEAAINDWF RQRLHPEDAE SQPLTHDRCV VSSLRLQEGE
ENLNDIRRKV RQDICHFEGN AQGIRLVHTL MRMNLTWAQV GGILKYTRPA WWRGPVPDSH
RYLMKKPGYY LSEEKYIARL RKELQLAPYS RFPLTWIMEA ADDISYCVAD LEDAVEKRIF
SVEQLYHHLY HAWGHHEKDS LFELVVGNAW EKSRANTLSR STEDQFFMYL RVNTLNKLVP
YAAQRFIDNL PQIFAGTFNQ ALLEDASGFS RLLELYKNVA VEHVFSHPDV EQLELQGYRV
ISGLLDIYQP LLSLSLNDFR ELVEKERLKR FPIESRLFQK LSTRHRLAYV EVVSKLPTDS
AEYPVLEYYY RCRLIQDYIS GMTDLYAWDE YRRLMAVEQ