Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0229 |
Symbol | dgt |
ID | 6485036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 244962 |
End bp | 246521 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642735666 |
Product | deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_002039448 |
Protein GI | 194445486 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 0.889459 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTAT GGGGAAGCGC ATTTCTCAGG CGGGGAGAGG ATATGGCATC GATCGATTTC CGAAATAAAA TTAACTGGCA TCGTCGTTAT CGTTCACCGC AGGGCGTAAA GACGGAACAT GAGATCCTGC GGATTTTTGA AAGCGATCGC GGGCGGATTA TCAACTCTCC GGCTATACGC CGTTTGCAGC AAAAAACGCA GGTTTTCCCG CTGGAGCGCA ATGCCGCGGT GCGTACTCGC CTGACGCATT CAATGGAGGT GCAGCAGGTA GGGCGTTATA TCGCGAAAGA GATTTTAAGC CGCCTGAAAG AGCAAAACCG GCTGGAGGAG TACGGTCTGG ATGCGCTGAC CGGTCCCTTT GAAAGTATTG TAGAAATGGC CTGCCTGATG CACGACATCG GTAATCCGCC GTTTGGTCAT TTTGGCGAGG CGGCGATCAA TGACTGGTTT CGTCAGCGGC TGCATCCGGA AGATGCGGAA AGTCAGCCGC TCACGCATGA TCGCTGTGTG GTTTCCTCGC TACGGTTACA GGAAGGTGAA GAAAATCTGA ACGATATTCG CCGCAAGGTA CGTCAGGATA TCTGCCATTT TGAAGGCAAT GCACAGGGAA TTCGTCTGGT GCATACGCTC ATGCGGATGA ATCTTACCTG GGCGCAGGTT GGCGGAATTT TAAAATATAC CCGTCCGGCA TGGTGGCGAG GGCCGGTGCC GGATTCCCAT CGCTATTTAA TGAAGAAACC GGGCTATTAT CTTTCTGAAG AGAAGTATAT TGCGAGGTTA CGTAAAGAAC TGCAGTTAGC GCCTTACAGT CGCTTTCCAT TAACGTGGAT TATGGAAGCC GCAGATGATA TTTCTTATTG TGTCGCCGAT CTTGAAGACG CGGTAGAGAA AAGAATCTTT AGCGTTGAGC AGCTTTATCA CCATTTATAT CACGCGTGGG GCCACCATGA GAAGGATTCG CTGTTTGAGC TGGTGGTAGG AAATGCGTGG GAAAAATCAC GCGCCAATAC ATTAAGCCGC AGTACCGAAG ATCAGTTTTT TATGTATTTA CGGGTAAATA CATTAAATAA ACTGGTGCCC TATGCCGCTC AGCGTTTTAT TGATAATTTG CCGCAGATTT TTGCCGGTAC CTTCAATCAG GCGCTGCTGG AAGATGCCAG CGGTTTTAGC CGCCTACTTG AACTCTATAA GAATGTGGCG GTTGAGCATG TGTTTAGCCA TCCGGATGTA GAACAGCTTG AACTACAGGG ATACCGGGTG ATCAGCGGGT TATTAGATAT CTATCAGCCG CTATTAAGCT TGTCGCTTAA CGACTTTCGC GAGCTGGTGG AAAAAGAACG GTTGAAACGC TTCCCCATAG AATCGCGCTT ATTTCAGAAA CTTTCTACGC GCCATCGTTT GGCCTACGTG GAAGTCGTCA GTAAATTACC CACGGATTCG GCGGAGTACC CGGTACTGGA ATATTATTAT CGCTGTCGGT TGATTCAGGA TTATATCAGC GGGATGACTG ACCTTTACGC ATGGGATGAA TATCGGCGTT TGATGGCGGT CGAACAGTAA
|
Protein sequence | MRLWGSAFLR RGEDMASIDF RNKINWHRRY RSPQGVKTEH EILRIFESDR GRIINSPAIR RLQQKTQVFP LERNAAVRTR LTHSMEVQQV GRYIAKEILS RLKEQNRLEE YGLDALTGPF ESIVEMACLM HDIGNPPFGH FGEAAINDWF RQRLHPEDAE SQPLTHDRCV VSSLRLQEGE ENLNDIRRKV RQDICHFEGN AQGIRLVHTL MRMNLTWAQV GGILKYTRPA WWRGPVPDSH RYLMKKPGYY LSEEKYIARL RKELQLAPYS RFPLTWIMEA ADDISYCVAD LEDAVEKRIF SVEQLYHHLY HAWGHHEKDS LFELVVGNAW EKSRANTLSR STEDQFFMYL RVNTLNKLVP YAAQRFIDNL PQIFAGTFNQ ALLEDASGFS RLLELYKNVA VEHVFSHPDV EQLELQGYRV ISGLLDIYQP LLSLSLNDFR ELVEKERLKR FPIESRLFQK LSTRHRLAYV EVVSKLPTDS AEYPVLEYYY RCRLIQDYIS GMTDLYAWDE YRRLMAVEQ
|
| |