Gene Information |
Locus tag | TM1040_0145 |
Symbol | |
ID | 4078812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 159057 |
End bp | 160424 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638005439 |
Product | deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_612140 |
Protein GI | 99079986 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
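The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 159057
END_BP = 160424
GENE_LENGTH_BP = 1368
PROTEIN_LENGTH_AA = 455


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(residues_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the reported gene length and protein length agree with the coordinates.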
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000232928 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATGGA CACAACTGCT GAACCCAAGA CGGCTGTGTC GCCCGGACTA TGCCGACAAA CCCAACCGTC CCGCCTATTT GCAGGACTAC GACCGCATCT TGTTCTCAGA GCCGTTCCGC CGTCTGGCGC AGAAAACGCA GGTCCATCCG CTCTATGACC ATGATCACGT CCACCACCGG ATGATCCACT CAATGGAGAC CTCCTCGGTC GGGCGCTCCC TTGGAATTCA GGTCGGCGAG GCGCTCGTGG CCGATGGCCG GCTCGAGGAC GGGCTGCAGC ATGTGATGGC GGGCACGGTG CAGGCGGCCT GTCTGGTACA TGACATTGGC AATCCGCCCT TTGGGCATTC CGGCGAAGCC AGCATCGGTG CGTGGTTTGC GCAGCAATTT GCCGCCAACA ATGGCACAGG GATCGGCATC GCCGCGGGCA TCGCGCCCGA GCATCGCGCT GAATTTGAGG CCTTTGAAGG CAACGCTCAG GGCTTTCGGA TCGTGTCCCG GCTGGAAATG GCGCGGCGCG AGGGGGGCAT GCGGCTCTCA TATGCAACGC TCGGTGCATT TGCGAAATAC CCCTGCACCG CCAGTGCCGC CGCCGATGCG CAGGACACCT ATGTGGGCCT CAAGAAGTTC GGCTGCTTTG CCGGCGAAGA AGCGCTTTTC GCCGAAGTCG CAAGCGCCCT TGGCCTGCCC CAGGAACGCA CCCCTTCTGG CGAGCGGTGG TGGCGCCGCC ACCCGCTGGC GTTCTTGGTC GAGGCGGCAG ATGACATATG CTATCGCATT CTCGACCTCG AAGACGCGGC GACCGTGGGC GATCTAGGCG GCGAGGTGGT TTCTGAAATC CTCGAAGAGA TCACCGGCAA GCCCAACCGC TCGCCCGAGC CAGAAATGAC CCTGCGCGAG CGCACCGGCA TGCAGCGCGC GATGGCGATT GGCGCTGCCA TCGACAGCGC GGTTGAGGCG TTTCTTGAAC ACTACGATGC CATCATGGAC GGCACCTTCA ATGATGGGCT GATGGAAGTG TCGAGCAAGG CTGCCACCTT TGCCCGGCTG AAAGAGATCT CAAACGCGCG CATTTTCACC GCCCAGCGTA AAACTGCGCT CGAAGTGGTG GGACGCAAAG TGATCTTCAC GATCCTCGAC GAATTCCACG CGTTGTTTGT GGCCCTAAAG GCCTGCGACT GGGATGCGCA GCGCTTGCTC AAGGAACATG GCTACTGGAC CCAGCTCGTG CGCGCTGTCG ATCTCGACCT GCGCGGCGTA ACGGACGACT ACACGGCCGC CCATGCGCTG ACCGATTTTG TTTCCGGCAT GACCGACCGC TACGCGATCC GCGTGCGCGA CATGATCACC GGTCAGGTGC CAAGCTGA
|
Protein sequence | MEWTQLLNPR RLCRPDYADK PNRPAYLQDY DRILFSEPFR RLAQKTQVHP LYDHDHVHHR MIHSMETSSV GRSLGIQVGE ALVADGRLED GLQHVMAGTV QAACLVHDIG NPPFGHSGEA SIGAWFAQQF AANNGTGIGI AAGIAPEHRA EFEAFEGNAQ GFRIVSRLEM ARREGGMRLS YATLGAFAKY PCTASAAADA QDTYVGLKKF GCFAGEEALF AEVASALGLP QERTPSGERW WRRHPLAFLV EAADDICYRI LDLEDAATVG DLGGEVVSEI LEEITGKPNR SPEPEMTLRE RTGMQRAMAI GAAIDSAVEA FLEHYDAIMD GTFNDGLMEV SSKAATFARL KEISNARIFT AQRKTALEVV GRKVIFTILD EFHALFVALK ACDWDAQRLL KEHGYWTQLV RAVDLDLRGV TDDYTAAHAL TDFVSGMTDR YAIRVRDMIT GQVPS
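The sequences above are printed in space-separated blocks of ten characters, and the gene sequence appears to be given in coding orientation (it starts with ATG and ends with TGA) even though the gene lies on the minus strand. The following is a minimal sketch of how the spacing could be stripped and the gene sequence translated and checked against the protein sequence. It builds the codon table from the standard NCBI base ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. The helper names are hypothetical, and only short fragments of the sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks and last two blocks of the gene sequence.
    print(translate("ATGGAATGGA CACAACTGCT GAACCCAAGA"))
    print(translate("GGTCAGGTGC CAAGCTGA"))
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would check the whole record; `gc_fraction` on the full gene sequence can be compared with the reported GC content in the same way.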