Gene TM1040_0145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0145 
Symbol 
ID4078812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp159057 
End bp160424 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content62% 
IMG OID638005439 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_612140 
Protein GI99079986 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000232928 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATGGA CACAACTGCT GAACCCAAGA CGGCTGTGTC GCCCGGACTA TGCCGACAAA 
CCCAACCGTC CCGCCTATTT GCAGGACTAC GACCGCATCT TGTTCTCAGA GCCGTTCCGC
CGTCTGGCGC AGAAAACGCA GGTCCATCCG CTCTATGACC ATGATCACGT CCACCACCGG
ATGATCCACT CAATGGAGAC CTCCTCGGTC GGGCGCTCCC TTGGAATTCA GGTCGGCGAG
GCGCTCGTGG CCGATGGCCG GCTCGAGGAC GGGCTGCAGC ATGTGATGGC GGGCACGGTG
CAGGCGGCCT GTCTGGTACA TGACATTGGC AATCCGCCCT TTGGGCATTC CGGCGAAGCC
AGCATCGGTG CGTGGTTTGC GCAGCAATTT GCCGCCAACA ATGGCACAGG GATCGGCATC
GCCGCGGGCA TCGCGCCCGA GCATCGCGCT GAATTTGAGG CCTTTGAAGG CAACGCTCAG
GGCTTTCGGA TCGTGTCCCG GCTGGAAATG GCGCGGCGCG AGGGGGGCAT GCGGCTCTCA
TATGCAACGC TCGGTGCATT TGCGAAATAC CCCTGCACCG CCAGTGCCGC CGCCGATGCG
CAGGACACCT ATGTGGGCCT CAAGAAGTTC GGCTGCTTTG CCGGCGAAGA AGCGCTTTTC
GCCGAAGTCG CAAGCGCCCT TGGCCTGCCC CAGGAACGCA CCCCTTCTGG CGAGCGGTGG
TGGCGCCGCC ACCCGCTGGC GTTCTTGGTC GAGGCGGCAG ATGACATATG CTATCGCATT
CTCGACCTCG AAGACGCGGC GACCGTGGGC GATCTAGGCG GCGAGGTGGT TTCTGAAATC
CTCGAAGAGA TCACCGGCAA GCCCAACCGC TCGCCCGAGC CAGAAATGAC CCTGCGCGAG
CGCACCGGCA TGCAGCGCGC GATGGCGATT GGCGCTGCCA TCGACAGCGC GGTTGAGGCG
TTTCTTGAAC ACTACGATGC CATCATGGAC GGCACCTTCA ATGATGGGCT GATGGAAGTG
TCGAGCAAGG CTGCCACCTT TGCCCGGCTG AAAGAGATCT CAAACGCGCG CATTTTCACC
GCCCAGCGTA AAACTGCGCT CGAAGTGGTG GGACGCAAAG TGATCTTCAC GATCCTCGAC
GAATTCCACG CGTTGTTTGT GGCCCTAAAG GCCTGCGACT GGGATGCGCA GCGCTTGCTC
AAGGAACATG GCTACTGGAC CCAGCTCGTG CGCGCTGTCG ATCTCGACCT GCGCGGCGTA
ACGGACGACT ACACGGCCGC CCATGCGCTG ACCGATTTTG TTTCCGGCAT GACCGACCGC
TACGCGATCC GCGTGCGCGA CATGATCACC GGTCAGGTGC CAAGCTGA
 
Protein sequence
MEWTQLLNPR RLCRPDYADK PNRPAYLQDY DRILFSEPFR RLAQKTQVHP LYDHDHVHHR 
MIHSMETSSV GRSLGIQVGE ALVADGRLED GLQHVMAGTV QAACLVHDIG NPPFGHSGEA
SIGAWFAQQF AANNGTGIGI AAGIAPEHRA EFEAFEGNAQ GFRIVSRLEM ARREGGMRLS
YATLGAFAKY PCTASAAADA QDTYVGLKKF GCFAGEEALF AEVASALGLP QERTPSGERW
WRRHPLAFLV EAADDICYRI LDLEDAATVG DLGGEVVSEI LEEITGKPNR SPEPEMTLRE
RTGMQRAMAI GAAIDSAVEA FLEHYDAIMD GTFNDGLMEV SSKAATFARL KEISNARIFT
AQRKTALEVV GRKVIFTILD EFHALFVALK ACDWDAQRLL KEHGYWTQLV RAVDLDLRGV
TDDYTAAHAL TDFVSGMTDR YAIRVRDMIT GQVPS