Gene TM1040_0902 details

Gene Information

Locus tag: TM1040_0902
Symbol:
ID: 4076272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 962323
End bp: 963465
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 62%
IMG OID: 638006204
Product: deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_612897
Protein GI: 99080743
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
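
The start, end, and length fields of this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 962323
end_bp = 963465

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1143
print(protein_length)  # 380
```

Both results agree with the listed Gene Length (1143 bp) and Protein Length (380 aa).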


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGCGC CCTATGCCAC CATGCCCGAC CGTTCGCGCG GGCGGGCTGT TCCCGAAGAA 
GAGAGCAGTT TTCGGTCTCC CTTTCAGCGC GATAGGGACC GGATCATCCA TGCCAGCGCC
TTTCGGCGCC TGAAGCACAA GACGCAAGTG TTTGTGGAGC ATGAGGGCGA CAATTATCGC
ACCCGGCTTA CCCACTCCAT CGAAGTAGGA CAGGTGGCGC GCACCATTGC AGGCGCGCTG
GGGCTCAATC AGGAGCTCAC CGAGGCCGTC GCGCTGGCGC ATGATCTTGG TCATACGCCC
TTTGGCCACA CCGGCGAGGA CGCGCTGCAT GAGATGATGG CGCCCTATGG CGGATTTGAC
CACAACGCGC AGGCCATTCG CATCGTGACG GCGCTGGAGC GTCACTATGC AGAGTTTGAT
GGTCTGAACC TCACCTGGGA GACGCTGGAG GCGATTGCCA AGCACAATGG CCCGGTTGTG
GGGGAGCTGC CCTGGGCCTT GGCGGCCTGC AACCGGGGCA TCGATCTGGA GCTGCACACC
CACGCCAGCG CCGAGGCGCA GGTGGCGGCC CTGGCGGATG ACATCGCCTA CAATCACCAC
GATCTGCACG ACGGTTTGCG GGCCGGGCTT TTCACGGATG ACGATGTGTG CAGCTTGTCG
ATCATCGCGC CCGCTTACGC GGAGGTGGAT GAGATCTATC CGGGGCTGGA TCATAATCGC
CGTCGCCACG AGGCGCTGCG GCGGTTCTTT GGGGTTATGG TCAGTGATGT GATCGAGACC
TCGCGGCGTA AGATTGCGGC CTCTGGTGCG CAGTCGGTGG AGGAGATCCG GGCGCTGGAT
CATGCGGTTG TGACCTTTTC GGATGAGATC TGGACCCAGC TCAGAGAGCT GCGGGCCTTC
ATGTTCACCC GCATGTACCG CGCTCCTTCG GTGATGGTGG TGCGCGAGCG TGTCGCCGTC
GTGGTGAAGG CGCTGTTTGC CTATTACCTC GAAAACACCA TGGCGATGCC CGAGCGCTGG
CATGGCGACA TTCGCAAGGC CGAAACAGAG ACTGACCGGG CGCGGATCGT ATCGGACTAT
ATCGCCGGGA TGACGGATCG TTTTGCGCTG CAGCTCTATG ATCGCTTGGC GCTTGGGGCT
TGA
 
Protein sequence
MYAPYATMPD RSRGRAVPEE ESSFRSPFQR DRDRIIHASA FRRLKHKTQV FVEHEGDNYR 
TRLTHSIEVG QVARTIAGAL GLNQELTEAV ALAHDLGHTP FGHTGEDALH EMMAPYGGFD
HNAQAIRIVT ALERHYAEFD GLNLTWETLE AIAKHNGPVV GELPWALAAC NRGIDLELHT
HASAEAQVAA LADDIAYNHH DLHDGLRAGL FTDDDVCSLS IIAPAYAEVD EIYPGLDHNR
RRHEALRRFF GVMVSDVIET SRRKIAASGA QSVEEIRALD HAVVTFSDEI WTQLRELRAF
MFTRMYRAPS VMVVRERVAV VVKALFAYYL ENTMAMPERW HGDIRKAETE TDRARIVSDY
IAGMTDRFAL QLYDRLALGA
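
The relationship between the two sequences can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced the record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence (0.0 for an empty string)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
first_block = "ATGTACGCGC CCTATGCCAC CATGCCCGAC"
print(translate(first_block))  # MYAPYATMPD
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the listed GC content to be checked in the same way.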