Gene Amir_5142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5142 
Symbol 
ID8329340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp6123800 
End bp6125368 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content72% 
IMG OID644945577 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_003102809 
Protein GI256379149 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCAGG ACCAGACCGA TCCGCGGGCC AGTCGCCGTT CCGGCGGTGA GGGGACCACG 
GCCGCCGATC TCGCCTCCAG TCCCTTCCGG GTCGACCGGG ACCGGGTGGC TTCGTCGCCG
TTCTTCTCGC GGCTCGGCGG GGTCACGCAG GTGGTCAGCT CGACCGGGTC GGGGTTGTTG
GTGCACAACC GGCTCACGCA CAGCCTCAAG GTCGCGCAGG CGGCCCGTGC CATCGCGGAG
CGGCTCTGCT CGCGGCCCGA GCTGGGTGGG GCGCTGGAGA AGCTCGGGGG GTGCGACCCG
GACGTCGTGG AGGCCGCCTC GCTGGCGCAC GACCTCGGGC ACCCGCCGTT CGGGCACCTC
GGGGAGCAGG TGCTCGACCG GATCGCCCGG CACCGGTTCG GGCTGTCGGA CGGGTTCGAG
GGGAACGCGC AGTCGTTCCG CATCGTGACG ACCACGGACG TGCGCGGGCC CGAGGCGGTG
GGGTTGGACC TCACGGTGGC GGTGCGCGCG GCGATGCTCA AGTACCCGTG GACCCGCCTG
TCCTACCCGG ACCCGCACCC GCGCGACATG GCCTCGCCGC CGCGCGGCGC CGCCGAGCCG
TCCGAGGCGC CGGGGACCGG GTCGGGGAAG TTCTCCGCCT ACGTGACCGA GATGGACGAC
GTGCACGGGG CGCGGCTGCC GTTCGACGGG AAGGTCGAGT CGTGGCAGCA GACCGTCGAG
GCCTCGATCA TGGACACCGC CGACGACATC GCCTACGCGA TCCACGACCT GGAGGACTTC
TACCGGGTCG GGGTGCTCCA GCACGCCACC GTCGCGGCCG AGCTGGGGAC GTGGCTGAAG
CAAGGGCTTG AGCTGGCCTC GTTGAGCGCG GCCGAGCTGG ACGCCCAGGA GCGGCGGCCG
GGGCGGTCGC TGGAAGCGCT GCGGCGGCGG CTGCACCTGA AGGACTCGTG GGCGGTGGAC
GACGACGTGT TCGCCGCCGC CGTGGCCAAG GTGCGCGCCG AGCTGGTCGA CGGGCTGCTC
GCGGTGCCGT TCGACGGGTC CACCGAGGCC GAGCAGGCGG TGGCCGGGTT CTCCGCGCGG
TGGACGCGCA GACTCGTGGA CGCGGTGGGC GTGCTGGAGG AGCCCACGAC GCGGTCCGGG
CACGTGGTGC TCGCGCAAGC TCAGTGGCAC GAGGTGCAGG TGCTCAAGTT CGTGCACCGG
CGGTTCGTGC TGCTGCGGCC GGATCTCGCG CTGCACCAGC GTGGTCAGGC CCGGTTGCTC
ACCGCGCTCG TGGAGGCGCT GGAGCAGTGG GTGACCGACC GGCACGAGGT CGGCAGGTTG
CCGCGCAGGT TGCACGACCT GGTCGAACTG GCCGAGCAGG AGTACGCCCG GTTGGCACTG
GACGACCCTG GGGCTCTTGT CGGCGCTACC GGTGAAAAAC CGTCCGGCCC GGACGCGGTG
CGCTCGCTCG CGCGTGGTCG GGCGGTCGTG GATTTCACCG CCTCGCTCAC CGACAACCAG
GCATCCGCGC TGCTGGAGGC CCTGTCCGGA CGAACCGGGC AACTCTGGAC AGACGCCTTC
GTTCTGTGA
 
Protein sequence
MHQDQTDPRA SRRSGGEGTT AADLASSPFR VDRDRVASSP FFSRLGGVTQ VVSSTGSGLL 
VHNRLTHSLK VAQAARAIAE RLCSRPELGG ALEKLGGCDP DVVEAASLAH DLGHPPFGHL
GEQVLDRIAR HRFGLSDGFE GNAQSFRIVT TTDVRGPEAV GLDLTVAVRA AMLKYPWTRL
SYPDPHPRDM ASPPRGAAEP SEAPGTGSGK FSAYVTEMDD VHGARLPFDG KVESWQQTVE
ASIMDTADDI AYAIHDLEDF YRVGVLQHAT VAAELGTWLK QGLELASLSA AELDAQERRP
GRSLEALRRR LHLKDSWAVD DDVFAAAVAK VRAELVDGLL AVPFDGSTEA EQAVAGFSAR
WTRRLVDAVG VLEEPTTRSG HVVLAQAQWH EVQVLKFVHR RFVLLRPDLA LHQRGQARLL
TALVEALEQW VTDRHEVGRL PRRLHDLVEL AEQEYARLAL DDPGALVGAT GEKPSGPDAV
RSLARGRAVV DFTASLTDNQ ASALLEALSG RTGQLWTDAF VL