Gene Namu_2286 details

Gene Information

Locus tag: Namu_2286
Symbol:
ID: 8447897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2522163
End bp: 2523743
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 74%
IMG OID: 645041408
Product: deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_003201652
Protein GI: 258652496
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
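
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2522163, 2523743)
print(length)                  # 1581
print(protein_length(length))  # 526
```

Both results match the Gene Length (1581 bp) and Protein Length (526 aa) listed in the record.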


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0244087
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00386241
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATCCGC GCCTGGCCCG CCGGTGGGCC GAGGTCCCCA CGCTGAGCCC GGCCGCCGTC 
GCCGCGGACG GCGCGCGGGG CGCCGACGCC GAACTGGCCG CGGCCACCGA GGGGGCGTTC
CGGGACGACC TGGAACGAAT CCGGTTCTCC CCGTACTTCT CCCGCTTGGC CGCCGTCACC
CAGGTGATCT CCCAGGGGGC GTCCGGCCAG GTCGTGCACA ACCGGCTGAC CCACACGGTC
AAGGTCACCT CGGTGGCCCG GGCGATCGCC GTCGGGTTGC GCCGCGGTCC CTACGCGCAG
CTTGCCGACG ATCTCGGCGG CTGCGACGCG GTCGTCGTGC AGGCGGCGGC CAGCGCGCAC
GACCTGGGCC ACCCGCCGTT CGGTCATCTG GGGGAGCGGA TCCTGGACCG GATCGCCCGC
TCCCGGTTCG GCCTGGCCGA CGGTTTCGAG GGCAACGCGC AGACCTTCCG CATCCTGACC
GAGCTGGACG TGCACGGGGA GTCCGGCGAG GGACTGAACC TGACCGCGGC CGTGCGGGCC
GCCGTGCTGA AGTACCCGTG GTCGCGGCTG CACGTGCCCG ACCCGCACCC GAGCACCTTG
GCCCAGCCGC CCCGCGGCGG CGGGCCCGGG GAAGAGGGGG CCGGGTCGGG CAAGTACTCG
GCCTACGTGC TCGACGTCGG TGAGATGCGC GAGGTGCTGG CCGCGTACCC CAAGATCGGC
CCGTTGCGGC AGACCGTCGA GTGCTCGGTG ATGGACGCCG CCGACGACAT CGCCTACTCC
CTGCACGATC TGGACGACTT CCACCGGGCC GGGGTGCTCC AGCACGCCTC CGTCGCGGCC
GAGTTCCGCA GCTGGTTGCG CCGCCGGGCC GAGTTCTCCC GGCGCACCCT GCCCGAGGAC
GACCGGCGGC CCGGGGTGGC CCTGGAACGG CTGCGCCGGC GGCTGCAGGA CCGGGACGAG
TGGATCTTCC AGGACGAGGC GTTCGCGGTC GCGGTCGGTC GGGTGGCCAC TGACCTGCTG
GACGGGTTGC TGGCCGTGCC GTTCGATTCC TCGCTGGCCG CGGAGCGGGC CATCGGCACC
TTTACCCGGA GCTGGATCGC GCACCTGCAG GAGTCGGTGG AGATGACCGC CGACCCGCCG
ATCCGCTCCG GACACGTTCA GCTGGGCCGG CAGGCCTGGC ACGAGGTCGC CGTGCTCAAG
TTCGTGCACC AGCGGTTCGT GCTCGAGCGG CCGGATCTGG CCCTGTACCA GCGGGGCCAG
GCGCAGTCGC TGTCCTCGCT GGTTGCCGAC CTGGAGTCGT GGCTGACCGA CCCGATCGAC
TCGGGCCGGG CGCCGCGCCG GCTGGTCGAC CTGGTGGCCC TGGCTACCGC CGGCTACCGG
CGGGTCGCCC GCGAGGAACC GGAGCTGCTG GTCGGCCCGA CCGGGGAACC GATGTCCGGG
CGCGAGGACA TCGTCCGGCT GGGCCGGGGC CGCGGCATCA TCGACTACGT CGCCTCGCTG
ACCGACGACC GGGCCGGCGC CGCCGCCCGC ACGCTGTCGG GTCTGACCGG GCAGCTGTTC
GAAGCCGGGT CCGGGTTGTG A
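
A GC content figure such as the one in the Gene Information section can be recomputed from a listing in this format. The sketch below assumes that GC content means the fraction of G and C bases over the total sequence length, and that the spaces and line breaks in the listing are layout only. The function name `gc_content` is hypothetical.

```python
def gc_content(listing: str) -> float:
    """Fraction of G/C bases in a sequence listing, ignoring whitespace."""
    # Drop the 10-base block separators and line breaks used in the listing.
    seq = "".join(listing.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)


# Small illustrative input, not the gene itself.
print(gc_content("GGCC AATT"))  # 0.5
```

To apply it to this gene, pass the full Gene sequence block as a single string and multiply the result by 100 to get a percentage.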
 
Protein sequence
MDPRLARRWA EVPTLSPAAV AADGARGADA ELAAATEGAF RDDLERIRFS PYFSRLAAVT 
QVISQGASGQ VVHNRLTHTV KVTSVARAIA VGLRRGPYAQ LADDLGGCDA VVVQAAASAH
DLGHPPFGHL GERILDRIAR SRFGLADGFE GNAQTFRILT ELDVHGESGE GLNLTAAVRA
AVLKYPWSRL HVPDPHPSTL AQPPRGGGPG EEGAGSGKYS AYVLDVGEMR EVLAAYPKIG
PLRQTVECSV MDAADDIAYS LHDLDDFHRA GVLQHASVAA EFRSWLRRRA EFSRRTLPED
DRRPGVALER LRRRLQDRDE WIFQDEAFAV AVGRVATDLL DGLLAVPFDS SLAAERAIGT
FTRSWIAHLQ ESVEMTADPP IRSGHVQLGR QAWHEVAVLK FVHQRFVLER PDLALYQRGQ
AQSLSSLVAD LESWLTDPID SGRAPRRLVD LVALATAGYR RVAREEPELL VGPTGEPMSG
REDIVRLGRG RGIIDYVASL TDDRAGAAAR TLSGLTGQLF EAGSGL
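
The protein sequence can be cross-checked against the gene sequence by translating the latter. The sketch below is an illustration, not the tool the database used. It builds the codon table from the NCBI ordering of bases (T, C, A, G) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(listing: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    dna = "".join(listing.split()).upper()  # strip block separators
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGGATCCGC GCCTGGCCCG CCGGTGGGCC"))  # MDPRLARRWA
print(translate("GAAGCCGGGT CCGGGTTGTG A"))           # EAGSGL
```

The two outputs match the first ten and last six residues of the protein sequence listed above. The final line of the gene can be translated on its own because it begins at base 1561, which is in frame.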