Gene Namu_1931 details


Gene Information

Locus tag: Namu_1931
Symbol:
ID: 8447538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2128614
End bp: 2129888
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 76%
IMG OID: 645041061
Product: deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_003201309
Protein GI: 258652153
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
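
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2128614
END_BP = 2129888
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2129888 - 2128614 + 1 gives 1275 bp, and 1275 / 3 - 1 gives 424 aa, which agrees with the listed lengths.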


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.228057
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00209932
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTCGCCG CCAGCGAGAC GCCGGCCAGC CCCGGCTACG ACGCCTACGA CCTGATGCGG 
CGGCTGGCGG AGCTGCCCAA GACCGCGCCG CTGCCCGGCA CCGCCGACGC CGGCGGCCGC
AGCCCGTTCG CCCGGGACCG GGCCCGGGTG CTGCACTCCA AATCGTTCCG CCGGTTGGCC
GGCAAGACCC AGGTGGTGGC GCCGGACGAG GAGGGGGTGC CCCGGACGCG GCTGACCCAC
TCGCTGGAGG TCGCGCAGAT CGCCCGGGAG ATCGGCGCCC AGCTGGGCTG CGACCCCGAC
CTGGTCGACC TGGCCGGGCT GGCCCACGAC ATCGGGCACC CGCCGTTCGG GCACAACGGG
GAGGCAGCCC TGGACCGGAT CGGGGCCGCC GCCGGCGGGT TCGAGGCCAA TGCGCAGAAC
CTGCGGCTGC TGGCCCGGCT CGAACCCAAG GTCGTCGCCG TCGACGGCCG GCCGGGCGGG
CTGAACCTGA CCCGGGCGGC GCTGGACGCG GTGATCAAGT ACCCGTGGTC GCGGCCGGCC
GGCGGCGGCA AGTTCGGCGT CTACGCCGAC GAGCAGGCGG TGTTCGGCTG GGTCCGCGAA
TCGGCGCCCG GGACCCGGCG CTGCCTGGAG GCCCAGGTGA TGGACTGGGC CGACGACGTC
GCCTACTCGG TGCACGACGT CGAGGACGGT CTGGACGCCG GCCGGATCGA CCTGACCCGG
CTGGCCGACC CGGACGAGCG GGACGCGGTC TGCGCCGCCG CCCGTCCCTA CAGCGACGAG
TCCACCGATG ACCTGCGCAC CGTGCTGGAC GACCTGCTGG CCCTGCCGGC GGTGGCCGGC
CGCGGGCAGT ACCCGCCCGG CGCGCTCGCC GACGCCGCGG TCAAGGCCAT GACCAGCGAG
CTGACCGGGC GGTTCTGCAC CGGCGCGATC GCCGCCACCC GGGCCGCGGC CGGCGACGGA
CCGCTGCTGC GGTACCGCGC CGACCTGCAG GTGCCGCGGC GGCTGCGAGC CGAGGTGGCC
GTGCTCAAGG CGGTCGCCGG CCGGTACGTG ATAGCCGACC CGAGCCGGCT GCGCGCCCAG
GAACGCGAGC AGCAGATCCT CACCGACCTG GTGCGGGTGA CCGCGGACCG CGGCGTCGAC
GCCCTGGACC CGGAGTTCCG GTCCGGCTTC GCGGCGGCCA CCGACGACGC GGCCCGGCTG
CGGATCGTGC TGGACCAGAT CAGCCTGCTC ACCGACGCGC AGGCGATCGC CCGGCACCAA
CGACTGCGCG GCTGA
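
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be compared with the fields in the Gene Information section. The following is a minimal sketch of one way to do that. The helper names are hypothetical, and the example uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, as formatted in this record.
    first_line = (
        "ATGCTCGCCG CCAGCGAGAC GCCGGCCAGC "
        "CCCGGCTACG ACGCCTACGA CCTGATGCGG"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full sequence, the same two functions give values that can be checked against the Gene Length and GC content fields.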
 
Protein sequence
MLAASETPAS PGYDAYDLMR RLAELPKTAP LPGTADAGGR SPFARDRARV LHSKSFRRLA 
GKTQVVAPDE EGVPRTRLTH SLEVAQIARE IGAQLGCDPD LVDLAGLAHD IGHPPFGHNG
EAALDRIGAA AGGFEANAQN LRLLARLEPK VVAVDGRPGG LNLTRAALDA VIKYPWSRPA
GGGKFGVYAD EQAVFGWVRE SAPGTRRCLE AQVMDWADDV AYSVHDVEDG LDAGRIDLTR
LADPDERDAV CAAARPYSDE STDDLRTVLD DLLALPAVAG RGQYPPGALA DAAVKAMTSE
LTGRFCTGAI AATRAAAGDG PLLRYRADLQ VPRRLRAEVA VLKAVAGRYV IADPSRLRAQ
EREQQILTDL VRVTADRGVD ALDPEFRSGF AAATDDAARL RIVLDQISLL TDAQAIARHQ
RLRG
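
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming a forward-frame CDS that starts at the first base. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in the set of permitted start codons, which does not matter here because the gene starts with ATG). The `translate` function is illustrative, not a library API.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases and last 15 bases of the gene sequence in this record.
    print(translate("ATGCTCGCCGCCAGCGAGACG"))  # MLAASET
    print(translate("CGACTGCGCGGCTGA"))        # RLRG
```

The two fragments reproduce the first seven residues (MLAASET) and the last four residues (RLRG) of the listed protein sequence. The final codon, TGA, is a stop codon and is not translated.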