Gene Information |
Locus tag | Namu_1931 |
Symbol | |
ID | 8447538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2128614 |
End bp | 2129888 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 645041061 |
Product | deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_003201309 |
Protein GI | 258652153 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
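The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon; neither is stated in the record, although both are consistent with its values. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the record above.
START_BP, END_BP = 2128614, 2129888

length_bp = gene_length_bp(START_BP, END_BP)
length_aa = protein_length_aa(length_bp)
print(f"gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields.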
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.228057 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00209932 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCGCCG CCAGCGAGAC GCCGGCCAGC CCCGGCTACG ACGCCTACGA CCTGATGCGG CGGCTGGCGG AGCTGCCCAA GACCGCGCCG CTGCCCGGCA CCGCCGACGC CGGCGGCCGC AGCCCGTTCG CCCGGGACCG GGCCCGGGTG CTGCACTCCA AATCGTTCCG CCGGTTGGCC GGCAAGACCC AGGTGGTGGC GCCGGACGAG GAGGGGGTGC CCCGGACGCG GCTGACCCAC TCGCTGGAGG TCGCGCAGAT CGCCCGGGAG ATCGGCGCCC AGCTGGGCTG CGACCCCGAC CTGGTCGACC TGGCCGGGCT GGCCCACGAC ATCGGGCACC CGCCGTTCGG GCACAACGGG GAGGCAGCCC TGGACCGGAT CGGGGCCGCC GCCGGCGGGT TCGAGGCCAA TGCGCAGAAC CTGCGGCTGC TGGCCCGGCT CGAACCCAAG GTCGTCGCCG TCGACGGCCG GCCGGGCGGG CTGAACCTGA CCCGGGCGGC GCTGGACGCG GTGATCAAGT ACCCGTGGTC GCGGCCGGCC GGCGGCGGCA AGTTCGGCGT CTACGCCGAC GAGCAGGCGG TGTTCGGCTG GGTCCGCGAA TCGGCGCCCG GGACCCGGCG CTGCCTGGAG GCCCAGGTGA TGGACTGGGC CGACGACGTC GCCTACTCGG TGCACGACGT CGAGGACGGT CTGGACGCCG GCCGGATCGA CCTGACCCGG CTGGCCGACC CGGACGAGCG GGACGCGGTC TGCGCCGCCG CCCGTCCCTA CAGCGACGAG TCCACCGATG ACCTGCGCAC CGTGCTGGAC GACCTGCTGG CCCTGCCGGC GGTGGCCGGC CGCGGGCAGT ACCCGCCCGG CGCGCTCGCC GACGCCGCGG TCAAGGCCAT GACCAGCGAG CTGACCGGGC GGTTCTGCAC CGGCGCGATC GCCGCCACCC GGGCCGCGGC CGGCGACGGA CCGCTGCTGC GGTACCGCGC CGACCTGCAG GTGCCGCGGC GGCTGCGAGC CGAGGTGGCC GTGCTCAAGG CGGTCGCCGG CCGGTACGTG ATAGCCGACC CGAGCCGGCT GCGCGCCCAG GAACGCGAGC AGCAGATCCT CACCGACCTG GTGCGGGTGA CCGCGGACCG CGGCGTCGAC GCCCTGGACC CGGAGTTCCG GTCCGGCTTC GCGGCGGCCA CCGACGACGC GGCCCGGCTG CGGATCGTGC TGGACCAGAT CAGCCTGCTC ACCGACGCGC AGGCGATCGC CCGGCACCAA CGACTGCGCG GCTGA
|
Protein sequence | MLAASETPAS PGYDAYDLMR RLAELPKTAP LPGTADAGGR SPFARDRARV LHSKSFRRLA GKTQVVAPDE EGVPRTRLTH SLEVAQIARE IGAQLGCDPD LVDLAGLAHD IGHPPFGHNG EAALDRIGAA AGGFEANAQN LRLLARLEPK VVAVDGRPGG LNLTRAALDA VIKYPWSRPA GGGKFGVYAD EQAVFGWVRE SAPGTRRCLE AQVMDWADDV AYSVHDVEDG LDAGRIDLTR LADPDERDAV CAAARPYSDE STDDLRTVLD DLLALPAVAG RGQYPPGALA DAAVKAMTSE LTGRFCTGAI AATRAAAGDG PLLRYRADLQ VPRRLRAEVA VLKAVAGRYV IADPSRLRAQ EREQQILTDL VRVTADRGVD ALDPEFRSGF AAATDDAARL RIVLDQISLL TDAQAIARHQ RLRG
|
| |
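The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names are hypothetical, and only the first three groups of the gene sequence are used here, so the result describes that prefix and not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base groups of the gene sequence in the record.
prefix = clean_sequence("ATGCTCGCCG CCAGCGAGAC GCCGGCCAGC")

print(len(prefix), prefix.startswith("ATG"))
print(f"GC fraction of prefix: {gc_fraction(prefix):.3f}")
```

Applying `gc_fraction` to the complete gene sequence, after passing it through `clean_sequence`, would give a value that can be compared against the GC content field of the record.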