Gene Information |
Locus tag | Namu_2286 |
Symbol | |
ID | 8447897 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2522163 |
End bp | 2523743 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645041408 |
Product | deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_003201652 |
Protein GI | 258652496 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
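The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2522163
END_BP = 2523743


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```

Both results agree with the Gene Length (1581 bp) and Protein Length (526 aa) rows of the record.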
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0244087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00386241 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATCCGC GCCTGGCCCG CCGGTGGGCC GAGGTCCCCA CGCTGAGCCC GGCCGCCGTC GCCGCGGACG GCGCGCGGGG CGCCGACGCC GAACTGGCCG CGGCCACCGA GGGGGCGTTC CGGGACGACC TGGAACGAAT CCGGTTCTCC CCGTACTTCT CCCGCTTGGC CGCCGTCACC CAGGTGATCT CCCAGGGGGC GTCCGGCCAG GTCGTGCACA ACCGGCTGAC CCACACGGTC AAGGTCACCT CGGTGGCCCG GGCGATCGCC GTCGGGTTGC GCCGCGGTCC CTACGCGCAG CTTGCCGACG ATCTCGGCGG CTGCGACGCG GTCGTCGTGC AGGCGGCGGC CAGCGCGCAC GACCTGGGCC ACCCGCCGTT CGGTCATCTG GGGGAGCGGA TCCTGGACCG GATCGCCCGC TCCCGGTTCG GCCTGGCCGA CGGTTTCGAG GGCAACGCGC AGACCTTCCG CATCCTGACC GAGCTGGACG TGCACGGGGA GTCCGGCGAG GGACTGAACC TGACCGCGGC CGTGCGGGCC GCCGTGCTGA AGTACCCGTG GTCGCGGCTG CACGTGCCCG ACCCGCACCC GAGCACCTTG GCCCAGCCGC CCCGCGGCGG CGGGCCCGGG GAAGAGGGGG CCGGGTCGGG CAAGTACTCG GCCTACGTGC TCGACGTCGG TGAGATGCGC GAGGTGCTGG CCGCGTACCC CAAGATCGGC CCGTTGCGGC AGACCGTCGA GTGCTCGGTG ATGGACGCCG CCGACGACAT CGCCTACTCC CTGCACGATC TGGACGACTT CCACCGGGCC GGGGTGCTCC AGCACGCCTC CGTCGCGGCC GAGTTCCGCA GCTGGTTGCG CCGCCGGGCC GAGTTCTCCC GGCGCACCCT GCCCGAGGAC GACCGGCGGC CCGGGGTGGC CCTGGAACGG CTGCGCCGGC GGCTGCAGGA CCGGGACGAG TGGATCTTCC AGGACGAGGC GTTCGCGGTC GCGGTCGGTC GGGTGGCCAC TGACCTGCTG GACGGGTTGC TGGCCGTGCC GTTCGATTCC TCGCTGGCCG CGGAGCGGGC CATCGGCACC TTTACCCGGA GCTGGATCGC GCACCTGCAG GAGTCGGTGG AGATGACCGC CGACCCGCCG ATCCGCTCCG GACACGTTCA GCTGGGCCGG CAGGCCTGGC ACGAGGTCGC CGTGCTCAAG TTCGTGCACC AGCGGTTCGT GCTCGAGCGG CCGGATCTGG CCCTGTACCA GCGGGGCCAG GCGCAGTCGC TGTCCTCGCT GGTTGCCGAC CTGGAGTCGT GGCTGACCGA CCCGATCGAC TCGGGCCGGG CGCCGCGCCG GCTGGTCGAC CTGGTGGCCC TGGCTACCGC CGGCTACCGG CGGGTCGCCC GCGAGGAACC GGAGCTGCTG GTCGGCCCGA CCGGGGAACC GATGTCCGGG CGCGAGGACA TCGTCCGGCT GGGCCGGGGC CGCGGCATCA TCGACTACGT CGCCTCGCTG ACCGACGACC GGGCCGGCGC CGCCGCCCGC ACGCTGTCGG GTCTGACCGG GCAGCTGTTC GAAGCCGGGT CCGGGTTGTG A
|
Protein sequence | MDPRLARRWA EVPTLSPAAV AADGARGADA ELAAATEGAF RDDLERIRFS PYFSRLAAVT QVISQGASGQ VVHNRLTHTV KVTSVARAIA VGLRRGPYAQ LADDLGGCDA VVVQAAASAH DLGHPPFGHL GERILDRIAR SRFGLADGFE GNAQTFRILT ELDVHGESGE GLNLTAAVRA AVLKYPWSRL HVPDPHPSTL AQPPRGGGPG EEGAGSGKYS AYVLDVGEMR EVLAAYPKIG PLRQTVECSV MDAADDIAYS LHDLDDFHRA GVLQHASVAA EFRSWLRRRA EFSRRTLPED DRRPGVALER LRRRLQDRDE WIFQDEAFAV AVGRVATDLL DGLLAVPFDS SLAAERAIGT FTRSWIAHLQ ESVEMTADPP IRSGHVQLGR QAWHEVAVLK FVHQRFVLER PDLALYQRGQ AQSLSSLVAD LESWLTDPID SGRAPRRLVD LVALATAGYR RVAREEPELL VGPTGEPMSG REDIVRLGRG RGIIDYVASL TDDRAGAAAR TLSGLTGQLF EAGSGL
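The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is illustrative only and is not the pipeline that generated this record. It assumes the sequence is given 5' to 3' on the coding strand, strips the spaces that separate the 10-base blocks, and uses the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs only in the set of permitted start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGATCCGC GCCTGGCCCG CCGGTGGGCC"))  # MDPRLARRWA
# Last blocks of the gene sequence, ending in the TGA stop codon.
print(translate("GAAGCCGGGT CCGGGTTGTG A"))  # EAGSGL
```

Passing the full gene sequence to `translate` should yield the full protein sequence, and `gc_content` on the full sequence can be compared with the GC content row of the record.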
|
| |