Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3805 |
Symbol | |
ID | 5705300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4335085 |
End bp | 4336350 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641273227 |
Product | putative deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_001538589 |
Protein GI | 159039336 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.220802 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCGG TCGGCCCGGA CGAGCGGCGG TGGGTGGACG AGCCGGCAAA GGACAGCGGG CACGGGCGGT CGGCCTACGA ACGGGACCGT GCCCGGGTGC TGCACTCGGC GGCCTTCCGG CGGCTCGCCG CCAAGACCCA GGTGCACACC GCCGGCACCG ACGACTTTCT GCGGACCAGG TTGACGCACT CGTTGGAGGT CGCCCAGGTC GCCCGCGAGA TGGGCAGCCG GCTCGGCTGC GATCCCGACG TGGTGGACAC CGCCGGGCTC GCCCACGACC TCGGGCACCC GCCGTTCGGA CACAGTGGCG AGGAGGCGCT GGACGCCCTC GCCGCCGCGT GCGGCGGTTT TGAGGGCAAC GCACAGACGC TGCGCGTCCT CACCCGCCTG GAGGCGAAGG TGATCGGTCC GGACGGTGCC TCCGCCGGGC TGAACCTCAC CCGGGCGTCG CTCGACGCGG TCAGCAAGTA CCCGTGGCCG CGCCGGCCGG GCGAACGCAA GTTCGGCGTG TACGCCGACG ACCGCCCGGT CTTCGCGTGG CTGCGTGCCG ACGTGCCGGA CCGGCGGCGG TGCCTGGAAG CGCAGGTGAT GGACTGGGCC GATGACGTCG CGTACTCGGT GCACGACGTC GAGGACGGCA TCCACGGCGG CTACGTGACG CTGCGCCCGT TGTTGGCGCA GGCCGACGAG CGGGCGGCGC TGTGCGCCGA CGTCGCCGCG ACGTACTCCG GCGAGTCTCC GGCCGACCTC GCGGAGGTGC TGGTCGACCT GCTCGCCGAT CCGCTGCTCG CGCCCCTCGT GGGCTACGAC GGCAGCCACC GGGCGCAGGT CGCGCTGAAG GCGACCACCA GCGTGCTCAC CGGGCGTTTC GTCGCCGCCG CCGTGGCCGC CACCGGGCGC CGGTTCGGGC CCGGCCCGCA TCGCCGGTAC GCCGCCGACC TGGTCGTGCC GCGCGAGGTC CGGGCCCGGT GCGCTGTGCT CAAGGGCATC GCCCTGCGGT ACGTACTGCG TCGCCCCGGC TCCGTGGCCC GTCTCGAGCG GCAGCAGCAG ATCCTCGCCG ACCTGGTCGC TGGCCTGGCC GACCGGGCCC CCGAGGCGTT GGATGCCGTG TTCGCTCCCT TGTGGCGCGC CGCCGGGAAC GATGCGAGCC GGCTGCGGGT GGTTGTCGAT CAGGTGGCGT CGTTGACGGA TCCGGCGGCG GTGGAGCGGC ATGCCCGGCT GTTCGGTGGT CCGACCGCGT CCGGCGGGCA GACCGACTTA GGTTAA
|
Protein sequence | MTSVGPDERR WVDEPAKDSG HGRSAYERDR ARVLHSAAFR RLAAKTQVHT AGTDDFLRTR LTHSLEVAQV AREMGSRLGC DPDVVDTAGL AHDLGHPPFG HSGEEALDAL AAACGGFEGN AQTLRVLTRL EAKVIGPDGA SAGLNLTRAS LDAVSKYPWP RRPGERKFGV YADDRPVFAW LRADVPDRRR CLEAQVMDWA DDVAYSVHDV EDGIHGGYVT LRPLLAQADE RAALCADVAA TYSGESPADL AEVLVDLLAD PLLAPLVGYD GSHRAQVALK ATTSVLTGRF VAAAVAATGR RFGPGPHRRY AADLVVPREV RARCAVLKGI ALRYVLRRPG SVARLERQQQ ILADLVAGLA DRAPEALDAV FAPLWRAAGN DASRLRVVVD QVASLTDPAA VERHARLFGG PTASGGQTDL G
|
| |