Gene Information |
Locus tag | Nmul_A0704 |
Symbol | |
ID | 3786166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 810049 |
End bp | 811170 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810786 |
Product | deoxyguanosinetriphosphate triphosphohydrolase-like protein |
Protein accession | YP_411403 |
Protein GI | 82701837 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG; [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
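The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 810049
end_bp = 811170

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length_bp % 3 == 0

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1122 373
```

Under these assumptions the result agrees with the Gene Length (1122 bp) and Protein Length (373 aa) fields.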
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGAGC TGGCGGCCTA CGCGGTATCA ACTGCCAACT CCCGGGGGCG CCGCATTGCG GAAGAAGCCT CTCCCGGTCG CACGCCTTTT CAGCGCGATC GCGATCGCAT CATTCATTCC ACCGCTTTTC GCAGGCTTGA GTACAAGACC CAGGTTTTCG TCAATCATGA AGGGGATTTG TTCCGCACAC GCCTGACGCA CAGTCTCGAA GTGGCCCAGA TCGGCCGCTC CGTAGCCCGG AATTTGCGCC TGAATGAGGA CCTGGTTGAG GCTATCGCGC TGGCGCATGA TCTCGGCCAT ACCGCCTTCG GTCATGCCGG GCAGGACGCG CTGAACGAGT GCATGAAGGA ATATGGCGGC TTTGAGCATA ATCTCCAATC CCTGCGGGTG GTGGATGTGC TGGAAGAGCA CTATGGCGCA TTCGACGGGT TGAATCTGTG CTTTGAAACC CGCGAAGGTA TTCTCAAGCA TTGTTCGAAG AAAAATGCGC TGGAGCTGGG GGACGTTGGT GAGCGCTTCC TCACGAATCG TCGGCCTTCG CTGGAAGCGC AAGTGGCCAA TCTCGCTGAC GAGATCGCTT ACAACAATCA CGACGTGGAC GACGGCTTGA GATCCGGTCT CGTCACGCAG CAGCAACTTG AGGGAGTCGG CATATTTGCG CGTCATCTGG CAATGGCCAG ACAGCAATAC CCGAAAATCT CCGGAAGGCG GCTGGTCCAT GAAACCGTGC GGCGCATGAT CAATACGCTG GCGGGAGATT TGATCAGACA AAGCGCAGTG AATATCGCGC AGGCCAGCCC GGTCACACTG GACGAAATCC GGGCCGCTCC CCCGCTGATC GGATTCAGCA GGGAAATTGC CCAGGAGCAG CAGGAGCTCA AAAAGTTTCT GCGGGAGCAT CTTTACCGCC ACTACAAGGT ATCGCGCATG AGTGCAAAGG CACGGTACAT CATCCGCCAG CTCTTTGACG CGTTCAATTC CGATATCCGT TTGTTGCCTC CGGAATTCCA GTCCAGGTAT CAGCAGGACA AACATCAGGC TATCGCCGAC TATATCGCAG GCATGACGGA CCGGTATGCC ATTCGTGAAT ACCGGCGTCT CTTCGTTGTT GAGGAAAGCT GA
Protein sequence | MHELAAYAVS TANSRGRRIA EEASPGRTPF QRDRDRIIHS TAFRRLEYKT QVFVNHEGDL FRTRLTHSLE VAQIGRSVAR NLRLNEDLVE AIALAHDLGH TAFGHAGQDA LNECMKEYGG FEHNLQSLRV VDVLEEHYGA FDGLNLCFET REGILKHCSK KNALELGDVG ERFLTNRRPS LEAQVANLAD EIAYNNHDVD DGLRSGLVTQ QQLEGVGIFA RHLAMARQQY PKISGRRLVH ETVRRMINTL AGDLIRQSAV NIAQASPVTL DEIRAAPPLI GFSREIAQEQ QELKKFLREH LYRHYKVSRM SAKARYIIRQ LFDAFNSDIR LLPPEFQSRY QQDKHQAIAD YIAGMTDRYA IREYRRLFVV EES
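The relationship between the two sequence fields can be sketched in code. The example below is a minimal illustration, not the pipeline that produced this record: it strips the spacing used in the sequence fields, computes a GC fraction, and translates codons with the amino acid assignments of translation table 11. The function names `clean`, `gc_fraction` and `translate` are hypothetical. Table 11 differs from the standard code only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
# These are the same assignments as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)

def translate(seq: str) -> str:
    """Translate whole codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence from the record.
prefix = "ATGCATGAGC TGGCGGCCTA CGCGGTATCA"
print(translate(prefix))  # MHELAAYAVS
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence value to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.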