Gene Nmul_A0704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0704 
Symbol 
ID3786166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp810049 
End bp811170 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content57% 
IMG OID637810786 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_411403 
Protein GI82701837 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGAGC TGGCGGCCTA CGCGGTATCA ACTGCCAACT CCCGGGGGCG CCGCATTGCG 
GAAGAAGCCT CTCCCGGTCG CACGCCTTTT CAGCGCGATC GCGATCGCAT CATTCATTCC
ACCGCTTTTC GCAGGCTTGA GTACAAGACC CAGGTTTTCG TCAATCATGA AGGGGATTTG
TTCCGCACAC GCCTGACGCA CAGTCTCGAA GTGGCCCAGA TCGGCCGCTC CGTAGCCCGG
AATTTGCGCC TGAATGAGGA CCTGGTTGAG GCTATCGCGC TGGCGCATGA TCTCGGCCAT
ACCGCCTTCG GTCATGCCGG GCAGGACGCG CTGAACGAGT GCATGAAGGA ATATGGCGGC
TTTGAGCATA ATCTCCAATC CCTGCGGGTG GTGGATGTGC TGGAAGAGCA CTATGGCGCA
TTCGACGGGT TGAATCTGTG CTTTGAAACC CGCGAAGGTA TTCTCAAGCA TTGTTCGAAG
AAAAATGCGC TGGAGCTGGG GGACGTTGGT GAGCGCTTCC TCACGAATCG TCGGCCTTCG
CTGGAAGCGC AAGTGGCCAA TCTCGCTGAC GAGATCGCTT ACAACAATCA CGACGTGGAC
GACGGCTTGA GATCCGGTCT CGTCACGCAG CAGCAACTTG AGGGAGTCGG CATATTTGCG
CGTCATCTGG CAATGGCCAG ACAGCAATAC CCGAAAATCT CCGGAAGGCG GCTGGTCCAT
GAAACCGTGC GGCGCATGAT CAATACGCTG GCGGGAGATT TGATCAGACA AAGCGCAGTG
AATATCGCGC AGGCCAGCCC GGTCACACTG GACGAAATCC GGGCCGCTCC CCCGCTGATC
GGATTCAGCA GGGAAATTGC CCAGGAGCAG CAGGAGCTCA AAAAGTTTCT GCGGGAGCAT
CTTTACCGCC ACTACAAGGT ATCGCGCATG AGTGCAAAGG CACGGTACAT CATCCGCCAG
CTCTTTGACG CGTTCAATTC CGATATCCGT TTGTTGCCTC CGGAATTCCA GTCCAGGTAT
CAGCAGGACA AACATCAGGC TATCGCCGAC TATATCGCAG GCATGACGGA CCGGTATGCC
ATTCGTGAAT ACCGGCGTCT CTTCGTTGTT GAGGAAAGCT GA
 
Protein sequence
MHELAAYAVS TANSRGRRIA EEASPGRTPF QRDRDRIIHS TAFRRLEYKT QVFVNHEGDL 
FRTRLTHSLE VAQIGRSVAR NLRLNEDLVE AIALAHDLGH TAFGHAGQDA LNECMKEYGG
FEHNLQSLRV VDVLEEHYGA FDGLNLCFET REGILKHCSK KNALELGDVG ERFLTNRRPS
LEAQVANLAD EIAYNNHDVD DGLRSGLVTQ QQLEGVGIFA RHLAMARQQY PKISGRRLVH
ETVRRMINTL AGDLIRQSAV NIAQASPVTL DEIRAAPPLI GFSREIAQEQ QELKKFLREH
LYRHYKVSRM SAKARYIIRQ LFDAFNSDIR LLPPEFQSRY QQDKHQAIAD YIAGMTDRYA
IREYRRLFVV EES