Gene Nmul_A1007 details

Gene Information

Locus tag: Nmul_A1007
Symbol:
ID: 3785838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1167307
End bp: 1168284
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 58%
IMG OID: 637811091
Product: hypothetical protein
Protein accession: YP_411702
Protein GI: 82702136
COG category: [F] Nucleotide transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0352] Thiamine monophosphate synthase
        [COG1051] ADP-ribose pyrophosphatase
TIGRFAM ID: [TIGR00586] mutator mutT protein
            [TIGR00693] thiamine-phosphate pyrophosphorylase
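
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1167307, 1168284)
print(length)                  # 978
print(protein_length(length))  # 325
```

Both values agree with the Gene Length and Protein Length fields of the record.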


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCCCG CTCCCTCCAT CGTTGAAGTT GTTGCCGCCA TCATCATCGG GAGCGACGGT 
TCCTTTCTTT TGGCCCGTCG CCCGGAAGGC AAACCCTATG CCGGCTACTG GGAATTTCCG
GGTGGCAAGG TCAATCCGGA AGAATCCCTG TTGCGCGCAC TCAAGCGCGA ACTGCTGGAA
GAATTGGGCA TACATGTCAA GCATGCGTAC CCCTGGATAA CCCGCACATT CACCTATCCT
CATGCCAGGG TACGGCTTCA CTTTTATCGG GTCGTGGAGT GGCATGGCGA GCCTCATCCG
CATGAAGACC AGGAGCTGTC GTGGCAGTTT GCGGACAATG TCTCTGTAGA GCCGTTGCTC
CCTGCCAACG CACCCGTATT GCGGGCACTT GCCTTGCCCC CCGTGTATGG GATTACGAAT
GCCGCGGAGT GGGGGCCACA AATTGCAGCA GCCCGAATCG GGCACGCTCT CCAGAAAGGA
CTACGGCTGG TGCAGCTACG CGAAAAAGGA ATGCGGAGTA AAGCGCTGGA TGCCTTTGCC
CGTGAAGTAA CCGCACTGGC CCATCATTAC GGCGCCCGCA TTCTCGTAAA CAGCGGCACG
GGTAACGAGA GCCTCTGCCA GGAATTGGAT ATGGATGGAA TCCACTTTAC ATCGGCTGAT
CTGATGAATC TTTCGAAACG ACCCGATGTA GAATGGTGCG CGGCCTCCTG TCACAACGCC
GAAGAGCTTT TCCGGGCCGA GCAACTGGAA ATGGATTTCG CCGTTCTGGC TCCCGTACTT
CCGACATTGA GCCATCCCGA TTCCCCCGTG CTGGGCTGGC GAAAGCTTGC CAGAATCATC
CACGGCAGCG CCATCCCTGT CTATGCGCTG GGAGGGCTGC AAAGCGAAGA TCTTGCCATT
GCCTGGGAAC ACGGTGCCCA CGGCATCGCC CTGATGCGCC GTATTGAGCA GGTGCGCGGC
ACCGGGCAGA AGGCCTGA
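
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. As a minimal sketch, the helper names below are hypothetical and the example uses only the first line of the sequence. Passing the full sequence to `gc_percent` would give a value to compare against the GC content field of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only, as a short illustration.
first_line = "ATGCGCCCCG CTCCCTCCAT CGTTGAAGTT GTTGCCGCCA TCATCATCGG GAGCGACGGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```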
 
Protein sequence
MRPAPSIVEV VAAIIIGSDG SFLLARRPEG KPYAGYWEFP GGKVNPEESL LRALKRELLE 
ELGIHVKHAY PWITRTFTYP HARVRLHFYR VVEWHGEPHP HEDQELSWQF ADNVSVEPLL
PANAPVLRAL ALPPVYGITN AAEWGPQIAA ARIGHALQKG LRLVQLREKG MRSKALDAFA
REVTALAHHY GARILVNSGT GNESLCQELD MDGIHFTSAD LMNLSKRPDV EWCAASCHNA
EELFRAEQLE MDFAVLAPVL PTLSHPDSPV LGWRKLARII HGSAIPVYAL GGLQSEDLAI
AWEHGAHGIA LMRRIEQVRG TGQKA
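
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It assumes the compact NCBI layout of the genetic code, with bases ordered T, C, A, G at each codon position. Table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, which this sketch does not model (the gene here starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence; a trailing partial codon is ignored.
print(translate("ATGCGCCCCG CTCCCTCCAT"))  # MRPAPS
```

The output matches the first six residues of the protein sequence above.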