Gene Nmul_A0013 details

Gene Information

Locus tag: Nmul_A0013
Symbol:
ID: 3786451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 13916
End bp: 14941
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 56%
IMG OID: 637810081
Product: thiamine-monophosphate kinase
Protein accession: YP_410714
Protein GI: 82701148
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
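
The start, end, gene length, and protein length fields are consistent with one another if the coordinates are read as 1-based and inclusive and the CDS is taken to include its terminal stop codon. Both are assumptions about this record's conventions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1
```

With the values from this record, gene_length(13916, 14941) gives 1026 and protein_length(1026) gives 341, matching the Gene Length and Protein Length fields.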


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCTTG TAGGTGGAAG TAAAGTGCTG TCGGAATTTG ACATCATCCG GTCTTTTTTC 
ACCCGCTCCG CTCCGGCTAC AGTGTTGGGT ATAGGGGACG ATGCGGCGCT CATCCGGCCT
GCACCCGGGA TGGAACTGGC AATCTCGAGC GACATGCTGG TTTCCGGGCG TCATTTTTTC
GAAGATGTGG ATCCGTACAA GCTCGGCCAC AAATCTCTGG CAGTGAATCT TTCCGATATG
GCGGCAATGG GTGCCCGTCC CCGTTGGGCA ACCCTTTCGC TGGCGCTGCC AGAATCAGTA
GTACAAAAGG ATGAGTCATG GCTGCGAGCG TTTGCTGACG GATTTTTTGC GCTTGCGCAT
GTCCATCGGG TTGATCTGAT CGGCGGCGAT ACCACCAATG GTCCCCTGAA TATCTGCGTG
ACGATCATTG GCGAAGCGCC GGAAGGAAAA GCCTTGCGCC GCAGCGGCGC CAGGGCGGGG
GATGATATCT GGGTTTCGGG CTACCTCGGG GATGCGGCTC TCGCGCTTGC TTACCAAAAA
AAAAAGATCA TGCCGAAACC GGGCGAAGTG GAAGCCGCAG AATTGGAAGC TGCGGAGTTG
GAATTCCTTA TGGCCGCGCT TGAGATGCCC ATTCCCCGTG TGGAACTGGG GCAGCGTCTG
ATCGGCCTTG CCCATAGCGC CATCGATATT TCAGACGGTC TGCTGGCCGA CCTCGGACAT
ATTCTGGAGT GTTCAAGAGT GGCGGCTGTT GTCAGAATCG ATCGGATACT CCGGTCGGCA
GCGATGGAGA AACATTTTCC TCACCCCCTT GCGATAGAAT GCCTGCTTGC GGGGGGAGAC
GATTATGAGT TGTGCTTTAC CGTGCCGAAG TCGGAGCGAA TAAAGGTGGA ACTGCTTTCG
CGCGAGGAAG GTATTCCGTT AACACGTATT GGCAGCATCG AGGAGGGGGC AGGCTTGGTC
GTGCTCGATT CCGCAGGCAG AACAGTCACT ACGAGGGTCA AAGGTTATGA CCATTTCCAA
GTCTGA
 
Protein sequence
MDLVGGSKVL SEFDIIRSFF TRSAPATVLG IGDDAALIRP APGMELAISS DMLVSGRHFF 
EDVDPYKLGH KSLAVNLSDM AAMGARPRWA TLSLALPESV VQKDESWLRA FADGFFALAH
VHRVDLIGGD TTNGPLNICV TIIGEAPEGK ALRRSGARAG DDIWVSGYLG DAALALAYQK
KKIMPKPGEV EAAELEAAEL EFLMAALEMP IPRVELGQRL IGLAHSAIDI SDGLLADLGH
ILECSRVAAV VRIDRILRSA AMEKHFPHPL AIECLLAGGD DYELCFTVPK SERIKVELLS
REEGIPLTRI GSIEEGAGLV VLDSAGRTVT TRVKGYDHFQ V
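
The sequence blocks are printed in groups of ten characters, so the spaces and line breaks have to be stripped before any computation. The Python sketch below shows one way to do that, compute a GC fraction, and translate the gene sequence. It is an illustration with hypothetical helper names, not the tool that produced this record. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard genetic code does; the table's alternative start codons are not handled, which is harmless here because this gene starts with ATG.

```python
# Codon assignments in TCAG order. Translation table 11 shares these with the
# standard code and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the grouping spaces and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)
```

For example, translate("ATGGATCTTG TAGGTGGA") returns MDLVGG, the first six residues of the protein sequence above. Multiplying gc_fraction of the full gene sequence by 100 gives a value that can be compared with the GC content field.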