Gene Nmul_A0010 details

Gene Information

Locus tag: Nmul_A0010
Symbol:
ID: 3786448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 11535
End bp: 12668
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 59%
IMG OID: 637810078
Product: bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II-like protein
Protein accession: YP_410711
Protein GI: 82701145
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase; [COG0807] GTP cyclohydrolase II
TIGRFAM ID: [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
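
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record itself does not state: that the coordinates are 1-based and inclusive at both ends, and that the final codon is a stop codon that adds no residue to the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 11535
end_bp = 12668

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets. The last codon is assumed to be a stop codon,
# which does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1134 0 377
```

Under these assumptions the computed values agree with the listed Gene Length (1134 bp) and Protein Length (377 aa).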


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGATCATCA GTTCAATTCA GGAAATCATT GCTGACATCA AGACCGGCCG CATGGTCATT 
CTCGTCGATG AGGAAAACCG CGAAAACGAG GGGGACCTGG TTCTGGCGGC GGATTTCATT
ACACCGGCAG CCATCAACTT CATGGCAACG TACGGACGCG GCCTGATCTG CCTGACGTTG
ATGGAGGAAC GTTGCCGCCA GCTCAACCTG CCTCTGATGG TGGCGGCTAA CCGGTCTCCG
CTCGGCACCA ATTTCACGGT TTCCATCGAA GCCGCGACCG GGGTGACCAC GGGCATTTCC
GCTGCTGACC GCGCACGCAC GGTACAGGTG GCCGTGAACC GGGGCGCAAA ACCGGATGAC
ATCGTCCAGC CGGGGCATAT TTTCCCCCTG ATGGCGCAAA ACGGCGGCGT GCTCGTGCGC
GCCGGCCACA CGGAAGCTGG TTGCGATCTG GCGCATCTGG CGGGACTCAC CCCCGCCTCC
GTGATCTGCG AAATACTGAA TGAAGACGGG AGCATGGCGC GCCTACCCGA TCTGGTCGAG
TTTGCGGCAA AGCATCAACT CAAGATAGGT GCGATAGCCG ACCTCATCCA TTATCGCAGC
CGTACGGAAA GCCTGGTGGA ACGGATTGTA GAACGTCCCC TCCAGACTGC GCACGGCAGA
TTCCAGCTCG TTGCCTACCT GGACAAAACC GTCAATACTG TCCACCTGGC GTTGGTAAAG
GGGGCTATCG CTCCCGACGA TGAAACCCTG GTAAGAGTGC ATGAGCCTCT TTCGGTCATG
GACCTTCTCG ATCTGGAGGA TGATACCCAC TCCTGGAATC TGAACGAGGC CCTGCGCATC
ATCAGCGATG CGGGACGCGG CGTTATCGTA CTGCTTCATT GCGGGGAAAG CGGGTCCGGG
CTGATGGAAA GGGTGTTGCC GGCAAAGCTG CCGCATCGTC CGGTCGCCAA GCCCGACCTG
CGCAATTATG GTATTGGCGC GCAGATCCTC AAGGATCTCA ACGTGAGGAA GATGCGCTTA
CTGGCTGTTC CCCGAAAAAT GCCGAGCATG GCGGGCTTCG GACTTGAAGT CACCGGCTAT
CTGGAGCCGG AAGATAAAAG CCGGCTGGCG GAAAGCCGCC AAGCGGTCCG TTGA
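
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as GC content can be computed. The following is a minimal sketch of how that might be done. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demonstration uses only the first line of the sequence, so its result need not match the 59% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence in this record, used as a short demonstration.
first_line = "ATGATCATCA GTTCAATTCA GGAAATCATT GCTGACATCA AGACCGGCCG CATGGTCATT"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full 1134 bp sequence through the same two functions would give a figure to compare against the GC content field.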
 
Protein sequence
MIISSIQEII ADIKTGRMVI LVDEENRENE GDLVLAADFI TPAAINFMAT YGRGLICLTL 
MEERCRQLNL PLMVAANRSP LGTNFTVSIE AATGVTTGIS AADRARTVQV AVNRGAKPDD
IVQPGHIFPL MAQNGGVLVR AGHTEAGCDL AHLAGLTPAS VICEILNEDG SMARLPDLVE
FAAKHQLKIG AIADLIHYRS RTESLVERIV ERPLQTAHGR FQLVAYLDKT VNTVHLALVK
GAIAPDDETL VRVHEPLSVM DLLDLEDDTH SWNLNEALRI ISDAGRGVIV LLHCGESGSG
LMERVLPAKL PHRPVAKPDL RNYGIGAQIL KDLNVRKMRL LAVPRKMPSM AGFGLEVTGY
LEPEDKSRLA ESRQAVR
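
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons. This sketch does not handle alternative start codons, which is adequate here only because the gene begins with ATG. The function name `translate` and the constants are illustrative, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 54 bases of the gene sequence in this record.
print(translate("ATGATCATCA GTTCAATTCA GGAAATCATT"))  # MIISSIQEII
print(translate("CTGGAGCCGG AAGATAAAAG CCGGCTGGCG GAAAGCCGCC AAGCGGTCCG TTGA"))  # LEPEDKSRLAESRQAVR
```

Both fragments reproduce the corresponding ends of the listed protein sequence, and the final TGA codon is read as a stop.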