Gene Nmul_A0006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0006 
Symbol 
ID3786444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp7704 
End bp8804 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content61% 
IMG OID637810074 
Productriboflavin biosynthesis protein RibD 
Protein accessionYP_410707 
Protein GI82701141 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCTCGC CAACCGATTA CCGCTTCATG GCCCAGGCGC TGCGGCTTGC AGAAAAAGGG 
CTTTATAGTA CAAGCCCGAA CCCCCGCGTG GGTTGTGTGC TGGTACGCGA TGGACAGGTG
GTGGGTACGG GCTGGCACGA ACGCGCTGGC GAAGCGCATG CGGAAATAAA TGCACTGGCT
GCCGCCGGAC CGGCAGCCCG GGGCGCCCTT GCCTATCTCA CGCTGGAGCC GTGCAGTCAC
TATGGCCGCA CTCCACCCTG CGCCGATGCC CTGGTTCAGG CAGGTGTCGC GAAAGTCATT
ACAGCCATGC AGGACCCGAA TCCCCTCGTG GCCGGTCGCG GCTGCGCCCT TCTGGAAGAG
GCGGGGATAG AAGTGAAAAC CGGCTTGATG GAAGCGGAAG CGAAAGCTTT GAATATCGGA
TTTGTCTCGC GCATGACCCG CGGTCGTCCC TGGGTCAGGG TCAAGATCGC GGCAAGTCTC
GATGGCAAGA CGGCGCTCAA CAACGGGTCC AGTCAATGGA TCACGAGCGC GGCGGCACGC
CGGGACGGGC ACCGCTGGCG CGCCCGTTCC TGCGCGGTAA TGACCGGCAT TGGTACGGTG
TTGGCCGATG ACCCGCAGCT CACGGTGCGC CATATCCATA CCTCCAGGCA ACCTATGCCG
GTAGTGGTGG ACAGGGGACT GGATATACCG CTGGATGCAG GATTGCTGCG AGGCGCAGGC
GAACTGGTTT TTACCGCTGC TGCCAGTGAA GGCAAAATTG TCGCGTTACG GGACGTGGGG
GCGCACGTCA TCCTGTTGCC GGATAGCGCT GGCAACGTGG ATCTGGCTGC TATGATGCGA
CGGCTTGCGG ATCTCGAAAT AAACGAAGTG CTGGTGGAGG CGGGATCCGG CTTGAATGGA
GGGCTTATCC AGGCGGATTT GGTGGATGAG TTCGTCATTT ATCTTGCCCC CTGTCTGATC
GGGAATGCGG CGCGGGACAT GCTCAAATTA CCGGAACTCT CGAATCTGGA AGACAAGCGA
GCCCTCAAGA TCCACAATGT ACGCGCCGTG GGGCAGGATA TTCGCATTAT TGCCCGTGGC
CCTGACGGGA TGGCTACATA A
 
Protein sequence
MFSPTDYRFM AQALRLAEKG LYSTSPNPRV GCVLVRDGQV VGTGWHERAG EAHAEINALA 
AAGPAARGAL AYLTLEPCSH YGRTPPCADA LVQAGVAKVI TAMQDPNPLV AGRGCALLEE
AGIEVKTGLM EAEAKALNIG FVSRMTRGRP WVRVKIAASL DGKTALNNGS SQWITSAAAR
RDGHRWRARS CAVMTGIGTV LADDPQLTVR HIHTSRQPMP VVVDRGLDIP LDAGLLRGAG
ELVFTAAASE GKIVALRDVG AHVILLPDSA GNVDLAAMMR RLADLEINEV LVEAGSGLNG
GLIQADLVDE FVIYLAPCLI GNAARDMLKL PELSNLEDKR ALKIHNVRAV GQDIRIIARG
PDGMAT