Gene Nmul_A0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0233 
Symbol 
ID3786315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp247736 
End bp248833 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content53% 
IMG OID637810305 
ProductRieske (2Fe-2S) region 
Protein accessionYP_410933 
Protein GI82701367 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0567601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGATC TGATTGATGT TTGCCAATTG GCTCCAACGC AACTGCCGGT TGACTGGTAT 
CTCGACTCTC AAATCCTTGA ACTGGAAAAA CGCATTCTTT TCGACCAGGG ACCGGGCTAT
GTCGGTCATG AGATCATGGT GCCCAACATT GGTGACTATT ATGTGCCCGA ATGGATGGAT
AACGCGAAAA TGCTGGTGCG TAATAAGGAT GGTATCGAGC TGCTGTCCAA CGTCTGCCGC
CACAGGCAGT CACTGCTGCT CAAAGGGAGT GGCAACACCA GAAACATCGT TTGCCCGGTG
CATCGCTGGA CGTATGACTT GAAGGGCACG TTGCTCGGGG CGCCCCATTT CCCCGAAAAC
CCTTGCCTCA ATCTTTCCAA TTCACCTTTG CAGAACTGGA ACGGACTCCT GTTTCAAGGC
AGGCGGAATG TTGCCCGCGA TCTGGGCAAC TTGCAAGTCC TGAAGAATTT CGACTTCTCG
GGCCATGTGC TGGAACGCCT CCAGATAGAT GAATACGCCT GCAACTGGAA AACCTTCATA
GAAGTTTATC TGGAGGACTA CCACGTCGAG CCATACCATC CGGGCCTGGG CAACTTTGTT
GACACGGCTG CGCTGGAATG GGAGTTCGGG GAGTGGTACA ACGTGCAGAC TGTGGGGATC
AATAACGCCC TGACCCGTCC GGGAACGCCC GTCTATGCAA AATGGCACGA GCAACTGTTG
CTGCAGACAG CGGGCGAAAT ACCGAGGCAT GGCGCAATCT GGATGCTGTA TTATCCCAAT
GTCATGATGG AATGGTATCC CCATGTATTG GTCGTCAGCA CGGTGCTTCC TACCGGAACA
GAGCGTTGCA CCAATGTGGT GGAGTTCTAT TATCCTGAAG ATATCGCTTT GTTTGAGCGT
GAATTCATTG AAGCAGAACA GGCCGCCTAT CGCGAAACCG CTGCCGAGGA TGATGAAATC
TGCAGGCTGA TGACGGAAGG GCGCCGTGCG CTGTACAAAC AAGGCGTGAG CGAGGTCGGA
CCGTACCAAT CGCCGATGGA AGATGGCATG GTGCACTTCC ACAAGTTTCT GCGGCGAGAA
ATTGAACCGC ACATCTGA
 
Protein sequence
MVDLIDVCQL APTQLPVDWY LDSQILELEK RILFDQGPGY VGHEIMVPNI GDYYVPEWMD 
NAKMLVRNKD GIELLSNVCR HRQSLLLKGS GNTRNIVCPV HRWTYDLKGT LLGAPHFPEN
PCLNLSNSPL QNWNGLLFQG RRNVARDLGN LQVLKNFDFS GHVLERLQID EYACNWKTFI
EVYLEDYHVE PYHPGLGNFV DTAALEWEFG EWYNVQTVGI NNALTRPGTP VYAKWHEQLL
LQTAGEIPRH GAIWMLYYPN VMMEWYPHVL VVSTVLPTGT ERCTNVVEFY YPEDIALFER
EFIEAEQAAY RETAAEDDEI CRLMTEGRRA LYKQGVSEVG PYQSPMEDGM VHFHKFLRRE
IEPHI