Gene Nmul_A1846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1846 
Symbol 
ID3785955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2129543 
End bp2130874 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content55% 
IMG OID637811931 
Productputative ferredoxin 
Protein accessionYP_412533 
Protein GI82702967 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000698817 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACTC GCGAAGGCAG CCTGGAGGCG CCGAAGCGCC TTCCCATCGA CTGGGAAAAT 
CCGGAATATT ATGACGAACC CAAGCTGTTC GCGGAGATGG AGCGTGTCTT CGACATATGT
CACGGTTGCC GCCGCTGTGT GAATCTGTGC ACCGCATTTC CGCGCTTGTT CGATCTCATA
GATGAGGGGT CGACCGGCGA ACTGGACGCG GTGGAGAAAT CGAGATATTG GGAGGTAGTG
GATCGCTGCT ATCTCTGCGA TATGTGCTCC ATGACGAAAT GCCCGTATGT GCCCCCTCAT
CCATGGAATG TCGATTTTCC ACACCTGATG CTGCGCGCCA AGGCAGTCAA GCACGAAAAA
GGCGGTACGA GTTTCCGTGA CAAGCTATTA TCCAGCACCG ACGCGGTGGG CAAGCTTGCC
AGCATTCCCG TCGTGGTCCA GGCAGTCAAT GCGATCAACA AGGCGCCCGC TACCCGCAAA
CTGATGGACA GCATGCTTGG CATTCATGCA GAGCGCAGAT TGCCGGAGTA CGACAGTCAC
AGATTTCGCA AGACCGCCCA GCCGAACGCG GACCACGCGG TGCGCAACGG CGAGCGAACG
CCCGGGAAAG TGGCAATTTA TTCCACCTGT TACATCAACT ATAACGAGCC CGGAATAGGG
CACGATTTAT TGAAGATACT GGACCATAAC GAGATTCCAG TATGCCTGGT GGAGAAAGAA
GCCTGTTGTG GAATGCCGAA GCTGGAGCTG GGTGACTTGC AAGCCGTGAA AGAACTCAAG
GATAAAAATA TCCCTCCCTT GGCGAAACTG GCCAGGGAAG GTTATGCAAT ACTTACAGCG
GTGCCGTCCT GCACGCTCAT GTACAAACAG GAATTGCCAC TCATGTTTCC GGACAACGAG
GATGTGAAGG CGGTAGCCAA AGCCATGCTC GATCCCTTCG AATATTTGAC GATGCGAAAT
CGTGATGGGC TGTTGAAAAC GGATTTCAGG AAATCGCTAG GCAAGGTTTC ATATCATATT
CCTTGCCATT TGCGCGTGCA GAATATCGGG CAGAAGACGA AAGATCTGTT GCAGATGGTG
CCCGACACAA AGGTGACCGT AGTGGAGCGC TGCTCCGGGC ACGACGGCAC CTGGGGCGTA
AAAAGCGAAT ACTTTGCCGA CTCCATGAAG ATCGGCAGGC CCGTGTTCAG GCAGATGGCC
CAGCATGATC CTGACTATAT CAGTTCCGAT TGCGCGATCG CTGGTCGCCA TATCGAGCAA
GGCATGGGCG AAACAAGCGC GCAGAAGGCG CATCCCTTGA CCCTCATCCG CATCGCATAC
GGGCTGCCCT GA
 
Protein sequence
MTTREGSLEA PKRLPIDWEN PEYYDEPKLF AEMERVFDIC HGCRRCVNLC TAFPRLFDLI 
DEGSTGELDA VEKSRYWEVV DRCYLCDMCS MTKCPYVPPH PWNVDFPHLM LRAKAVKHEK
GGTSFRDKLL SSTDAVGKLA SIPVVVQAVN AINKAPATRK LMDSMLGIHA ERRLPEYDSH
RFRKTAQPNA DHAVRNGERT PGKVAIYSTC YINYNEPGIG HDLLKILDHN EIPVCLVEKE
ACCGMPKLEL GDLQAVKELK DKNIPPLAKL AREGYAILTA VPSCTLMYKQ ELPLMFPDNE
DVKAVAKAML DPFEYLTMRN RDGLLKTDFR KSLGKVSYHI PCHLRVQNIG QKTKDLLQMV
PDTKVTVVER CSGHDGTWGV KSEYFADSMK IGRPVFRQMA QHDPDYISSD CAIAGRHIEQ
GMGETSAQKA HPLTLIRIAY GLP