Gene Nmul_A1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1998 
Symbol 
ID3784489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2295049 
End bp2296404 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content55% 
IMG OID637812087 
Productnitrite reductase (NO-forming) 
Protein accessionYP_412685 
Protein GI82703119 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2132] Putative multicopper oxidases 
TIGRFAM ID[TIGR02376] nitrite reductase, copper-containing 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAGGA CCTTTTTACT GCTTACAGCC GTTTGCGCAA CGTTCGGTTA TTTATCCCCG 
GTCGCTGCCG AACAAGGCAG CAACAGCGGT GTCACCTACC GACCCGACGT TACCTTTACC
CTTCGTACGG ATATCGGCGA GGGTAAACTG GTATTTGTCG GTGAATCAGG TACGATCGCG
GGCAAAATCA ACCCTGACCT GGAGGTAAAC CCGGGCGCAG TGGTTCAGAT CAATCTGGTC
AACGGCGATG GTGCCATGCA TGACATCGCC ATTCCTGAGC TGGGCGCCAA ATCGGATAAT
ATCACAGCGA AAGGCGCTGC CACTTCGATG GTATTTCGTG CAGTCCGTAC CGGCACCTTT
GAGTACCTCT GCACACTGCC CGGGCACAAG GCCGCCGGCA TGTTCGGCCG TCTCATTGTC
GGCAAGCCGC CGGAGGAAGC GAAAGGATAT GTAACCAACG TGGCCCAGGA CCCGCGCGCC
GTGGGAGAGC CGGTCGGAAC GCGGGCTTCA AGGCATTTGA CACTCAATCT CGAAGCAACC
GAAATCGAAG GGCAACTTTC GGACAAGAAG CTCTATAAAT ACTGGACCTT CAATAACAGG
GTGCCGGGAC CCCTCCTGCG CGTGAAGGTA GGGGATACCA TCACTATCAA TCTGCATAAT
AGCGCAAGCA GTACCAACAT CCATTCGATA GATTTCCATG CCGTAACAGG ACCGGGTGGA
GGTGCCGCGG TCACGCAGGC GGCACCGGGC GAGACAAAGA GCTTCACCTT CAAGTCGCTT
CATCCTGGAC TGTTCGTTTA CCATTGTGCA ACTCCCATGG TTGCGCATCA TATTGCGAAC
GGCATGTATG GCATGGTTCT GGTCGAGCCC GAGGGAGGTC TGCCCATGGC TGACAGGGAA
TTCTATGTAA TGCAGGGTGA GCTCTACACC ACCCACTCCC ACAATGTGCG AGGTCTGCAG
GAATTTTCGC TGGAAAATCT GCTCGCGGAG AATCCCCAGC ACCTAGTGTT CAATGGCACC
GTGGATGCGC TTACGAAAAT GTACACCATG GAAGCGAACA CGGGCGATAA CGTGCGCATC
TATTTTGGTG TGGGCGGTCC CAACCTCACC TCCAGCTTCC ACATCATCGG GGAAGTCTTC
GATAAGGTGT ACGACCAGGC TTCATTGACC AGTCCGCCAT TGACAGATGT GCAAACGACG
CTCGTTCCGC CAGGCGGTGC CACCATTGTG GAATTCAAGG TCGATTATCC CGGCCGCTAT
ATTCTGGTTG ATCACGCGCT GTCGCGTATG GAGAAAGGAT TAGCAGGTTA TCTCACGGTG
CGTGGGAAGG CAAACGCGGA AATATTCAAG CCGTAA
 
Protein sequence
MGRTFLLLTA VCATFGYLSP VAAEQGSNSG VTYRPDVTFT LRTDIGEGKL VFVGESGTIA 
GKINPDLEVN PGAVVQINLV NGDGAMHDIA IPELGAKSDN ITAKGAATSM VFRAVRTGTF
EYLCTLPGHK AAGMFGRLIV GKPPEEAKGY VTNVAQDPRA VGEPVGTRAS RHLTLNLEAT
EIEGQLSDKK LYKYWTFNNR VPGPLLRVKV GDTITINLHN SASSTNIHSI DFHAVTGPGG
GAAVTQAAPG ETKSFTFKSL HPGLFVYHCA TPMVAHHIAN GMYGMVLVEP EGGLPMADRE
FYVMQGELYT THSHNVRGLQ EFSLENLLAE NPQHLVFNGT VDALTKMYTM EANTGDNVRI
YFGVGGPNLT SSFHIIGEVF DKVYDQASLT SPPLTDVQTT LVPPGGATIV EFKVDYPGRY
ILVDHALSRM EKGLAGYLTV RGKANAEIFK P