Gene Nmul_A1017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1017 
Symbol 
ID3786542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1176644 
End bp1177921 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content60% 
IMG OID637811101 
ProductNADH-quinone oxidoreductase, F subunit 
Protein accessionYP_411712 
Protein GI82702146 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACAC CGCTGACGCA TAACATTTCA CCGGGCAGGG AACCTCCGGA TCTTGCGCAG 
TACGAGAAGG CCGGAGGTTA TGGCGCGCTG CGCAAGGCGC TCGGTATGGC GCCGGCGGAA
ATTCAGGCAG CGGTCAAGGA ATCCAACCTG CGCGGCCGTG GCGGAGCGGG GTTTCCAACT
GCACAGAAAT GGAGTTTTGT GCCGATGGGC GATGATGCGC CGCGGCCCAA GTATCTCGTT
TGCAATGCCG ACGAGATGGA GCCGGGTACA TTCAAGGATC GTATGTTGCT CGAGGGAGAC
CCCCATCAGT TGATCGAAGG CATTATCATC AGCGCCTATG CGATCCAGGC CGATGTGGCT
TACGTATTTC TGCGCTGGGC CTACAAACTG GCGGCCCGGC GCGTGGAGCG CGCAATCGTT
GAAGCATACC GCCACGGCTA CCTGGGTAAA AATATCCTGG GCTCGGCTTA CAGTCTGGAG
ATGCACCTGC ACGTCAGTGC GGGACGCTAC ATATGCGGAG AGGAGACCGC GCTGCTCAAT
GCCCTCGAAG GCAAGCGCGC CAACCCACGG GCCAAACCTC CCTATCCCCA GGTGAGTGGC
CTGTGGGGAA AGCCTACCAT TGTCAATAGC GTGGAAACCT TGTGCAACGT TCCGCATATC
GTGAAGCAGG GGGCCGAATG GTTCAGGAGC CTGAGCCGCA GCGACGACGG CGGAACGAAG
CTGTATGGGG CAAGCGGGAG AGTGAAGAAC CCGGGATTAT GGGAATTGCC CATGGGCACT
CCTCTGCGCG AGATTCTGGA AGAGCATGCG GGCGGCATGC GTGACGGCTA TCAGTTTCGC
GGGGTGCTGC CGGGTGGCGC TTCGACTGAT TTTGTTACTG CCGAGCACCT CGACGTGGCG
ATGGATTTCG ATTCGGTGCA GAAGGCCGGC AGCCGCCTCG GCACCGGCAC CATGATCATC
CTTGATGACA AGACCTGCCC CGTTGGCATG CTGCTCAATC TGGAACATTT CTTCGCCCAG
GAATCATGCG GCTGGTGCAC CCCGTGCTGG TCGGGGCTTT CCTGGATCGA ACAGATATTG
CAGGACATGG AGGAGGGTCG CGGCCGGGCT TCCGACCTTG AATTGCTGGA ATCCCATACG
CGCCTCCTGG GTCCCGGACA TACTTTTTGT GCGCTTGCCC CCGGAGCGGC CGAGCCCCTG
CAAAGCGGCC TCAAGTATTT CCGTGATGAT TTCGAGCGCC ATATCCATGA GAAACGCTGT
CCCTGGAGCC CGACGTGA
 
Protein sequence
METPLTHNIS PGREPPDLAQ YEKAGGYGAL RKALGMAPAE IQAAVKESNL RGRGGAGFPT 
AQKWSFVPMG DDAPRPKYLV CNADEMEPGT FKDRMLLEGD PHQLIEGIII SAYAIQADVA
YVFLRWAYKL AARRVERAIV EAYRHGYLGK NILGSAYSLE MHLHVSAGRY ICGEETALLN
ALEGKRANPR AKPPYPQVSG LWGKPTIVNS VETLCNVPHI VKQGAEWFRS LSRSDDGGTK
LYGASGRVKN PGLWELPMGT PLREILEEHA GGMRDGYQFR GVLPGGASTD FVTAEHLDVA
MDFDSVQKAG SRLGTGTMII LDDKTCPVGM LLNLEHFFAQ ESCGWCTPCW SGLSWIEQIL
QDMEEGRGRA SDLELLESHT RLLGPGHTFC ALAPGAAEPL QSGLKYFRDD FERHIHEKRC
PWSPT