Gene Nmul_A1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1103 
Symbol 
ID3784718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1269813 
End bp1271300 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content54% 
IMG OID637811188 
ProductNADH dehydrogenase subunit M 
Protein accessionYP_411798 
Protein GI82702232 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135594 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGTTTG GTTTCCCCCT GTTAAGTCTG GTTATCTGGC TGCCTATCCT CGCCGGTGTT 
GCTGTACTCG CTACCGGTGG AGATCGTAAT GCTCCCCTTG CGCGCATGAT CGCCCTCGTC
GGATCTATTG CGGGTTTTCT GGTGGCGATT CCGCTTTATA CCAGCTTCGA TCCGTCGACG
AGCACTATGC AATTTGTCGA GAGCCATGTG TGGATCGAAC GCTTCAACGT CCACTATCAC
CTCGGGGTGG ATGGAATCGC CATGCCCCTG ATACTGCTGA ATGCTTTTAC CACCCCTCTG
GTAGTGATCG CGGGATGGGA AGTGATTACC CGGCGCGTAT CGCAGTATAT GGGGGCCTTT
CTCATCATGT CCGGCATCGT CAACGGTGTT TTTTCGTCGC TGGATGCAAT TCTCTTCTAT
GTCTTCTGGG AAGCTTCCCT CATTCCGATG TTTCTTATCA TTGGCGTGTG GGGGGGACCC
AACCGGGTTT ATGCGGCAAT CAAGTTTTTC CTTTATACGC TGCTCGGTTC ACTGCTGATG
CTGGTGGCAT TCATCTATCT TTACCAGGTT TCCGAGGGTA GCTTCTCGAT ACTTGAATAT
CATAAACTGC CGTTGTCGAT GGCATCGCAG ATCCTGATAT TCATCGCCTT CCTGCTGGCT
TTTGCTGTCA AAGTCCCCAT GTGGCCCGTC CATACGTGGC TACCCGACGC GCACGTGGAA
GCGCCGACCG GAGGTTCGGT GGTGCTTGCC GCTATCCTGC TGAAAATGGG AGGCTACGGG
TTCCTGCGGT TTTCGCTGCC GATCCTGCCG GATGCGAGTC ACCAGCTCGC GGGCATGATG
ATCGCATTGT CGCTGATCGC GGTCGTCTAT ATCGGCCTGG TTGCCCTGGT GCAGGCGGAC
ATGAAAAAGC TGATCGCCTA CTCATCGGTG GCACATATGG GTTTCGTCAC CCTCGGTTTT
TTCCTGTTCA ATAATTACGG CCTCGAAGGC GCCATGGTCC AGATGGTTTC ACATGGTTTT
ATTTCGGCTG CAATGTTTCT TTGTATTGGC GTCATGTATG ACAGGCTGCA TTCCCGCCAG
ATCGTGGATT ATGGGGGAGT GGCGCACCGC ATGCCTGCCT TTGCCGCTTT TTTCATGCTG
TTTGCCATGG CTAACTCCGG GTTGCCCGGC ACCAGCGGTT TCGTCGGCGA GTTCATGGTC
ATCATGGCAT CGATGAAAGT GAATTTCTGG TATGCGTTTC TGGCCGCCAC GACGCTCATC
ACAGGCGCAG CTTATACCCT GTGGATGTAC AAGCGCGTGA TATTCGGCGC CGTTGTACAT
CCCGCAGTGG AGGAAATGAA AGATATCGGC GCGCGCGAGA TTCTTGTATT GACCGTACTC
GCGGTGGCGG TATTGGGGAT GGGACTATAT CCGCTACCCT TGACGGAAGT CATGCATACC
ACAGTTGATG ATTTACTTGC GCATGTTGCT CGCAGCAAAT TGCAGTGA
 
Protein sequence
MLFGFPLLSL VIWLPILAGV AVLATGGDRN APLARMIALV GSIAGFLVAI PLYTSFDPST 
STMQFVESHV WIERFNVHYH LGVDGIAMPL ILLNAFTTPL VVIAGWEVIT RRVSQYMGAF
LIMSGIVNGV FSSLDAILFY VFWEASLIPM FLIIGVWGGP NRVYAAIKFF LYTLLGSLLM
LVAFIYLYQV SEGSFSILEY HKLPLSMASQ ILIFIAFLLA FAVKVPMWPV HTWLPDAHVE
APTGGSVVLA AILLKMGGYG FLRFSLPILP DASHQLAGMM IALSLIAVVY IGLVALVQAD
MKKLIAYSSV AHMGFVTLGF FLFNNYGLEG AMVQMVSHGF ISAAMFLCIG VMYDRLHSRQ
IVDYGGVAHR MPAFAAFFML FAMANSGLPG TSGFVGEFMV IMASMKVNFW YAFLAATTLI
TGAAYTLWMY KRVIFGAVVH PAVEEMKDIG AREILVLTVL AVAVLGMGLY PLPLTEVMHT
TVDDLLAHVA RSKLQ