Gene Nmul_A1104 details

Gene Information

Locus tag: Nmul_A1104
Symbol:
ID: 3784719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1271333
End bp: 1272778
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 54%
IMG OID: 637811189
Product: NADH dehydrogenase subunit N
Protein accession: YP_411799
Protein GI: 82702233
COG category: [C] Energy production and conversion
COG ID: [COG1007] NADH:ubiquinone oxidoreductase subunit 2 (chain N)
TIGRFAM ID: [TIGR01770] proton-translocating NADH-quinone oxidoreductase, chain N
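
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1271333
end_bp = 1272778

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1446 bp, matching the Gene Length field
print(protein_length, "aa")  # 481 aa, matching the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.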


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTTA TGCTGCCTAA TTTCGCGCCA GCCTATCCGG AAATATTCCT TCTGGTGATG 
GTGTGCGGAG TATTAATGGC TGATCTTGCC TGGGGCGACA AGAAGCCCGG CACTGCCTAT
CTTTTAGCCC AACTGACGTT GTTCGGCTGC ATGCTGATCA CCTTCGGTAC CTTGCAACCC
GACACAGTCC ATACGTTTTC CGGCATGTTT GTCGATGACA GGCTCGCCGA CATTCTCAAG
ATGCTGGTTT ATATCACCGT CTCCATAGTC CTGGTTTATT CCCGTACTTA TATTTCGGAG
CGGGGAATAC TCAGCGGAGA ATTCTTCAGC CTGGCGCTGT TCGCGACCCT CGGAATGATG
GTCATGATTT CCGCCACCCA CTTCATGACA CTCTACCTGG GGTTGGAACT CCTTTCCTTG
TCCCTCTATG CGATGGTTGC GTTGAGGCGT GACTCGGCAG GAGCCACGGA GGCCGCAATA
AAGTTTTTTG TCCTGGGTGC TCTGGCATCC GGCTTTCTGC TGTATGGCAT GTCGATGATT
TATGGGGCCA CTGGCTCGCT TGATATCGCC AGTGTGACAA AAGCGATCGA AGGCGGAATC
ATCAGCCGGG GTGTGCTGGT TGTCGGTCTG GTTTTCATTG TGGCCGGCAT TAGTTTCAAG
CTGAGCGCAG CGCCTTTCCA TATGTGGGCG CCCGATGTTT ATGAAGGAGC GTCTTCAGCA
GTGGTGCTGT TTGTCGGTTC AGCGCCGAAG CTCGCGGCAT TCGGTTTTGT CATGCGCCTG
CTGGTGGAAG GCCTGGGTGC AATGTCCGGC GACTGGCAGG GCATGCTGAT CATACTTGCA
ATCACGTCAA TGGTGATCGG CAATATTGCC GCTATCGCCC AGAGCAACAT CAAGCGCATG
CTGGCCTATT CCACCATCTC GCACATGGGG TTCATGCTGC TTGGCCTTAT CGGCGCGAAT
GAAAACGGCT ACAGTGCCGC AATGTTTTAC GTCGTGGTCT ACGTGCTCAT GACAATGGGT
ACGTTCGGCA TTATCATGCT TCTCTCGCGC GCGGGTTTTG AAGCCGACAA ACTGGATGAT
TACAAAGGGC TCAACCGGCG CAATCCATGG TATGCTTTTA TCATGCTGCT GCTGATGTTC
TCCATGGCTG GCATCCCCCC CACCGTGGGC ATTTATGCCA AGCTGTCAGT GCTTCAGGCC
GTGCTGAACG CAGGGTACAC CTGGCTCGCC GTGCTGGCTG TGCTCCTTTC GCTGATCGGG
GTATTCTATT ACCTGCGTAT CGTCAAGCTC ATGTATTTCG ACGAGCCTGA AACCGATGCG
GTCATCGCAC CAAAAGGTGA CGTAAAGGTA TTGCTGAGCG CCAACGGCCT CGCAATACTC
GCATTCGGCA TCTTCCCCCA GTCGCTTATT GCCCTCTGCA CCTACGCGAT ACAGCAATCT
GCTTGA
 
Protein sequence
MNFMLPNFAP AYPEIFLLVM VCGVLMADLA WGDKKPGTAY LLAQLTLFGC MLITFGTLQP 
DTVHTFSGMF VDDRLADILK MLVYITVSIV LVYSRTYISE RGILSGEFFS LALFATLGMM
VMISATHFMT LYLGLELLSL SLYAMVALRR DSAGATEAAI KFFVLGALAS GFLLYGMSMI
YGATGSLDIA SVTKAIEGGI ISRGVLVVGL VFIVAGISFK LSAAPFHMWA PDVYEGASSA
VVLFVGSAPK LAAFGFVMRL LVEGLGAMSG DWQGMLIILA ITSMVIGNIA AIAQSNIKRM
LAYSTISHMG FMLLGLIGAN ENGYSAAMFY VVVYVLMTMG TFGIIMLLSR AGFEADKLDD
YKGLNRRNPW YAFIMLLLMF SMAGIPPTVG IYAKLSVLQA VLNAGYTWLA VLAVLLSLIG
VFYYLRIVKL MYFDEPETDA VIAPKGDVKV LLSANGLAIL AFGIFPQSLI ALCTYAIQQS
A