Gene Nmul_A1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1094 
Symbol 
ID3784709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1259407 
End bp1260660 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content54% 
IMG OID637811179 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_411789 
Protein GI82702223 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.402289 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGA TACGTAATTA CACGATGAAC TTCGGCCCGC AGCATCCGGC CGCCCACGGT 
GTCTTGCGAC TGGTGCTGGA GCTCGACGGG GAAGTCATAC AACGTGCCGA TCCGCACATC
GGTCTGCTGC ATCGTGCAAC GGAAAAGCTT GCCGAGTACA AGACATATAT CCAGTCAGTC
CCTTACATGG ACAGATTGGA CTACGTCTCC ATGATGGCCA ACGAGCACGC GTATGTGATG
GCGATCGAGA AGTTGCTTCA ACTCGAAGTG CCCATACGCG CGCAGTATAT CCGGGTGATG
TTCGACGAAA TCACGCGGAT ACTCAATCAT CTTCTGTGGC TGGGAGCACA CGCGCTGGAT
GTGGGCGCGA TGACGGTATT CCTTTACGCT TTCCGTGACC GGGAAGACTT GATGGATGCC
TACGAATCGG TTTCGGGCGC AAGGATGCAT GCCGCTTACT ATCGGCCTGG CGGCGTTTAT
CGGGACTTGC CGGATTCAAT GCCGCAGTAT AAGGCGTCAA AAATTCACGA CGAAAAAACA
ACCAAGGCGC GCAATGAAAA CCGCCAGGGT TCGCTGCTCG ATTTCATCGA AGACTTCACC
AACCGGTTTC CCACGTACGT CGATGAGTAC GAGACCCTGC TTACCGATAA CCGTATCTGG
AAACAGAGAC TGGTAGGCAT TGGAACGGTT TCGCCCGAGC GGGCCATGGC TCTGGGATTC
ACCGGCCCCA TGCTGCGCGG GTCCGGCGTC GAATGGGATC TGCGGAAGAA GCAGCCCTAT
GAAGTTTATG ATCAGCTCGA TTTCGATATA CCTGTCGGCG TCAATGGGGA TTGCTACGAC
CGCTATCTGG TCCGGATCGA AGAATTCCGG CAGTCCAATC GCATCATCAG GCAATGTGTC
GACTGGCTTC GTAAAAATCC GGGGCCGGTC ATAACGGATA ACCACAAGGT CGCGCCGCCT
TCCCGTGTGA ACATGAAGCA GAACATGGAG GAACTGATCC ATCATTTCAA GCTTTTCACT
GAAGGGTTTC ACGTGCCGCC CGGTGAAACC TATGCGGCAG TCGAGCACCC GAAAGGGGAA
TTCGGCATTT ACCTGATATC GGATGGCGCC AACATGCCTT ACCGCATGAA AATCCGCGCT
CCCGGCTTTG CCCATCTGGC AGCGCTGGAC GAGATGTCGC GCGGCCATAT GATTGCCGAT
GTGGTTGCCA TCATTGGTAC CCAGGATATT GTGTTTGGTG AAATAGACAG ATGA
 
Protein sequence
MAEIRNYTMN FGPQHPAAHG VLRLVLELDG EVIQRADPHI GLLHRATEKL AEYKTYIQSV 
PYMDRLDYVS MMANEHAYVM AIEKLLQLEV PIRAQYIRVM FDEITRILNH LLWLGAHALD
VGAMTVFLYA FRDREDLMDA YESVSGARMH AAYYRPGGVY RDLPDSMPQY KASKIHDEKT
TKARNENRQG SLLDFIEDFT NRFPTYVDEY ETLLTDNRIW KQRLVGIGTV SPERAMALGF
TGPMLRGSGV EWDLRKKQPY EVYDQLDFDI PVGVNGDCYD RYLVRIEEFR QSNRIIRQCV
DWLRKNPGPV ITDNHKVAPP SRVNMKQNME ELIHHFKLFT EGFHVPPGET YAAVEHPKGE
FGIYLISDGA NMPYRMKIRA PGFAHLAALD EMSRGHMIAD VVAIIGTQDI VFGEIDR