Gene Nmul_A1151 details

Gene Information

Locus tag: Nmul_A1151
Symbol:
ID: 3784207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1328140
End bp: 1329267
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 59%
IMG OID: 637811236
Product: NAD(P)(+) transhydrogenase (AB-specific)
Protein accession: YP_411846
Protein GI: 82702280
COG category: [C] Energy production and conversion
COG ID: [COG3288] NAD/NADP transhydrogenase alpha subunit
TIGRFAM ID: [TIGR00561] NAD(P) transhydrogenase, alpha subunit
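
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1328140
end_bp = 1329267
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed to be the stop.
codons = span // 3
residues = codons - 1

assert span == gene_length_bp
assert span % 3 == 0
assert residues == protein_length_aa
print(span, residues)  # prints: 1128 375
```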


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.325327
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATATAG GAATACCGGC AGAGACCCGT GGGGGAGAAA CCCGGGTTGC CGCCACGCCG 
GAGACGGTCA AGAAATTTAC CGCCAAAGGT TTGCATGTCG TTCTGGTGCA GTCGGGCGCG
GGTGCGGGCG CGAGCATAGC GGATGAGGAA TACCAGGCTG CCGGCGCAAG TATCGTGACC
GATCCCGGCG AACTGTATGG GCAATCTCAG ATCGTGCTCA AGGTGCGGGC CCCCGAAGCG
TCTGAACTCG CATTGATGCG CAAGGATGCC GTATTGGTTG GACTGCTTTC TCCACATCAG
GCCGAGGGTA TCGAAGTGCT TGCCGCTCAC GGTATAACCG CTTTTTCGAT GGAGAAACTG
CCGCGTATTT CGCGTGCCCA GAGCATGGAT GTGCTGTCGT CACAGGCCAA CATCGCCGGA
TACAAGGCGG TGATCATGGC AGCCAATATC TACCAGAAAT TTTTCCCCAT GCTGATGACA
GCGGCGGGTA CGGTAAAGGC GGCGAGAGTA CTGGTTCTGG GCGCAGGAGT GGCGGGATTG
CAGGCCATTG CCACCGCCAA ACGGCTGGGG GCGGTAATCG AAGCATTCGA TGTGCGCCCG
GCAGCCAAGG AACAGGTGGA AAGCCTGGGC GCCAAGTTTG TCGAGGTTGC GCTCAGCGAC
GAGGAAAAGG CGCAAGCGGA AACCGCAGGT GGATACGCGC GGGAAATGTC GGAGGATTAC
AAACGCCGCC AGGGCGAACT GGTGCACCAG CGCGCCTCTG CAGCCGACAT CATCATTACG
ACGGCGCTGA TTCCCGGCCG TCCGGCCCCC GTGCTGATCC GGGAAGAAAC GGTGCAGGCG
ATGAAACCGG GTTCCGTCAT TGTCGACCTG GCGGTTGAAG CCGGTGGCAA CTGTCCCTTG
TCTGAATTGA ACAAGGTCGT CGTGAAACAT GGCGTGCATC TCGTCGGCAT TGCCAATCTG
CCCGGACTGG TAGCCGCCGA TTCCAGCGCC CTGTATGCGC GCAACCTGAT GAATTTCGTG
AACCTGATGC TCGATGCAAA GACAGGCGAA CTCAACATAA ATCGTGAAGA CGAAATCATC
GCCGGAACCT TGGTATGCGC CAACGGGGAA GTCATCGGGA AAACCTGA
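
The GC content reported for this gene can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted in as text with the 10-base spacing and line breaks shown; `gc_percent` is a hypothetical helper name, not something provided by the source database.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence block into the string below.
# Only the first row is shown here, so the result will differ from the
# whole-gene figure in the record.
first_row = "ATGCATATAG GAATACCGGC AGAGACCCGT GGGGGAGAAA CCCGGGTTGC CGCCACGCCG"
print(f"{gc_percent(first_row):.1f}%")
```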
 
Protein sequence
MHIGIPAETR GGETRVAATP ETVKKFTAKG LHVVLVQSGA GAGASIADEE YQAAGASIVT 
DPGELYGQSQ IVLKVRAPEA SELALMRKDA VLVGLLSPHQ AEGIEVLAAH GITAFSMEKL
PRISRAQSMD VLSSQANIAG YKAVIMAANI YQKFFPMLMT AAGTVKAARV LVLGAGVAGL
QAIATAKRLG AVIEAFDVRP AAKEQVESLG AKFVEVALSD EEKAQAETAG GYAREMSEDY
KRRQGELVHQ RASAADIIIT TALIPGRPAP VLIREETVQA MKPGSVIVDL AVEAGGNCPL
SELNKVVVKH GVHLVGIANL PGLVAADSSA LYARNLMNFV NLMLDAKTGE LNINREDEII
AGTLVCANGE VIGKT
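
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than using a library; it relies on translation table 11 sharing the standard code's codon-to-amino-acid assignments (it differs in which start codons are permitted, which does not matter here because this gene starts with ATG). `translate` is an illustrative helper name.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence; it should match the first 20 residues above.
first_row = "ATGCATATAG GAATACCGGC AGAGACCCGT GGGGGAGAAA CCCGGGTTGC CGCCACGCCG"
print(translate(first_row))  # prints: MHIGIPAETRGGETRVAATP
```

Passing the full gene sequence block to `translate` and comparing the result with the protein sequence (with spaces and line breaks removed) extends the same check to the whole record.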