Gene Nmul_A0358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0358 
SymbolaceE 
ID3784550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp388273 
End bp390942 
Gene Length2670 bp 
Protein Length889 aa 
Translation table11 
GC content56% 
IMG OID637810434 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_411058 
Protein GI82701492 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCTA TACCGGATAT AGACCCGACT GAAACCCAGG AATGGCTGGA GGCGCTGGAA 
TCCGTGTTGA CGCACGAGGG GACGGAGCGG GCGCACTATC TGCTGGAAAG ACTGGTGGAA
AAAGCGCGCC GGTCAGGTGC CTATCTTCCC TATAGCGCCA CCACGGCTTA TATCAATACC
ATTCCTCCGG GAAAGGAAGA GTGGTCACCG GGAAATCATG CGCTGGAGCA CCGCATCCGT
TCCTATGTCC GCTGGAATTC CATGGCCATG GTGTTGCGCG CCAACCGGAA TTCCAATGTC
GGCGGGCATA TTGCCAGCTT TGCCTCGGCC GCTACGCTAT ATGATGTCGG CTACAACCAT
TTCTGGCATG CCAGGTCGGA GAATCACGGT GGCGACCTGA TATTTGCACA GGGGCATTCA
TCGCCCGGTC TTTATGCCTA TGCTTTTCTG CTGGGGGAAC TGACAGAGGA GCAGCTCAAT
AATTTCCGGC GGGAGGTGGG GGGAAAGGGA CTGTCTTCCT ATCCGCATCC ATGGCTGATG
CCTGACTTCT GGCAGTTCCC GACGGTATCA ATGGGACTTG GGCCGCTGAT GGCGATATAT
CATGCGCGCT TCATGAAGTA TCTGGATAGC CGCGGACTGG TCAAGACGGA GGGCCGAAAA
GTGTGGGCCT TCATGGGCGA TGGAGAAATG GATGAACCTG AATCACTGGG GTCCATTTCG
CTTGCGTCGC GCGAGAACCT GGACAACCTG ATTTTTGTCA TCAACTGCAA CCTTCAGCGT
CTCGACGGGC CGGTGCGTGG AAATGGCAAG ATCATTCAGG AACTGGAAGC CGCTTTTCGC
GGTTCGGGCT GGAACGTCAT CAAGGTAATC TGGGGCTCAT ACTGGGACCC ATTGCTTGCC
AAAGACACGA AGGGGTTGCT GCAACAACGC ATGATGGAAT GCGTGGATGG CGAATATCAG
ACTTTCAAGT CGAGAGACGG CGCCTATGTG CGTGAACATT TCTTCGGCAA ATATCCGGAG
CTGCTCGAGA TGGTGGCGAA TATGTCCGAC GATGACATAT GGCGCCTGAA CCGGGGCGGA
CACGATCCGC ATAAGGTCTA TGCCGCGTAT TCAGCGGCGG TGAAGCACAA GGGCCAGCCC
ACCGTGATCC TCGCCAAGAC GATCAAGGGT TACGGGATGG GTGAAGCGGG CGAAGCCCAG
AATATCACTC ATCAGCAGAA AAAGATGGGT ACCACTTCAC TCAAAGCGTT TCGTGACCGT
TTCGGTCTGC CAATCAGCGA TGATGATATC GAGTCGGTAC CCTACCTGAA ATTCGACAAG
GATTCGCCGG AATCCATCTA CATGCACCAG CGACGCGAGG CATTGGGGGG GTTCATCCAT
CGTCGCCAGC GTAAGGCCGA GCCCTTGCAG ATACCCCCGC TGTCCGCCTT CGACACTTTG
CTCAAGGCAA GCGGTGAGGG AAGGGAATCT TCCACCACCA TGGCTTTTGT GCGCATCCTC
AATATCCTCA TAAAGGACAA GAACATCGGC AAGCGGGTCG TGCCGATTGT GGCGGACGAG
TCGCGCACCT TTGGCATGGA AGGCATGTTC CGGCAACTGG GCATCTGGTC TTCCACCGGG
CAGCTTTACA CGCCCGAGGA TGCTGAGCAA CTGATGTACT ATAAGGAAGA CAAGAACGGC
CAGATCCTGC AGGAAGGTAT CAATGAGGCG GGGGCCATGT CATCATGGAT GGCCGCGGCA
ACTGCCTACA GTTCTCATGG CGTGCAGATG ATCCCGTTCT ACATTTACTA TTCGATGTTC
GGCTTCCAGC GCGTGGGGGA TCTTTGCTGG GCGGCGGGAG ACATGCGCTG CCGCGGCTTC
CTGCTGGGGG GCACTGCCGG CCGCACCACA CTGAATGGCG AGGGATTGCA GCATGAGGAC
GGTCACAGCC ATCTGGCGGC ATCGACTGTT CCCAATTGCA TATCCTATGA CCCGACCTTT
GCCTACGAAC TCACGGTCAT TATCCGGGAT GGCCTGCGCC GCATGTGCGA AATGCAGGAG
GATGTCTATT ACTACATCAC AGTGATGAAC GAAAACTATT CGCATCCGGA AATGCCTGCA
GGGGCGGAAG AAGGAATTCT GAAAGGCATG TACCTGTTCC GTGAAGGCAA GCCGGCAGGA
GAAAAGGATT CCGGGTTGCG TGTTCAATTG CTCGGGTCCG GCGCCATTCT GCGCGAGGTG
ATTGCTGCGG CTGAAATACT GGAGGAGGAA TTCGGTGTGA CGGGTGATAT CTGGAGTGTA
ACCAGCTTTA CCCAGTTGAG ACGCGAAGCG CTGGCGACAA CGCGCTGGAA CATGCTGCAC
CCCACAGAAC CGGCAAGGCT GTCGCACGTC GGCACGTGTC TCAAGGACCG GGAAGGTCCA
GTGGTGGCGG CTACGGACTA CATGAAGATT TTTGCCGACC AGATACGTGA ATTTATTCCG
GGGCGATACA AGGTGCTGGG TACAGATGGA TTCGGACGTT CCGATACGCG CGAGCAATTG
CGCCGCTTCT TCGAGGTCGA CCGCCATTAC ATTACGATAG CCGCTCTGAA AGCGCTGGCC
GAAGATGGCA GAATCGAGGC GGAAAGGGTG GCGCAGGCCA TGGGCAAATT CGGCCTCGAT
CCTGATAAAC CCAATCCGAT GACCATATAA
 
Protein sequence
MEPIPDIDPT ETQEWLEALE SVLTHEGTER AHYLLERLVE KARRSGAYLP YSATTAYINT 
IPPGKEEWSP GNHALEHRIR SYVRWNSMAM VLRANRNSNV GGHIASFASA ATLYDVGYNH
FWHARSENHG GDLIFAQGHS SPGLYAYAFL LGELTEEQLN NFRREVGGKG LSSYPHPWLM
PDFWQFPTVS MGLGPLMAIY HARFMKYLDS RGLVKTEGRK VWAFMGDGEM DEPESLGSIS
LASRENLDNL IFVINCNLQR LDGPVRGNGK IIQELEAAFR GSGWNVIKVI WGSYWDPLLA
KDTKGLLQQR MMECVDGEYQ TFKSRDGAYV REHFFGKYPE LLEMVANMSD DDIWRLNRGG
HDPHKVYAAY SAAVKHKGQP TVILAKTIKG YGMGEAGEAQ NITHQQKKMG TTSLKAFRDR
FGLPISDDDI ESVPYLKFDK DSPESIYMHQ RREALGGFIH RRQRKAEPLQ IPPLSAFDTL
LKASGEGRES STTMAFVRIL NILIKDKNIG KRVVPIVADE SRTFGMEGMF RQLGIWSSTG
QLYTPEDAEQ LMYYKEDKNG QILQEGINEA GAMSSWMAAA TAYSSHGVQM IPFYIYYSMF
GFQRVGDLCW AAGDMRCRGF LLGGTAGRTT LNGEGLQHED GHSHLAASTV PNCISYDPTF
AYELTVIIRD GLRRMCEMQE DVYYYITVMN ENYSHPEMPA GAEEGILKGM YLFREGKPAG
EKDSGLRVQL LGSGAILREV IAAAEILEEE FGVTGDIWSV TSFTQLRREA LATTRWNMLH
PTEPARLSHV GTCLKDREGP VVAATDYMKI FADQIREFIP GRYKVLGTDG FGRSDTREQL
RRFFEVDRHY ITIAALKALA EDGRIEAERV AQAMGKFGLD PDKPNPMTI