Gene Nmul_A0359 details

Gene Information

Locus tag: Nmul_A0359
Symbol:
ID: 3784551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 391164
End bp: 392516
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 55%
IMG OID: 637810435
Product: branched-chain alpha-keto acid dehydrogenase subunit E2
Protein accession: YP_411059
Protein GI: 82701493
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form
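
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 391164
end_bp = 392516


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1353 450
```

Both results agree with the Gene Length and Protein Length fields of the record.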


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.582345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGGAGA CTAAACAGGT ATTGATTCCC GATATTGGCG ACTTTAAAGA CGTTCCCATA 
ATAGAAGTGC TGGTGAAAGC CGGCGATTCG ATCAAGGCGG AAGACTCCCT GATTGTGCTT
GAATCGGACA AGGCCACCAT AGAAGTGCCT TCTCCTTTTG CTGGCATAAT CAGAGAGTTA
TCCGTAAAGG TGGGCGATAA GGTATCGGAA GGTTCGCCCA TCCTGACACT GGAAGCTTCA
GAAGCGGAGC AAGCGCCGCC CGCTGAGCCA CGAGAAGCGG CACCCGCTTC AACGCCGGCT
CCCGCTCCAA CCACCGCTTC TCCGGAACAG GCGCCGCGGC CGGCGACGCA ACCTCGTGCG
CAATCCCAAT CTTCTGCGCA ACCCCAATCA TCGGGTTCTT CTCCACGCTC TGCATTCGTT
CCTTCTCCGA TAGATGAAGC TACGTTTGCG AAGGCACACG CCAGTCCTTC GGTTCGACGC
TTTGCGCGCG AACTTGGCGT GAATCTGGGG CTGGTAAAGG GTAGCGGTGC CAAGCAGCGC
ATTCTCAAGG AAGATGTGCA GTCTTTTGTC AAGACTGAAC TCTCCAAGCC AAGGGGTAGC
GGGACCGAGC TCAATCTGCT GCCCTGGCCT CAACCCGATT TTGCGAAATT CGGTCCCGTG
GAATTTAAGC CGCTATCGCG GATCAAAAAA ATATCCGGAG CGAATCTGCA CCGCAACTGG
GTCATGATTC CACACGTGAC GCAGTTCGAC GAAGCGGATA TTACCGAGCT GGAAACCCTG
CGCAAGGAAA CAAACGAATC TTCAAAAGAA GAAGGGGTGA AAGTTACTCT GCTCGCGTTT
CTCCTGCGGG CTTCGATAGC GGCCCTAAAG AAGTTTCCCG AGTTCAATGC CTCGCTGACC
AGTGAAGGTG ATGAAATGAA TCTCGTGGTC AAGAATTATT ACCATCTCGG CTTTGCAGCG
GATACACCTC ATGGACTGGT GGTCCCCGTG ATTCGGGATG TGGAAAAGAA AGGGGTCATC
GCCATTGCCA AGGAAATGTC TGATCTCGCA GCTTCGGCGC GGGCAGGCAA ACTCAAGCCC
ACCGATATGC AGGGGGCGAG TTTTACCATT TCCAGCCTCG GGGGCATCGG CGGCACTGCG
TTCACGCCCA TTATCAATGC GCCGGAAGTG GCGATTCTCG GTGTCTCGCG CGCAGTGATG
AAGCCTGTTT ATCGGGATGG CGAATTTGTC CCGCGCCTGA TGCTGCCATT ATCCCTTTCC
TATGATCATA GAGTAATCGA CGGGGCGACA GCAGCGCGCT TTACGACGCA CCTGGTCGAA
GTGCTGGCTG ATCTGCGTCG TGTGCTGTTG TAA
 
Protein sequence
MAETKQVLIP DIGDFKDVPI IEVLVKAGDS IKAEDSLIVL ESDKATIEVP SPFAGIIREL 
SVKVGDKVSE GSPILTLEAS EAEQAPPAEP REAAPASTPA PAPTTASPEQ APRPATQPRA
QSQSSAQPQS SGSSPRSAFV PSPIDEATFA KAHASPSVRR FARELGVNLG LVKGSGAKQR
ILKEDVQSFV KTELSKPRGS GTELNLLPWP QPDFAKFGPV EFKPLSRIKK ISGANLHRNW
VMIPHVTQFD EADITELETL RKETNESSKE EGVKVTLLAF LLRASIAALK KFPEFNASLT
SEGDEMNLVV KNYYHLGFAA DTPHGLVVPV IRDVEKKGVI AIAKEMSDLA ASARAGKLKP
TDMQGASFTI SSLGGIGGTA FTPIINAPEV AILGVSRAVM KPVYRDGEFV PRLMLPLSLS
YDHRVIDGAT AARFTTHLVE VLADLRRVLL
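
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own pipeline: it assumes translation table 11 uses the standard codon-to-amino-acid assignments and that an alternative start codon such as GTG is read as methionine when it opens the reading frame. The start-codon set below reflects my understanding of NCBI table 11 and should be checked against the official table; the `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for table 11 (verify against the NCBI genetic code list).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # the initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "GTGGCGGAGA CTAAACAGGT ATTGATTCCC GATATTGGCG ACTTTAAAGA CGTTCCCATA"
print(translate(first_line))  # MAETKQVLIPDIGDFKDVPI
```

The output matches the first 20 residues of the protein sequence, even though the gene begins with GTG rather than ATG.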