Gene Information |
Locus tag | Nmul_A0359 |
Symbol | |
ID | 3784551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 391164 |
End bp | 392516 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810435 |
Product | branched-chain alpha-keto acid dehydrogenase subunit E2 |
Protein accession | YP_411059 |
Protein GI | 82701493 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form |
|
|
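The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 391164
end_bp = 392516

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1353 450"
```

Under these assumptions the result agrees with the listed Gene Length (1353 bp) and Protein Length (450 aa).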
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.582345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGAGA CTAAACAGGT ATTGATTCCC GATATTGGCG ACTTTAAAGA CGTTCCCATA ATAGAAGTGC TGGTGAAAGC CGGCGATTCG ATCAAGGCGG AAGACTCCCT GATTGTGCTT GAATCGGACA AGGCCACCAT AGAAGTGCCT TCTCCTTTTG CTGGCATAAT CAGAGAGTTA TCCGTAAAGG TGGGCGATAA GGTATCGGAA GGTTCGCCCA TCCTGACACT GGAAGCTTCA GAAGCGGAGC AAGCGCCGCC CGCTGAGCCA CGAGAAGCGG CACCCGCTTC AACGCCGGCT CCCGCTCCAA CCACCGCTTC TCCGGAACAG GCGCCGCGGC CGGCGACGCA ACCTCGTGCG CAATCCCAAT CTTCTGCGCA ACCCCAATCA TCGGGTTCTT CTCCACGCTC TGCATTCGTT CCTTCTCCGA TAGATGAAGC TACGTTTGCG AAGGCACACG CCAGTCCTTC GGTTCGACGC TTTGCGCGCG AACTTGGCGT GAATCTGGGG CTGGTAAAGG GTAGCGGTGC CAAGCAGCGC ATTCTCAAGG AAGATGTGCA GTCTTTTGTC AAGACTGAAC TCTCCAAGCC AAGGGGTAGC GGGACCGAGC TCAATCTGCT GCCCTGGCCT CAACCCGATT TTGCGAAATT CGGTCCCGTG GAATTTAAGC CGCTATCGCG GATCAAAAAA ATATCCGGAG CGAATCTGCA CCGCAACTGG GTCATGATTC CACACGTGAC GCAGTTCGAC GAAGCGGATA TTACCGAGCT GGAAACCCTG CGCAAGGAAA CAAACGAATC TTCAAAAGAA GAAGGGGTGA AAGTTACTCT GCTCGCGTTT CTCCTGCGGG CTTCGATAGC GGCCCTAAAG AAGTTTCCCG AGTTCAATGC CTCGCTGACC AGTGAAGGTG ATGAAATGAA TCTCGTGGTC AAGAATTATT ACCATCTCGG CTTTGCAGCG GATACACCTC ATGGACTGGT GGTCCCCGTG ATTCGGGATG TGGAAAAGAA AGGGGTCATC GCCATTGCCA AGGAAATGTC TGATCTCGCA GCTTCGGCGC GGGCAGGCAA ACTCAAGCCC ACCGATATGC AGGGGGCGAG TTTTACCATT TCCAGCCTCG GGGGCATCGG CGGCACTGCG TTCACGCCCA TTATCAATGC GCCGGAAGTG GCGATTCTCG GTGTCTCGCG CGCAGTGATG AAGCCTGTTT ATCGGGATGG CGAATTTGTC CCGCGCCTGA TGCTGCCATT ATCCCTTTCC TATGATCATA GAGTAATCGA CGGGGCGACA GCAGCGCGCT TTACGACGCA CCTGGTCGAA GTGCTGGCTG ATCTGCGTCG TGTGCTGTTG TAA
|
Protein sequence | MAETKQVLIP DIGDFKDVPI IEVLVKAGDS IKAEDSLIVL ESDKATIEVP SPFAGIIREL SVKVGDKVSE GSPILTLEAS EAEQAPPAEP REAAPASTPA PAPTTASPEQ APRPATQPRA QSQSSAQPQS SGSSPRSAFV PSPIDEATFA KAHASPSVRR FARELGVNLG LVKGSGAKQR ILKEDVQSFV KTELSKPRGS GTELNLLPWP QPDFAKFGPV EFKPLSRIKK ISGANLHRNW VMIPHVTQFD EADITELETL RKETNESSKE EGVKVTLLAF LLRASIAALK KFPEFNASLT SEGDEMNLVV KNYYHLGFAA DTPHGLVVPV IRDVEKKGVI AIAKEMSDLA ASARAGKLKP TDMQGASFTI SSLGGIGGTA FTPIINAPEV AILGVSRAVM KPVYRDGEFV PRLMLPLSLS YDHRVIDGAT AARFTTHLVE VLADLRRVLL
|
| |
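The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below illustrates that relationship. It is a hedged example rather than the method used to generate this record: the function name `translate_cds` is hypothetical, the codon table is built from the standard NCBI amino-acid ordering, and the start-codon set is the one listed for table 11.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses
# the same amino-acid assignments and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence):
    """Translate a coding sequence, ignoring whitespace between blocks."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for index in range(0, len(dna) - 2, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine.
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            # Stop codon: end of the protein.
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, spaced as in the record.
print(translate_cds("GTGGCGGAGA CTAAACAGGT ATTGATTCCC"))  # prints "MAETKQVLIP"
```

The output matches the first ten residues of the listed protein sequence. The sketch raises a `KeyError` on ambiguous bases such as N, which do not occur in this record.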