Gene Information |
Locus tag | Saro_1974 |
Symbol | |
ID | 3917292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2090753 |
End bp | 2092093 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444724 |
Product | branched-chain alpha-keto acid dehydrogenase subunit E2 |
Protein accession | YP_497248 |
Protein GI | 87199991 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | |
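The coordinate and length fields above are mutually consistent, and this can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2090753, 2092093)
print(length_bp)                  # 1341
print(protein_length(length_bp))  # 446
```

Both results match the Gene Length (1341 bp) and Protein Length (446 aa) fields listed in the record.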
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.015684 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAACCT ACACATTCCG CCTGCCCGAT ATTGGCGAGG GTATCGCCGA GGCAGAAATC GTCGCCTGGC ATGTCAAGGT CGGCGACACT GTCGAGGAAG ACGGTCGCCT GGCTGACATG ATGACCGACA AGGCCACGGT CGAGATGGAA AGCCCGGTCG CGGGCAAGGT CGTCTCGGTT GCGGGGGAAG TTGGCGATGT CGTGGCGATC GGCTCGGCGC TGGTTGTGAT CGAGACCGAG GGGGAGGACG AGGCACCGGC GCCTGCTGCG GCGCCCGCGC CCAAGGCGGC GATCGTCGAA GAGCGCATCG AGGTCGAAAC GCCCGAGCCA CCGCAACCGC CATCACCGCC CCAGCCGCTG TTCGTTTCGC GCGAAGTCGA GGCACCGCCC GCAGTGCCGG CTACAGGTTC TGGCGTGGCG CCTGGCCCGC GTGCCTCGAC CGCGCCTGAC ACGATCGGTG GGGCGGGGGC AAAGGTCCTC GCCAGTCCGG CCGTGCGGCA GCGTGCCCGC GATCTTGGCA TAGACCTGTC GGAAGTCCGT CCGTCTGAGG AAGGCCGCAT TCGCCACGCC GACCTCGATC AGTTCCTCTC CTACAATGCC TCTGGCGGTT ACCGTGCAGC CGGTGCCGAG CGCGGCGACG AAGTGATCAG GGTCATCGGT ATGCGGCGAC GCATCGCCGA GAACATGGCC GCGTCGAAAC GACACATCCC GCACTTCTCC TACGTCGAGG AATGCGATGT GACCGCGCTT GAAATCATGC GGGAACAACT CAACGCGGGC CGGGGCGACA AGCCCAAGCT GACGATGTTG CCCCTGCTTA TCACCGCGAT CTGCCGTGCT CTGCCGCAGT ACCCGATGAT CAACGCCCGC TATGACGACG AGGCCGGCGT GGTTACCCGC TATGGTGCGG TGCATCTCGG CATGGCGGCG CAAACGCCTG CGGGCCTTAT GGTGCCTGTC ATCCGCAACG CCCAGACCCT GAATCTCTGG CAACTCGCCC GCGAGATTGT CCGCCTGGCA GAGGCCGCGC GCAGCGGCAG CGCAAAATCG GACGAGCTTT CCGGTTCGAC GTTGACGGTG ACGTCCCTTG GCCCACTTGG CGGCGTGGCG ACCACGCCGG TCATCAACCG CCCGGAAGTT GCCATCATCG GGCCCAATCG CATCGTCGAG CGGCCGATGT TCGTGTCCGA TGGCATGGGG GGCGAGCGGA TCGAAAAGCG CAAGCTGATG AACATCTCGA TCAGTTGCGA CCATCGCGTG GTCGATGGCC ACGATGCGGC AAGTTTCATC CAGGCGGTGA AGAAGCTGAT CGAAACGCCG GTGCTGCTGC TGGCGGACTG A
Protein sequence | MGTYTFRLPD IGEGIAEAEI VAWHVKVGDT VEEDGRLADM MTDKATVEME SPVAGKVVSV AGEVGDVVAI GSALVVIETE GEDEAPAPAA APAPKAAIVE ERIEVETPEP PQPPSPPQPL FVSREVEAPP AVPATGSGVA PGPRASTAPD TIGGAGAKVL ASPAVRQRAR DLGIDLSEVR PSEEGRIRHA DLDQFLSYNA SGGYRAAGAE RGDEVIRVIG MRRRIAENMA ASKRHIPHFS YVEECDVTAL EIMREQLNAG RGDKPKLTML PLLITAICRA LPQYPMINAR YDDEAGVVTR YGAVHLGMAA QTPAGLMVPV IRNAQTLNLW QLAREIVRLA EAARSGSAKS DELSGSTLTV TSLGPLGGVA TTPVINRPEV AIIGPNRIVE RPMFVSDGMG GERIEKRKLM NISISCDHRV VDGHDAASFI QAVKKLIETP VLLLAD
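The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal Python sketch of that clean-up, together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and this is not necessarily how the GC content value reported in the record was computed.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence shown in this record.
example = "ATGGGAACCT ACACATTCCG"
dna = clean_sequence(example)
print(len(dna))          # 20
print(gc_fraction(dna))  # 0.5
```

Applying the same two functions to the full Gene sequence field gives a value that can be compared against the reported GC content.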