Gene Information |
Locus tag | Saro_1975 |
Symbol | |
ID | 3917293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2092097 |
End bp | 2093152 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640444725 |
Product | branched-chain alpha-keto acid dehydrogenase E1 component |
Protein accession | YP_497249 |
Protein GI | 87199992 |
COG category | [C] Energy production and conversion |
COG ID | [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit |
TIGRFAM ID | |
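The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not translated. The record does not state either convention explicitly, although its numbers are consistent with both. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


# Values copied from the Saro_1975 record.
length = gene_length_bp(2092097, 2093152)
residues = protein_length_aa(length)
print(length, residues)
```

For this record the computed values agree with the listed Gene Length (1056 bp) and Protein Length (351 aa).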
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.230446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACCTACG AAGAATCTCC GGTCAGCCTC GCCGAGGCCC CGACCCGTCG TCTGAACATG ATCGAAGCGA TCAATGACGC TCTCGACATC ATGATGGAGC GTGATCCCAA CGTCGTGGTC ATGGGCGAGG ACGTCGGTTA TTTCGGCGGC GTGTTCCGTG CGACGGCGGG ACTCCAGAAG AAATACGGCA AGACCCGCGT GTTTGACACG CCGATCAGCG AGTGCGGTAT CATCGGCGTG GCTGTCGGCA TGGGCGCTTA TGGCCTGCGC CCGGTGCCCG AGATCCAGTT CGCCGACTAC ATCTATCCAG GCCTCGATCA GCTCGTTTCG GAGGCTGCGA GGTTGCGTTA CCGTTCGGCC GGCGAATTCA TTGCGCCGAT GACGGTGCGT TCGCCATTTG GCGGCGGCAT CTTCGGCGGG CAGACCCACA GCCAAAGCCC CGAGGCGCTG TTTACCCACG TCGCCGGGCT GAAGACGGTG GTTCCTAGCA CTCCGCACGA TGCGAAGGGT CTCTTGATCG CAGCGATCGA GGATAACGAT CCGGTGATCT TCTTCGAGCC CAAGCGCATC TATAACGGGC CTTTCAACGG CTACTACGAC AAGCCTGTCG AGCCCTGGAG CAAGCATGCG GACAGCGCCG TTCCGGAGGG CTATTATTCG ATACCGCTAG GAAAGGCCCG CGTCGTGCGC CCGGGGCAGG CGTTCACTGT ATTGGCCTAT GGCACCATGG TCCACGTCGC TGCAGCGGTC TGCGCGGAGA AGGGCGTCGA TGCCGAAATC ATCGACCTCA GGACACTTGT CCCGCTGGAT ATCGAGACGG TGGAAAAGTC GGTGGAAAAG ACCGGCAAAT GCCTGATCGT CCATGAAGCC ACGCGCACTT CGGGCTTTGG CGCGGAGTTG TCCGCCCTGG TTCAGGAGCG TTGCTTCTAC CACCTCGAAG CACCGATAGA GCGCGTGACC GGCTTCGACA CACCCTATCC ACACAGCCTC GAATGGGCCT ACTTCCCTGG CCCGGTCCGC ATCGGCGAGG CCGTCGACCG ACTGATGAAG GCCTGA
Protein sequence | MTYEESPVSL AEAPTRRLNM IEAINDALDI MMERDPNVVV MGEDVGYFGG VFRATAGLQK KYGKTRVFDT PISECGIIGV AVGMGAYGLR PVPEIQFADY IYPGLDQLVS EAARLRYRSA GEFIAPMTVR SPFGGGIFGG QTHSQSPEAL FTHVAGLKTV VPSTPHDAKG LLIAAIEDND PVIFFEPKRI YNGPFNGYYD KPVEPWSKHA DSAVPEGYYS IPLGKARVVR PGQAFTVLAY GTMVHVAAAV CAEKGVDAEI IDLRTLVPLD IETVEKSVEK TGKCLIVHEA TRTSGFGAEL SALVQERCFY HLEAPIERVT GFDTPYPHSL EWAYFPGPVR IGEAVDRLMK A