Gene Saro_1045 details


Gene Information

Locus tag: Saro_1045
Symbol:
ID: 3915827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1083306
End bp: 1085522
Gene Length: 2217 bp
Protein Length: 738 aa
Translation table: 11
GC content: 67%
IMG OID: 640443779
Product: branched-chain alpha-keto acid dehydrogenase E1 component
Protein accession: YP_496324
Protein GI: 87199067
COG category: [C] Energy production and conversion
COG ID: [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
        [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID:
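
The Gene Length and Protein Length fields can be cross-checked against the Start bp and End bp coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1083306, 1085522)
print(length_bp)                  # 2217
print(protein_length(length_bp))  # 738
```

Both results agree with the values listed in the record (2217 bp and 738 aa).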


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCTTG ATGCCGCCGA GCAGGTCCAC CGGCAGTTCC TTGATGCGCT GGACAAGGGA 
ACCGCACGAC GGCGTTCCAA CCTTGGGCTG AAGGATGTGG GGCTGGCGCC GGAGAGGGCG
GCAGCGCTGT TCCGGTCGCA AGCGCTGTCG CGCCAGCTTG ACCGGCTCAG CCGCAAATTG
CAGGCGCGGG GCGAGGGGTT CTATACGATC GGATCGTCGG GCCACGAGGG CAATGCAGTG
CTGGCCGAAG TGCTGCGCAT GGATGACATG GCGTTCCTTC ACTATCGCGA CGCCGCGTTC
CAGATCCACC GCGCCCACCG CGTGCCGGGC GAGAATCCGG CGTGGGACAT GCTGCTGAGC
TTCACGGCCA GCATGGAGGA CCCGATTTCG GGCGGGCGCC ACAAGGTGCT GGGATCCAAG
CGGCTGTTCA TTCCGCCGCA GACGTCGACC ATCGCCAGCC ACTTGCCCAA GGCGGTGGGC
GCGGCCTTTT CCATCGGCAT CGCGCGCAGG ATGGGATTCG ACGACACCGT GCTGTCGAAG
GACGGGGTCG TGCTGTGCAG CTTCGGCGAT GCTAGCGCGA ACCATTCGAC CGCGCTGGGC
GCGATCAACA CTGCGTGCTG GGCGGCGTTT CAGGGCACGC CAATGCCGAT CATCTTCCTG
TGCGAGGACA ACGGCATCGG CATCTCCACA CGCACGCCGC CGGGATGGAT CGAGGCGAAC
TTCTCGGGCA GGGCGGGGCT GAACTACATT CCCTGCGACG GATCGGACCT TGTCGATACC
TGCGCGGCCG CAAGACAGGC ACTGGAGATC GCGAGGCGGC AGCGAAAGCC GGTGTTCCTG
CACATGAAGA CGGTGCGGCT CTACGGCCAC GCGGGCAATG ACGTGCAGCT TGCCTATCGC
AGCAAGGAGG AGATCCGAGC CGAGGAAGAG CGCGATCCGC TGCTGGCGAG CGCGGCCTTG
CTGATCGAGG AAGGCGTCAT GTCGGCGGCG CAGGTGCGCG GCGTCTATGA CGAGATCGAG
GCCACGCTGG AACGGCAGGT GGAGCTTGCC ATCAAGCGCC CCAAGCTGCC CGACGCGGCG
GCGGTGATGG CCAGCATCGT GCCCCCCAGG CGCGAAGGGG CGGCGCGCCC TCAGGCTTCG
GCGCACGAGC GTGCCGCGCT CTTTGCCGAC GATGCCGCCG CGATGGACAA GCCGCAGCAT
ATGGCGAAGC TCATCAGTTG GGCCATGGCG GACCTGCTGT TGCAGTACCC CAACGCGATC
GTCTGCGGCG AGGACGTGGG GCCGAAGGGC GGGGTCTATG CCGCGACGCA AAAGCTGCAC
GCGCGGTTCG GATCGGCGCG GGTGATCAAT ACCCTCCTCG ACGAGCAGGC AATTCTCGGG
CTTGCCATCG GGGCTGCGCA CAACGGGCTG CTGCCGATGC CCGAGATCCA GTTCCTTGCC
TATGTCCACA ACGCCGAGGA CCAGATCCGG GGCGAGGCGG CGACACTCTC GTTCTTTTCG
AACGGGCAAT ACACCAACCC GATGGTCGTG CGGATCGCGG GCCTGCCCTA CCAGAAGGGG
TTTGGCGGGC ACTTCCACAA CGACAACTCG CTGGCCGTCT TCCGCGACAT TCCGGGCGTG
GTGCTGGCGG TGCCGTCGAA CGGGCGCGAT GCCGTGGCGA TGCTGCGCGA ATGCGTGAGG
CTGGCGCACG ATGAAGGGCG CGTGGTCGTG TTCGTGGAGC CGATCGCGCT CTACATGACG
CGCGACCTGC ATGAGCCGGG CGATGGCATG TGGTCGAGCG TCTACCAACC GCCGGGCGAG
GGAGAGATCG CGTTTGGCGA GATCGGCGTT TTCGACTCTG GGCGAGGCGA AGGCACCGAC
CTGGCCGTGG TGACCTATGG CAATGGCTTT TACCTTTCGC TCCAGGCGCA GAAGTTGCTG
TCAGAGCGCG GCGTTAACGT GCGGGTGATC GATCTGCGCT GGCTGGGGCC GGTGAACGAG
GCGGCGGTGC TCGATGCGGT CGCGCCGTGT TCGCGCGTGC TGGTGGTGGA CGAATGCCGG
ATCACCGGGG GGCAGAACGA GGCGCTGATG GCCCTGCTGG CCGAGCGAGC GCCGGGCAAG
GCCATCGCGC GGATGGCGGC GACCGACAGC TTCATCCCGC TCGCGCGCGC GGCAACGCAT
ACGCTGCCGA GCCGGGACGG GATCGTGGTC AAGGTGCTGG AGATGGTGCG TGGCTAA
 
Protein sequence
MSLDAAEQVH RQFLDALDKG TARRRSNLGL KDVGLAPERA AALFRSQALS RQLDRLSRKL 
QARGEGFYTI GSSGHEGNAV LAEVLRMDDM AFLHYRDAAF QIHRAHRVPG ENPAWDMLLS
FTASMEDPIS GGRHKVLGSK RLFIPPQTST IASHLPKAVG AAFSIGIARR MGFDDTVLSK
DGVVLCSFGD ASANHSTALG AINTACWAAF QGTPMPIIFL CEDNGIGIST RTPPGWIEAN
FSGRAGLNYI PCDGSDLVDT CAAARQALEI ARRQRKPVFL HMKTVRLYGH AGNDVQLAYR
SKEEIRAEEE RDPLLASAAL LIEEGVMSAA QVRGVYDEIE ATLERQVELA IKRPKLPDAA
AVMASIVPPR REGAARPQAS AHERAALFAD DAAAMDKPQH MAKLISWAMA DLLLQYPNAI
VCGEDVGPKG GVYAATQKLH ARFGSARVIN TLLDEQAILG LAIGAAHNGL LPMPEIQFLA
YVHNAEDQIR GEAATLSFFS NGQYTNPMVV RIAGLPYQKG FGGHFHNDNS LAVFRDIPGV
VLAVPSNGRD AVAMLRECVR LAHDEGRVVV FVEPIALYMT RDLHEPGDGM WSSVYQPPGE
GEIAFGEIGV FDSGRGEGTD LAVVTYGNGF YLSLQAQKLL SERGVNVRVI DLRWLGPVNE
AAVLDAVAPC SRVLVVDECR ITGGQNEALM ALLAERAPGK AIARMAATDS FIPLARAATH
TLPSRDGIVV KVLEMVRG
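
The sequence listings above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a rough sketch of how that cleanup and a GC-content calculation might look. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGTCGCTTG ATGCCGCCGA GCAGGTCCAC CGGCAGTTCC TTGATGCGCT GGACAAGGGA"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(seq[:3])                     # ATG (start codon)
print(f"{gc_fraction(seq):.0%}")   # GC content of this 60-base fragment only
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field of the record, assuming that field was computed over the same span.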