Gene Saro_1946 details

Gene Information

Locus tag: Saro_1946
Symbol:
ID: 3917261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2062376
End bp: 2063659
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 65%
IMG OID: 640444693
Product: dihydrolipoamide acetyltransferase, long form
Protein accession: YP_497220
Protein GI: 87199963
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form
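
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (the record does not state the convention) and that the CDS ends with a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2062376
end_bp = 2063659

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1284
print(protein_length)  # 427
```

Under these assumptions the computed values agree with the reported Gene Length (1284 bp) and Protein Length (427 aa).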


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCATCG CCATCAAGAT GCCCGCCCTC TCCCCGACGA TGGAGGAGGG AACTCTCGCC 
AAGTGGCTGG TGAAGGTGGG CGATAAGGTC TCTTCCGGCG ACATCATGGC CGAGATCGAG
ACCGACAAGG CGACCATGGA ATTCGAAGCG GTCGATGAAG GCACGATTGT TTCCATTGAC
GTGGCCGAAG GATCTGAAGG CGTGAAGGTC GGGACCGTCA TCGCAACCTT GGCGGGCGAG
GATGAAGATG CCAGTGCTCC CGCTCCTAAG GCCGTCGCTC CCGCTGCGGC GCCTGTGCCG
GTTCCTGCGC CCAAGGCAGA GCCCGCGCCG GCGGCGGTTT CCACCCCCGC TCCTGCAGCG
GCGTCCGCTA GCAAGGGTGA CCGCGTAATC GCCACCCCGC TGGCCAAGCG CATCGCGGCG
GACAAGGGTA TTGATCTCAA GGGTGTGGCA GGCTCTGGTC CTAATGGGCG CATTATTCGC
GCCGACGTCG AGGGTGCGAA GCCCGCCGCT GCCGCGCCAG TTTCTACCGT GGCGCCCGCG
GTCGCGTCGG CAGCCGCCCC TGCTCGTGCC CCGGCGGCCG TGCCAGACTT CGGCATCCCC
TACGAGGCGC AGAAGCTCAA CAATGTGCGC AAGACCATCG CGCGCCGCCT GACCGAGGCG
AAGCAGACGA TCCCGCACAT CTATCTCACC GTCGACATTC GCCTCGACGC GCTGCTCAAG
CTGCGCGGTG ATCTGAACAA GGCGCTCGAG GCACAGGGCG TCAAGTTGTC GGTCAACGAC
CTCATCATCA AGGCGCTGGC CAAGGCGCTG ATGCAGGTGC CCAAGTGCAA CGTCAGCTTT
GCCGGCGACG AACTGCGCAG CTTCAAGCGC GCGGATATTT CGGTGGCCGT TGCCGCGCCG
TCGGGCCTGA TTACGCCGAT CATTGTCGAT GCCGGCTCGA AGTCTGTCTC CGCCATCGCC
ACCGAGATGA AGGCGCTGGC CAACAAGGCT CGTGAGGGCA AGCTGCAGCC GCACGAGTAC
CAGGGCGGGA CCGCATCGCT TTCGAACCTC GGCATGTTCG GCATCAAGAA CTTCGATGCG
GTAATCAACC CGCCGCAGGC GATGATCATG GCTGTCGGCG CGGGCGAACA GCGCCCCTAC
GTCATCGACG GTGCGCTTGG CATCGCCACG GTCATGTCGG CGACGGGCAG CTTCGATCAC
CGCGCGATCG ACGGAGCGGA TGGCGCTGAA CTCATGCAGG CGTTCAAGAA CCTGATCGAG
AACCCGCTCG GCCTGGTCGC CTGA
 
Protein sequence
MPIAIKMPAL SPTMEEGTLA KWLVKVGDKV SSGDIMAEIE TDKATMEFEA VDEGTIVSID 
VAEGSEGVKV GTVIATLAGE DEDASAPAPK AVAPAAAPVP VPAPKAEPAP AAVSTPAPAA
ASASKGDRVI ATPLAKRIAA DKGIDLKGVA GSGPNGRIIR ADVEGAKPAA AAPVSTVAPA
VASAAAPARA PAAVPDFGIP YEAQKLNNVR KTIARRLTEA KQTIPHIYLT VDIRLDALLK
LRGDLNKALE AQGVKLSVND LIIKALAKAL MQVPKCNVSF AGDELRSFKR ADISVAVAAP
SGLITPIIVD AGSKSVSAIA TEMKALANKA REGKLQPHEY QGGTASLSNL GMFGIKNFDA
VINPPQAMIM AVGAGEQRPY VIDGALGIAT VMSATGSFDH RAIDGADGAE LMQAFKNLIE
NPLGLVA
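
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch, not the database's own method, of how the gene sequence could be translated and its GC fraction computed. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; the alternative start codons that table 11 allows are not handled here, which is adequate for this gene only because it begins with ATG.

```python
from itertools import product

# Standard codon-to-amino-acid assignment (shared by translation table 11),
# listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from above.
first_line = "ATGCCCATCG CCATCAAGAT GCCCGCCCTC TCCCCGACGA TGGAGGAGGG AACTCTCGCC"
last_line = "AACCCGCTCG GCCTGGTCGC CTGA"

print(translate(first_line))  # MPIAIKMPALSPTMEEGTLA
print(translate(last_line))   # NPLGLVA
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the result to be compared against the listed protein sequence and the reported GC content.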