Gene Sala_1235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1235 
Symbol 
ID4080306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1283738 
End bp1285048 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content67% 
IMG OID638009595 
Productdihydrolipoamide acetyltransferase, long form 
Protein accessionYP_616283 
Protein GI103486722 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.97413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATCG AACTCAAGAT GCCCGCTCTC TCGCCGACGA TGGAGGAGGG CACGCTCGCC 
AAATGGCTTG TGAAGGAGGG CGATGAGGTC AAGTCGGGCG ACCTGCTCGC CGAGATCGAA
ACCGACAAGG CGACAATGGA GTTCGAAGCG GTCGATGAGG GTGTGATCAG CCAGATACTC
GTCGCCGAGG GGACCGACGG CGTGAAGGTC GGCACGGTGA TCGCGGTGAT CGCGGGCGAG
GGGGAGGATG CGGGCGAAGC GAAGGCGACG CCTGCTGCTG CGCCTGCGCC GGTTCCGGCA
AAAGACGTCG CCCCAGCGGA AGCTGGGGCC GCTACCGTCA GTGCACCTCC GCCAGCGGTC
CTAGCTTCCG CTGGGACGAC GAATGTGGGC GACCGCATCA AAGCCAGCCC GCTCGCCAGG
CGCCTCGCTG CCGAGCAAGG CATCGACCTC AAAAAGCTGA CCGGCACCGG CCCCGGCGGC
CGCATCGTCA AGGCCGACCT TGAAGGCGCG CCCACAGGCG CCGCTGCATC CACTGCCGTC
GCCCCCGCGC AGGCGGGGGC CGCTGTCGGC ACGGCGCCCG CTGCCGCACC GGAACCGGCC
GGCCCGATCC CCGATTTCGG CATCCCGCAT GAGGATGAAA AGCTGTCGGG GATGCGCAAG
ACGATCGCGC GCCGCCTGAG CCAGTCGATG CAGGACGCGC CGCACATCTA CCTCACCGTC
GACATCCGCC TCGACGCGCT GCTCAAGCTC CGCGGCGAGC TTAACGCGAG CCTGGAGAGC
CGCGGGGTCA AGCTGAGCGT CAACGACATG CTGATCAAGG CGCTCGCGGT CGCGCTCGAG
CGCGTCCCGC AGTGCAACGT CAGCTTTGGC GGCGACGTGA TGCGCTTTTA CAAGCGCGCC
GACATTTCGG TCGCGGTCAG CATCCCCGGC GGCCTCATCA CCCCGATCAT CACCGATGCG
GGGGCCAAGT CGCTGTCGAA AATCTCGACC GAAATGGCCG AGCTCGCGGG CCGCGCCAAG
GAAGGCAAGC TGCAACCGCA CGAATATCAG GGCGGCACCG CCAGCATCTC GAACATGGGC
ATGATGGGGA TCAAGCAGTT CACCGCGGTG ATCAACCCGC CGCAGGCGAT GATCATGGCG
ATCGGCGCGG GCGAAAAGCG GCCCTATGTC GTCGACGATG CGCTGGCGAT CGCGACGGTC
ATGTCGGCGA CCGGCAGCTT CGACCACCGC GCGATCGACG GGGCGGACGG GGCGCTCTTG
ATGAAGACGT TCAAGGAACT GGTGGAAAGC CCGCTGGGGC TGGTGGCGTA A
 
Protein sequence
MPIELKMPAL SPTMEEGTLA KWLVKEGDEV KSGDLLAEIE TDKATMEFEA VDEGVISQIL 
VAEGTDGVKV GTVIAVIAGE GEDAGEAKAT PAAAPAPVPA KDVAPAEAGA ATVSAPPPAV
LASAGTTNVG DRIKASPLAR RLAAEQGIDL KKLTGTGPGG RIVKADLEGA PTGAAASTAV
APAQAGAAVG TAPAAAPEPA GPIPDFGIPH EDEKLSGMRK TIARRLSQSM QDAPHIYLTV
DIRLDALLKL RGELNASLES RGVKLSVNDM LIKALAVALE RVPQCNVSFG GDVMRFYKRA
DISVAVSIPG GLITPIITDA GAKSLSKIST EMAELAGRAK EGKLQPHEYQ GGTASISNMG
MMGIKQFTAV INPPQAMIMA IGAGEKRPYV VDDALAIATV MSATGSFDHR AIDGADGALL
MKTFKELVES PLGLVA