Gene Synpcc7942_1068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1068 
Symbol 
ID3774000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1079478 
End bp1080773 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content60% 
IMG OID637799492 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_400085 
Protein GI81299877 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0808214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00512837 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCCACG AAGTCTTCAT GCCCGCCCTC AGCTCCACTA TGACCGAGGG CAAGATCGTC 
GAGTGGGTGA AGGCTCCGGG CGATCGTGTC GAGAAAGGTG AAACAGTCCT GATCGTCGAG
TCGGACAAAG CCGACATGGA CGTGGAATCT TTCTATGAAG GCTACCTCGC GACGATTATT
GTGCCGGCAG GCGGCAATGC TCCCGTGGGC GAAGCGATCG CTCTGATTGC AGAAACTGAA
GCCGAAATTG AAGTTGCGAA GCAACAGGCA GCCGGTGCTG GTTCCGCTGC AGCGACCCCT
GCAACCCCTG CTGCTACGGC GGCACCCGAA CCCGTCGCTG TGAGCCCTGA ACCTGTCGCT
GCTCCTACAG CAACCCGCAG CGATCGCTTG GTTGCCTCCC CGCGTGCCAA AAAACTCGCC
AAGAGCTTGG GCGTTGACCT CGCTAGCCTT ACCGGCAGTG GTCCCCACGG TCGAATTGTG
GCAGCCGATG TCGAAGCCGC CGCAGGAGTT ACTGCCAAAC CCGCGATCGC TACCCCTGTG
GCTCCTGCCG TTGTCACCGC ACCGGTTGCT GCTCCCGTTG CTACGGCTCC TGCTGCCCCC
GCGCCAACTC CCGCGATCGC GCCGGGTCAG TTCGTGCCCT ACAGCACCTT CCAGCAGGCC
GTGGTTCGCA ACATGGAAGC GAGCCTCAAC GTGCCGGTCT TCCGCGTCGG TTACACGATC
ACCACCGATG CGATCGATAG TTTGGCCAAG CAGCTCAAGC CCAAAGGCGT GACAATCACG
GTTTTGCTGG CCAAAGCCGT CGCTGCAACC CTCGCCAAGC ATCCCTTGCT CAATGCTCGG
GCGACAGAAA CTGGCGTCCA GTACAACGAA GCCATCAACG TGGCGATCGC GGTGGCTATG
GATGATGGCG GTCTGCTGAC CCCCGTACTG GGTCGTGCTG ATCAAACCGA TCTCTATAGC
TTGGCTCGCA ACTGGAAGGA TCTCGTGGCG CGATCGCGCA CCAAGCAACT CAAACCCGAG
GAATACACCA CCGGCACCTT TACCCTCTCC AATCTAGGGA TGTTTGGTGT CGATCGCTTC
GATGCGATTC TGCCGCCGGG CACCGGTGCG ATTCTGGCGA TTGGGGCTTC CAAACCGACC
CTCGTGGCCA CGGCTGACGG TCTGTTTGGC GTCAAGCGGC AGATGCAAGT TAACCTCACC
TGTGATCACC GCCACATCTA CGGCGCTCAT GCGGCTGCCT TCCTCAAGGA CTTGGCCGAC
CTGATTGAAA ACCGCCCCGA AAGCCTGACC CTCTAA
 
Protein sequence
MIHEVFMPAL SSTMTEGKIV EWVKAPGDRV EKGETVLIVE SDKADMDVES FYEGYLATII 
VPAGGNAPVG EAIALIAETE AEIEVAKQQA AGAGSAAATP ATPAATAAPE PVAVSPEPVA
APTATRSDRL VASPRAKKLA KSLGVDLASL TGSGPHGRIV AADVEAAAGV TAKPAIATPV
APAVVTAPVA APVATAPAAP APTPAIAPGQ FVPYSTFQQA VVRNMEASLN VPVFRVGYTI
TTDAIDSLAK QLKPKGVTIT VLLAKAVAAT LAKHPLLNAR ATETGVQYNE AINVAIAVAM
DDGGLLTPVL GRADQTDLYS LARNWKDLVA RSRTKQLKPE EYTTGTFTLS NLGMFGVDRF
DAILPPGTGA ILAIGASKPT LVATADGLFG VKRQMQVNLT CDHRHIYGAH AAAFLKDLAD
LIENRPESLT L