Gene Hhal_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1036 
Symbol 
ID4709772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1116889 
End bp1118259 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content72% 
IMG OID639855507 
Productpyruvate dehydrogenase complex dihydrolipoamide acetyltransferase 
Protein accessionYP_001002614 
Protein GI121997827 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGGAAC AGGAGCTCAA GGTACCGGAT ATCGGCGGCT TCGAGGAGGT GGAGGTCATC 
GAGGTCCTGG TTGCGCCCGG CGACCGGATC GAGGCCGAGC AGTCGCTGAT CACACTGGAG
TCCGACAAGG CGAGCATGGA GGTGCCGGCC GAGGTCGGGG GCGAGATCCG AGCGGTGCAT
GTGGCCGTGG GGGATACCGT CTCCGAGGGG AGTGTCGTTG CCACCGTTGA TCCTGTCGCC
GAGCCGGCGG AACCGGCGAC GCAGGCCGAG GCCCCGGCCG CCGCGGGTGG CCCGGCGGAG
GAAACGGCCC CGTCGGCCGA TGGCGGCGCG CCAGCGACCG CGGCCCCCGC CGCCGCGGCG
CAACCGGCTG CCTCCGCGGG CAGTGGCGGG GGCGCTGCAG CCGGAGGTGT CGACGAGTCG
CCGGCGATCG ACCGCGACGG CCATCGCGCC GCCCACGCCA GCCCCTCGGT ACGCCGCTAC
GCCCGCGAGC TCGGGGTCGA TCTCTCCCGC GTGCAGGGCA GCGGGCGTAA GGGGCGCATC
CGCCGTGAGG ACGTGGAGGC CTACGTCAAG CAGGTGATGC AGGGCCAGGA GGCGCCGCCG
GCTGGCGCCG CCGGTGCCCC CGCTGCCGAA GGGGCCGGCA TCCCGCCGAT CCCGGAGCAG
GACTTCAGCC GCTTCGGCGA GGTGGAGCGC GTGCCGCTCA CCCGTATCCA GCGCCTCTCG
GGGCCGCACC TGCACCGGAG CTGGCTGAAT GTCCCGCACG TGACCCAGTT CGACGAGGCC
GATATCACCG AGATGGAGGC GTTCCGCCAA TCTCTCAAGA AGGAGGCCGA GGCGCGGGGG
GTGAAGCTGA CCCCGCTGGC CTTCCTGGTC CGGGCGGCGG CCGCCGCCCT GGCGGAGTAT
CCGCGTTTTA ACGCCAGCCT TTCGGCGGAC GGGCAGGAGC TGATCCTCAA GCACTACTGC
CACATCGGCG TCGCCGTCGA CACCCCGGAG GGGCTGGTGG TGCCGGTGCT GCGTGACGCC
GACCAGAAAG GCGTCCTGCA GATCGCCGAG GACCTCGGCA CCCTCTCGGC CAAGGCCCGG
GACGGCAAGC TCGGTCCGGC GGACATGCAG GGCGGCTGCT TCTCCATCTC GAGCCTCGGT
GGTATCGGCG GTACCGCGTT CACGCCCATT GTCAACGCTC CGGAAGTGGC CATCCTCGGT
GTCTCCCGGT CGCAGACCCG GCCGGTGTGG GATGGGCAGA CCTTCCAGCC GCGGCTGATG
CTCCCGCTGT CGCTCTCCTA CGACCACCGG GTCATCGATG GCGCCATGGC TGCCCGCTTC
ACCAACTACC TGAGCCAGGT CCTCGGCGAC CTGCGCCGGC TGGTGTTGTA G
 
Protein sequence
MAEQELKVPD IGGFEEVEVI EVLVAPGDRI EAEQSLITLE SDKASMEVPA EVGGEIRAVH 
VAVGDTVSEG SVVATVDPVA EPAEPATQAE APAAAGGPAE ETAPSADGGA PATAAPAAAA
QPAASAGSGG GAAAGGVDES PAIDRDGHRA AHASPSVRRY ARELGVDLSR VQGSGRKGRI
RREDVEAYVK QVMQGQEAPP AGAAGAPAAE GAGIPPIPEQ DFSRFGEVER VPLTRIQRLS
GPHLHRSWLN VPHVTQFDEA DITEMEAFRQ SLKKEAEARG VKLTPLAFLV RAAAAALAEY
PRFNASLSAD GQELILKHYC HIGVAVDTPE GLVVPVLRDA DQKGVLQIAE DLGTLSAKAR
DGKLGPADMQ GGCFSISSLG GIGGTAFTPI VNAPEVAILG VSRSQTRPVW DGQTFQPRLM
LPLSLSYDHR VIDGAMAARF TNYLSQVLGD LRRLVL