Gene Hhal_1037 details

Gene Information

Locus tag: Hhal_1037
Symbol: aceE
ID: 4709777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1118262
End bp: 1120937
Gene Length: 2676 bp
Protein Length: 891 aa
Translation table: 11
GC content: 67%
IMG OID: 639855508
Product: pyruvate dehydrogenase subunit E1
Protein accession: YP_001002615
Protein GI: 121997828
COG category: [C] Energy production and conversion
COG ID: [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component
TIGRFAM ID: [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type
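
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions, that the CDS is unspliced (as the record states), and that the gene span includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose span includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1118262, 1120937)
print(length)                  # 2676
print(protein_length(length))  # 891
```

Both values agree with the Gene Length and Protein Length fields of this record.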


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATCGA TACCCGAGTT GCGCGACGAT CCGGATCCGG AGGAGATCCA GGAGTGGCTC 
GACGCCCTGG ATGCGGTGAT CGAACACGAG GGCCCCGAGC GGGCACAGCA GATCCTGGAG
CACCTGGTCA GCAAGGGCCG CCGACGCTTG GGGCACCTAC CATTCAAGGC CACGACCGGC
TACATCAACA CCATCCCCCG CCACCTGGAG GTCCGTCCGC CGGAGTACAC CGATCACCAC
CTGGAGTGGC GCATCCGGGC GCTGGTGCGC TGGAACGCCA TCGCCATGGT GGTCGCCGCT
AACCGTGAGC ACGACGGCAT CGGTGGGCAC ATCGCCAGCT ACGCCTCGGC CTGCACGCTC
TACGAGGTCG GCTTCAACCA CTTCTGGCGG GCCCCCAACG AGGAGCAGGA CGGCGATCTG
GTCTTCTTCC AGGGCCACTC GGCACCGGGC ATCTACGCCC GCGCCTACCT CGAAGGCCGG
CTCAGCGCCG AGCAACTGGC GGGCTTCCGC CAGGATGTCG ACGGCGACGG TGTCAGCTCC
TACCCGCACC CGTGGCTGAT GCCCGACTTC TGGCAGTTCC CCACCGTCTC CATGGGCCTC
GGCCCCATCC AGGCGGTGCA GCAGGCGCGC ATGATGAAGT ACCTCCACCA CCGCGGGATT
GAGGACACCA GCGGTCGGAA GGTCTGGTGC TTCATGGGTG ATGGCGAGAT GGACGAGCCG
GAGTCCATGG GCTCCATCGG CCTCGCCGCC CGCGAGCAGC TCGACAACCT GATCTTCGTG
GTCAACTGCA ACCTGCAGCG TCTCGACGGT CCGGTACGTG GCAACGGCAA GATCATCCAG
GAACTCGAGG GCGAGTTCCG CGGCGCCGGC TGGAACGTCA TCAAGGTGAT CTGGGGCTCG
CGCTGGGACC CACTGCTGGA GCTCGACCAC GAGGGGCTCC TGCAGCAGCG CATGGAGGAG
GCCGTCGACG GCGAGTACCA GGCCTTCAAG GCGCGCGGCG GCGATTACAC GCGCAAGCAC
TTCTTCGCCA AGGATCCAGA GCTGGAGAAG ATGGTCGCCG GCATGTCGGA CTACGACATC
TACCGGCTCA ACCGCGGTGG CCACGACCCG CACAAGGTCT ACGCCGCCTA CCACGCCGCG
GTGAACCACA CCGGGCAGCC CACCGTGATC CTGGCCAAGA CGGTCAAGGG CTACGGCATG
GGCGAGGCCG GCGAGGGGCA GAACATCACC CACCAGCAGA AGAAGATGGG CGAGAACGCC
CTGCGCCGGT TCCGGGATCA CTACGAGATC CCCATCCCCG ACGAGCAGCT CAAGGAGACG
CCCTTCTACA AGCCCGACGA CGACGCCCCG GAGATGCGCT ATCTCCACGA GCGTCGCCAG
GCCCTGGGCG GCTATATGCC GGTGCGCTAC GAACGGGCCC CGGCGCTCGA GGTGCCGGAG
CTCTCCGCCT TCGACGCCCT GCTCAAAGAC AGCGGCGAGC GGGAACTCTC CACCACCATG
GCCTTCGTGC GCGCCCTGAC GGTACTCACC CGCGACAAAC AGGTCGGCCA GCGGGTGGTG
CCCATCATCC CCGACGAGGC GCGCACCTTC GGCATGGAGG GGCTGTTCCG TCAGCTCGGC
ATCTACTCCA ACGTCGGCCA GCTCTACGAG CCCGAAGACG CCGACCAGCT GATGTCCTAC
CGCGAGGCGC AGACCGGGCA GATCCTCGAG GAAGGCCTCG ACGAGGCCGG GGCCATGTCC
TCGTGGATGG CCGCGGCCAC GTCCTACGCC AACCACGGCG TCAACATGAT CCCCTTCTAC
ATCTTCTACT CCATGTTCGG CTTCCAGCGC GTGGGGGATC TGTGCTGGGC GGCTGGGGAT
ATCCAGGCCC GCGGCTTCCT CATCGGCGGC ACCGCTGGCA GGACCACCCT CAACGGCGAG
GGGCTGCAGC ACCAGGACGG CCACAGCCAC GTCCTCGCCT CGACCATCCC CAACTGCGTC
TCCTACGACC CGGCCTTCGA CTACGAGCTG GCGGTGATCG TCCAAGACGG GCTGCGGCGG
ATGTACGCCG AGCAGGAGAA CTGCTTCTAC TACCTGACCG TCTACAACGA GAACCACACC
CACCCGGCGA TGCCGGAGGG GGCCGAGGAG GGGATCCGCC GCGGCATGTA CCTCTTCCGG
GCCGGGCCCG AGAAGAAGGG GCCGCGGGTG CAGCTGATGG GCTCGGGCAG CATCTTCCGC
GAGGTGTTGG CGGCGGCGGA TCTGCTCGCC AACGACTTCG GCGTCCACGC CGACATCTGG
AGCTGCCCCT CGTTCACCGA GTTGGCCCGC GACGGGATGG TCTGCGCCCG GGCCAATCGG
CTCCACCCGG AGGCGGAGCG CCGCCAGTCC TACCTGCAGG CGTGCCTGGA GAGGTACAGC
GGTCCGGCGG TCGCCGCCAC GGACTACATG CGCGCCTACC CCGATCAGAT CCGCCCCTAC
ATCGGGCGCA AGTTCTGGTC CCTGGGGACC GACGGTTTTG GCCGCTCGGA TACCCGGGAG
AAGCTCCGGC GCTTCTTTGA GGTGGACCGC CACCACATCG CCGCAGCGGC CCTCTACGCC
CTCGCCGATG AGGGCACCAT CGAGGCCGCG AAGGTCAGCG AGGCCATCGG CAAGTACCAG
CTGGCGGCGG ATGCCCCGAA CCCGTGGGAG GTGTGA
 
Protein sequence
MQSIPELRDD PDPEEIQEWL DALDAVIEHE GPERAQQILE HLVSKGRRRL GHLPFKATTG 
YINTIPRHLE VRPPEYTDHH LEWRIRALVR WNAIAMVVAA NREHDGIGGH IASYASACTL
YEVGFNHFWR APNEEQDGDL VFFQGHSAPG IYARAYLEGR LSAEQLAGFR QDVDGDGVSS
YPHPWLMPDF WQFPTVSMGL GPIQAVQQAR MMKYLHHRGI EDTSGRKVWC FMGDGEMDEP
ESMGSIGLAA REQLDNLIFV VNCNLQRLDG PVRGNGKIIQ ELEGEFRGAG WNVIKVIWGS
RWDPLLELDH EGLLQQRMEE AVDGEYQAFK ARGGDYTRKH FFAKDPELEK MVAGMSDYDI
YRLNRGGHDP HKVYAAYHAA VNHTGQPTVI LAKTVKGYGM GEAGEGQNIT HQQKKMGENA
LRRFRDHYEI PIPDEQLKET PFYKPDDDAP EMRYLHERRQ ALGGYMPVRY ERAPALEVPE
LSAFDALLKD SGERELSTTM AFVRALTVLT RDKQVGQRVV PIIPDEARTF GMEGLFRQLG
IYSNVGQLYE PEDADQLMSY REAQTGQILE EGLDEAGAMS SWMAAATSYA NHGVNMIPFY
IFYSMFGFQR VGDLCWAAGD IQARGFLIGG TAGRTTLNGE GLQHQDGHSH VLASTIPNCV
SYDPAFDYEL AVIVQDGLRR MYAEQENCFY YLTVYNENHT HPAMPEGAEE GIRRGMYLFR
AGPEKKGPRV QLMGSGSIFR EVLAAADLLA NDFGVHADIW SCPSFTELAR DGMVCARANR
LHPEAERRQS YLQACLERYS GPAVAATDYM RAYPDQIRPY IGRKFWSLGT DGFGRSDTRE
KLRRFFEVDR HHIAAAALYA LADEGTIEAA KVSEAIGKYQ LAADAPNPWE V
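
The sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of how the listing could be cleaned, its GC fraction computed, and the reading frame translated. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in the start codons it permits). Only the first and last lines of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 0 of a cleaned DNA sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGCAATCGA TACCCGAGTT GCGCGACGAT CCGGATCCGG AGGAGATCCA GGAGTGGCTC"
last_line = "CTGGCGGCGG ATGCCCCGAA CCCGTGGGAG GTGTGA"

print(translate(clean(first_line)))  # MQSIPELRDDPDPEEIQEWL
print(translate(clean(last_line)))   # LAADAPNPWEV
print(gc_content(clean(first_line)))
```

The two translated fragments match the start and end of the protein sequence listed above. Running `clean` on the complete gene listing and passing the result to `gc_content` would give a value to compare with the GC content field.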