Gene Hhal_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1042 
Symbol 
ID4709794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1125586 
End bp1127592 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content69% 
IMG OID639855513 
Producttransketolase 
Protein accessionYP_001002620 
Protein GI121997833 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCGA CCGAACTGAG CCGACGAGAG CTCGCCAACG GCATTCGCGC CCTGGCCATG 
GACGCTGTCC AGAAGGCCAA TTCCGGCCAT CCCGGGGCGC CGATGGGCAT GGCGGATATC
GCCGAGGTGC TGTGGCGGGA CTACCTGCGC CATGCGCCGC AGAGGCCGGG CTGGTTCGAC
CGCGACCGCT TTGTCCTCTC CAATGGCCAC GGTTCGATGT TGCAGTACGC ACTGCTCCAT
CTGAGCGGTT ACGACGTCTC CGTCGACGAC CTCAAGAACT TCCGGCAGCT GCACTCGCGC
ACGCCGGGCC ACCCGGAGTA CGGCTACACC CCCGGCGTCG AGACCACCAC CGGGCCGCTG
GGCCAGGGGT TGGCCAACAG CGTCGGCATG GCGCTGGCCG AGCGGATGCT GGCGGCGCGC
TTCAACCGCC CTGGTCACGA GATCATCGAC CACTACACCT ACGTCTTCAT GGGCGACGGC
TGCCTGATGG AGGGGGTCTC CCACGAGGCC TGCTCGCTGG CCGGCACTCT CGGCCTGGGC
AAGCTGATCG CCTTCTATGA CGACAACAGC ATCTCCATCG ACGGCAACGT CGAGGGCTGG
TACACCGAGG ACGTCCGTCA GCGCTTCGAG GCCTACGGCT GGCAGGTGAT CGGTCCCATC
GACGGCCACG ACGGCGAGGC CGTGCGCCAG GCCATTGAGC AGGCGCGCGC CGACACCAGC
CGTCCCACGC TGATCGACTG CAAGACGGTG ATCGGCTTCG GTGCGCCGAA TCAGTGCGGT
ACCGCCGGCG TGCACGGTGC GCCGCTGGGC GAGGAGGAGA TCGCGGCCTG TCGCAAGGAG
CTCGGCTGGG AGCACGGTCC GTTCGAGATC CCCGACGCGC ACTACCAGGC GTGGGATGGT
CGCGGCCGGG GCGAGGAACT CCTGCAGGCC TGGGGCGAGC GCCTTGAGGC GTACCGCAAG
GAACACCCCG AGCTGGCCGA TGAGCTGGAG CGGCGGATGC GCGGCGAGCT GCCGGGCAAC
TGGCAGGAGC TGGTCGCCGA GGGCCTGGAG AAGTTCGCCG GCGGCAAGAA GGACGCCACT
CGCAAGAGCT CCCAGGCGGT GCTCGGCCAC TTCGGCCCGC ACTTGCCCGA GCTGCTGGGT
GGCTCTGCCG ACCTGACCGG CTCCAACAAC ACCTGGTGGG ACGGCTGCTC GACGGTCAGC
CAAGCGCAGG CCGACGGCAA CTACGTCTTC TACGGTGTCC GCGAGTTCGG GATGACGGCG
ATCATGAACG GCGTCGCCCT GCACGGCGGG TTCATCCCCT ACGGCGGCAC CTTCCTGATC
TTCTCCGACT ACGCGCGCAA CGCTGTGCGC ATGGCCGCGC TGATGAAGCA GCGGGTGGTG
ATGGTCTACA CCCACGACTC CATCGGCCTG GGCGAGGACG GCCCCACCCA CCAGCCGATC
GAACAGCTGG CCAGCCTGCG CCTGATCCCC AACCTGCACG TCTGGCGCCC GTGCGACGCC
CAGGAGACGG CTGCGGCCTG GGCGGCGTCC ATCGCCCGCG AGGACGGCCC GAGCATGCTG
GCGCTGTCGC GCCAGGGCAT GGAGCCGCAG CAGCGCAACG GCGAGCAAGT CCGGTCCATC
TCCCGTGGTG GCTACATCCT CAAGGAGGCC GGTTCCGGCA AGCCCGAACT GGTGATCCTG
GCCACGGGCT CCGAGGTGGA TCTGGCCATG GCGGCCCGCG AGCAGCTCGA GGCCGAGGGG
CGCGCGGTGC GCGTGGTCTC CATGCCGTGT GTCGAGGCGT TCTGTGAGCA GGACGAGGCC
TACCGGCAGC AGGTCCTGCC CGGTGACGTG CCGCGGGTGG CCGTCGAGGC CGGCGCCACC
GGCCTGTGGC AGGGCTGGGT CGGCCAGTGT GGCCTGGTGG TTGGCATCGA CACCTTCGGC
GAATCGGCGC CGGGGCCCGA GCTCTACGAG CACTTCGGCC TGACCGCGGA CAACGTCGCC
ACTGCGGGTC GGCGCGTCCT CGGCTGA
 
Protein sequence
MASTELSRRE LANGIRALAM DAVQKANSGH PGAPMGMADI AEVLWRDYLR HAPQRPGWFD 
RDRFVLSNGH GSMLQYALLH LSGYDVSVDD LKNFRQLHSR TPGHPEYGYT PGVETTTGPL
GQGLANSVGM ALAERMLAAR FNRPGHEIID HYTYVFMGDG CLMEGVSHEA CSLAGTLGLG
KLIAFYDDNS ISIDGNVEGW YTEDVRQRFE AYGWQVIGPI DGHDGEAVRQ AIEQARADTS
RPTLIDCKTV IGFGAPNQCG TAGVHGAPLG EEEIAACRKE LGWEHGPFEI PDAHYQAWDG
RGRGEELLQA WGERLEAYRK EHPELADELE RRMRGELPGN WQELVAEGLE KFAGGKKDAT
RKSSQAVLGH FGPHLPELLG GSADLTGSNN TWWDGCSTVS QAQADGNYVF YGVREFGMTA
IMNGVALHGG FIPYGGTFLI FSDYARNAVR MAALMKQRVV MVYTHDSIGL GEDGPTHQPI
EQLASLRLIP NLHVWRPCDA QETAAAWAAS IAREDGPSML ALSRQGMEPQ QRNGEQVRSI
SRGGYILKEA GSGKPELVIL ATGSEVDLAM AAREQLEAEG RAVRVVSMPC VEAFCEQDEA
YRQQVLPGDV PRVAVEAGAT GLWQGWVGQC GLVVGIDTFG ESAPGPELYE HFGLTADNVA
TAGRRVLG