Gene Hhal_1039 details


Gene Information

Locus tag: Hhal_1039
Symbol:
ID: 4709609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1121786
End bp: 1122850
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 67%
IMG OID: 639855510
Product: fructose-1,6-bisphosphate aldolase
Protein accession: YP_001002617
Protein GI: 121997830
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0191] Fructose/tagatose bisphosphate aldolase
TIGRFAM ID: [TIGR00167] ketose-bisphosphate aldolases
            [TIGR01521] fructose-bisphosphate aldolase, class II, Calvin cycle subtype
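
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the protein length excludes the terminal stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1121786
END_BP = 1122850
GENE_LENGTH_BP = 1065
PROTEIN_LENGTH_AA = 354


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, the 1065 bp span corresponds to 355 codons, that is, 354 residues plus the stop codon.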


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGATGA TCACCCTGCG ACAGCTGCTC GATCACGCAG CCGAGCACGG CTACGGCATG 
CCGGCGTTCA ACGCCAACAA CATGGAGCAG TTGCACGCCA TCATGGAGGC GGCCAAGGAG
TGCGACAGCC CGGCCATCGT CCAGGCGTCC GCCGGTGCCC GCAAGTATGC CGGTGTCCCC
TTCTATCGGC ACCTGATGGA GGCCGCCGTG GAGTCCTACC CGGACGTGCC GCTGGTGGTC
CACCTCGACC ACGGTGCCAA TCCCGGTGCC TGCATGCGCG CCATCCAGTC CGGGTTCACC
TCGGTGATGA TGGACGGCTC CCTGAAGGAA GACGGCAAGA CCCCGGCCGA CTACGACTAC
AACGCCAGCG TGACCCGGCG CGCCGCCGAG ATGGCCCACG CCGGCGGTGT CTCCGTCGAG
GGCGAGATCG GTGTGCTCGG CTCCCTAGAG ACGGGTGAGG CCGGCAAGGA GGACGGCGTC
GGCGCTGAGG GCAAGATGGA CAAGGACAAG CTCCTCACCG ATCCCGAGGA GGCGGCCCAG
TTCGTCCGCG ATACCCACGT CGACGCCCTC GCCATCGCCT GCGGCACCAG CCACGGTGCC
TACAAGTTCA CCCGCCCGCC GACGGGCGAC ATCCTGGCCA TCAGCCGGAT CAAGGAGATC
CACCAGCGTC TGCCGGACAC CCACCTGGTG ATGCACGGCA GCTCCCAGGT GCCGCAGGAG
TGGCTGGAGC TGATCAACCG CTTCGGCGGC GAGATCCCCG AGACCTACGG CGTGCCGGTC
GAGGAGGTCC AGGAGGGCAT CCGCAACGGC GTGCGTAAGG TCAACATCGA CACGGACCTG
CGCCTGGCCT CCACCGGTGC GGTGCGCAAG CACCTGGCCG AGAACCCGTC GAACTTCGAC
CCGCGTAAGT TCCTGAAGGC CTCCACCGAG GCCATGAAGG AGATCTGCAA GGCCCGCTAC
GAGGCCTTCG GTTCGGCGGG CATGGCCTCC AAGATCAAGC CGATCGCGCT GGAGACCATG
GTCGAGCGCT ACGAGTCCGG CGAGCTCGCC CCCAAGGTGA GCTGA
 
Protein sequence
MAMITLRQLL DHAAEHGYGM PAFNANNMEQ LHAIMEAAKE CDSPAIVQAS AGARKYAGVP 
FYRHLMEAAV ESYPDVPLVV HLDHGANPGA CMRAIQSGFT SVMMDGSLKE DGKTPADYDY
NASVTRRAAE MAHAGGVSVE GEIGVLGSLE TGEAGKEDGV GAEGKMDKDK LLTDPEEAAQ
FVRDTHVDAL AIACGTSHGA YKFTRPPTGD ILAISRIKEI HQRLPDTHLV MHGSSQVPQE
WLELINRFGG EIPETYGVPV EEVQEGIRNG VRKVNIDTDL RLASTGAVRK HLAENPSNFD
PRKFLKASTE AMKEICKARY EAFGSAGMAS KIKPIALETM VERYESGELA PKVS
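
The relationship between the two sequence blocks above can be spot-checked by translating the gene sequence and comparing the result with the protein sequence. The sketch below is a minimal, dependency-free translation in Python, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG). Only the first and last lines of the gene sequence are translated, and the helper names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the gene sequence, copied from the record above.
FIRST_LINE = "ATGGCGATGA TCACCCTGCG ACAGCTGCTC GATCACGCAG CCGAGCACGG CTACGGCATG"
LAST_LINE = "GTCGAGCGCT ACGAGTCCGG CGAGCTCGCC CCCAAGGTGA GCTGA"

print(translate(FIRST_LINE))  # MAMITLRQLLDHAAEHGYGM
print(translate(LAST_LINE))   # VERYESGELAPKVS*
```

Both fragments match the corresponding ends of the protein sequence, with the trailing `*` marking the TGA stop codon that is not counted in the 354 aa protein length.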