Gene Hhal_1198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1198 
Symbol 
ID4710362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1302640 
End bp1303746 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content70% 
IMG OID639855671 
Producttransaldolase 
Protein accessionYP_001002775 
Protein GI121997988 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0176] Transaldolase 
TIGRFAM ID[TIGR00876] transaldolase, mycobacterial type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.178231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAGA ATCCGCTTCG AGGGCTGGCC GGCGTCGGGC AGAGCGTCTG GTACGACAAT 
ATCCATCGCG GCATGCTGCC GGCGGAGTTG GAGCGCCTGG TCCGGGAGGA CGGGCTCAGC
GGCGTGACGA CCAACCCGGC GATCTTCCAG AAGGCCATCG GTGGGACCGA GGCCTATGAT
GAGGCCATCA CCGCGGCGGT CCAGCAGGTC GGTCGCGACC CGGAAGCGGT CTTCGAGGCC
CTGGCCATGG GTGATATCCG GGCTGCGGCG CAGGTGTTGC GCCCACAGTA CACGCAGAGC
GATGGCTTGG ATGGCTATGT GAGCATGGAG GTCTCCCCGC AACTGGCCGA CGACGCCCGC
GGGACTGTGA CGGAAGCCCT GCGCCTCTAT GCGGATGCCG GTGAGCCCAA CGTGATGATC
AAGGTCCCCG CGACGCCCGC CGGCGTTGAG GCCTTCGAGG AGCTGACGGT GCGGGGCATC
CCGGTCAACG TGACGCTGAT CTTCGGGGTG GAGCGCTACC GGCAGGTTGC CGAAGCCTAT
GTGCGCGGTC TGGAGCGCCG CCGCCAGGCG GGCGATTCGG TCGCCGAGCC GGCCTCGGTG
GCGAGCCTGT TCATCAGCCG GCTCGATGCC AAGATCGACC CGCTGCTTGC CGACAGCGGT
TCCGCGGATG TGCAGCCGGG GCAGGCCGCG ATCGCCAACG CCCGGGTCGC CTACTCGGTG
TACCGGGAGA TCTTCCACGG CGCCCCCTTC GCTGCACTGG CCGAGGCCGG GGCGCGGCCG
CAGCGCCTGC TCTGGGCCAG TACCGGGGTC AAGGGCGAGC GGTATCCCGA GACTTACTAT
GTCGAGGCGC TGGCCGGCCC GGAGACGGTC ACCACCCTGC CGCCAGCCAC CTACGAGGCG
TACCGCCGCG ATGGTCAGCC GCGGGAGCAG CTGACTGCGC AGCTCGAGCA GGCCCCCGCC
GTGCTCGAGG CCCTGCGCGC CGGGGGGATC GACCTCGACG CCATCCTCGA CGAGCTCGAG
CGCGAGGGGA TCGACGCCTT CGTTCAGGCC CATCGGACCC TGCTCGATGT GCTCGAGAAA
AAGCTGCAGG CCGCCGCCGC GACGTGA
 
Protein sequence
MTENPLRGLA GVGQSVWYDN IHRGMLPAEL ERLVREDGLS GVTTNPAIFQ KAIGGTEAYD 
EAITAAVQQV GRDPEAVFEA LAMGDIRAAA QVLRPQYTQS DGLDGYVSME VSPQLADDAR
GTVTEALRLY ADAGEPNVMI KVPATPAGVE AFEELTVRGI PVNVTLIFGV ERYRQVAEAY
VRGLERRRQA GDSVAEPASV ASLFISRLDA KIDPLLADSG SADVQPGQAA IANARVAYSV
YREIFHGAPF AALAEAGARP QRLLWASTGV KGERYPETYY VEALAGPETV TTLPPATYEA
YRRDGQPREQ LTAQLEQAPA VLEALRAGGI DLDAILDELE REGIDAFVQA HRTLLDVLEK
KLQAAAAT