Gene Hhal_1445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1445 
Symbol 
ID4711373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1557237 
End bp1558187 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content67% 
IMG OID639855912 
Product5-carboxymethyl-2-hydroxymuconate delta-isomerase 
Protein accessionYP_001003014 
Protein GI121998227 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCCGA GTGCACCACC GTCGACGCAC GGCGCCATTT GCGGGACAAT GGCGATTGGC 
AGGCCCGATC GCAACCCCAC ACGCGGAAAC GGCACCATGC GCATCACCAC CGTCCGTCAT
CAAGGCGTTT CCCGGATCGC CGTTCACGAG CAGGGCGACC GCTGGGCGGT ATCCCCCACT
CCCGGGGATC TGGGTGAACA CCTATGCGCC GGCACCGTCC CCGAAGCCGG CTACGACTGG
CCACGGGTCA CCGCCGAGGC CCTGACCTTC CTCGCGCCTC TGCCACACCC ACCGCGCAAT
GTGATCTGCC TCGGGCTGAA CTACGCCGAC CACGCCCGGG AATCCCAGCA GGCCAAGGGC
GATGAGCTCG CCCTGCCCGA AGCCCCGGTG GTCTTCACCA AGGCAACAAC CAGTGTCGCC
GGTCCCTACG ACGATTTCAT CCTCGACCCG TCCGTCACCA GCGAGCTGGA CTGGGAGGTG
GAGCTCGCGG TCGTCATAGG CCGGGGCGGA CGACACATCC GCGAACAGGA CGCCCTTCAG
CACGTCTTCG GCTACACCGT CGTCAACGAC CTCTCCGCGC GGGACCTGCA GTTCCGACAC
AAGCAGTTCT TCCTCGGCAA ATCGGTGGAC GGCAGCTGCC CGATGGGGCC CTGGATCACC
ACCGCCAATG CGGTGCCGAA CCCCCACAAC CTCGCCCTCT CCTGCCGGAT CAACGACACC
ACCGAGCAAC AGTCGCACAC CGGCGAGATG GTCTTCTCCA TCCCCAGGAT CATCGCCGAG
CTGTCACGGG TCATGACCCT GATCCCGGGG GATATCATCG CCACCGGCAC CCCCGCCGGC
GTCGGCTTTG CGCGCACGCC GCCCCGCTTC CTGCAGGCCG GCGATATCGT GACCTCCGAG
GTCGAGGGAC TCGGTACGCT GCGTAATCGC ATTGTGGCAC CGGATTCGTG A
 
Protein sequence
MAPSAPPSTH GAICGTMAIG RPDRNPTRGN GTMRITTVRH QGVSRIAVHE QGDRWAVSPT 
PGDLGEHLCA GTVPEAGYDW PRVTAEALTF LAPLPHPPRN VICLGLNYAD HARESQQAKG
DELALPEAPV VFTKATTSVA GPYDDFILDP SVTSELDWEV ELAVVIGRGG RHIREQDALQ
HVFGYTVVND LSARDLQFRH KQFFLGKSVD GSCPMGPWIT TANAVPNPHN LALSCRINDT
TEQQSHTGEM VFSIPRIIAE LSRVMTLIPG DIIATGTPAG VGFARTPPRF LQAGDIVTSE
VEGLGTLRNR IVAPDS