Gene Hhal_2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2204 
Symbol 
ID4710971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2418779 
End bp2419819 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content70% 
IMG OID639856679 
Productalpha/beta hydrolase fold 
Protein accessionYP_001003770 
Protein GI121998983 
COG category[I] Lipid transport and metabolism 
COG ID[COG3243] Poly(3-hydroxyalkanoate) synthetase 
TIGRFAM ID[TIGR01836] poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000187049 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAGA GGCCGTCCGG CAACGGACTG ACCGACTGGC AGCGCCGGCT GATCGAGACG 
CTGGACCAGG CGGCAGCACT GCCCGTGGAC ACCCGCGGCG CCACCCCGTT CACCCACCAC
GCCGAGGTGG GGCCGGGCAT GCACCTGCGC CGCTACAGCC CGACCCACGG CGCCCGACAA
CGCCCGGTAC TGATCGTCTA CTCGCTGGTC AACCGCCCGT TCATCCTCGA TCTGACCGAG
CGCCGCTCAC TGATCGCCGC CCTGACCCGG GCCGGCCACC CGGTCTACCT CCTCGACTGG
GGGTACCCGA AGGGCGCCGA TCGCTTCCTC GGCCTGGCGG ATTACATCGA GGGCTTTCTC
GCGGCAGCGG CCGACGAGGT CGCCGCCAGC GAGGGGACCA CACCGGACCT GCTCGGTGTC
TGCCAGGGGG GGGTCTTCGC GCTGTGCCTG GCCGCCCTGC AGCCGCAACG GGTCCACCGG
CTGGTGAACC TGGTAACCCC GGTGGACTTC CACACCCCCG GCGACAACCT CAGTCGCATG
GCGCGGGAGG TCGACTTCGA CCAGGCGGCG CGGTCCCTCG GCAACATCTC GGCGGAGTGG
CTCAACGGCG TCTTTGTCGC CCTGAAACCC TACCGACTCC TGGCCCAGCG CTACATGGAC
CTGCCCGAGC TGGCCGACCA CCCGGAGGCC CTCCACGACT TCCTGCGCCT AGAGCGCTGG
ATGTACGACA GCCCGGACCA GGCAGCCACC GCGTTTGCTG AATTCGGCCG CGAATGCTAC
CAGCGCAACG GACTGATCCA GGGCACACTG CAGCTCGACG GCCAGCCCGT GCGACTGGCC
AACATCGAGC ACCCGATCCT GAACGTCTAC GCCGAACAGG ACCACCTGGT CCCCGCCGAC
GCCGCCCGCG CCCTGGGCAC ACACGTGGGT TCAGGGGATT ATGGCGAGCT GACCTTCCCC
GGGGGGCACC TGGGCGTATT CATCAGCCGC CGTGCCCACG CGGAACTCCT GCCGCGCATC
GTGGCCTGGC TGGCGGAATG A
 
Protein sequence
MAERPSGNGL TDWQRRLIET LDQAAALPVD TRGATPFTHH AEVGPGMHLR RYSPTHGARQ 
RPVLIVYSLV NRPFILDLTE RRSLIAALTR AGHPVYLLDW GYPKGADRFL GLADYIEGFL
AAAADEVAAS EGTTPDLLGV CQGGVFALCL AALQPQRVHR LVNLVTPVDF HTPGDNLSRM
AREVDFDQAA RSLGNISAEW LNGVFVALKP YRLLAQRYMD LPELADHPEA LHDFLRLERW
MYDSPDQAAT AFAEFGRECY QRNGLIQGTL QLDGQPVRLA NIEHPILNVY AEQDHLVPAD
AARALGTHVG SGDYGELTFP GGHLGVFISR RAHAELLPRI VAWLAE