Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2204 |
Symbol | |
ID | 4710971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2418779 |
End bp | 2419819 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856679 |
Product | alpha/beta hydrolase fold |
Protein accession | YP_001003770 |
Protein GI | 121998983 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3243] Poly(3-hydroxyalkanoate) synthetase |
TIGRFAM ID | [TIGR01836] poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000187049 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAGA GGCCGTCCGG CAACGGACTG ACCGACTGGC AGCGCCGGCT GATCGAGACG CTGGACCAGG CGGCAGCACT GCCCGTGGAC ACCCGCGGCG CCACCCCGTT CACCCACCAC GCCGAGGTGG GGCCGGGCAT GCACCTGCGC CGCTACAGCC CGACCCACGG CGCCCGACAA CGCCCGGTAC TGATCGTCTA CTCGCTGGTC AACCGCCCGT TCATCCTCGA TCTGACCGAG CGCCGCTCAC TGATCGCCGC CCTGACCCGG GCCGGCCACC CGGTCTACCT CCTCGACTGG GGGTACCCGA AGGGCGCCGA TCGCTTCCTC GGCCTGGCGG ATTACATCGA GGGCTTTCTC GCGGCAGCGG CCGACGAGGT CGCCGCCAGC GAGGGGACCA CACCGGACCT GCTCGGTGTC TGCCAGGGGG GGGTCTTCGC GCTGTGCCTG GCCGCCCTGC AGCCGCAACG GGTCCACCGG CTGGTGAACC TGGTAACCCC GGTGGACTTC CACACCCCCG GCGACAACCT CAGTCGCATG GCGCGGGAGG TCGACTTCGA CCAGGCGGCG CGGTCCCTCG GCAACATCTC GGCGGAGTGG CTCAACGGCG TCTTTGTCGC CCTGAAACCC TACCGACTCC TGGCCCAGCG CTACATGGAC CTGCCCGAGC TGGCCGACCA CCCGGAGGCC CTCCACGACT TCCTGCGCCT AGAGCGCTGG ATGTACGACA GCCCGGACCA GGCAGCCACC GCGTTTGCTG AATTCGGCCG CGAATGCTAC CAGCGCAACG GACTGATCCA GGGCACACTG CAGCTCGACG GCCAGCCCGT GCGACTGGCC AACATCGAGC ACCCGATCCT GAACGTCTAC GCCGAACAGG ACCACCTGGT CCCCGCCGAC GCCGCCCGCG CCCTGGGCAC ACACGTGGGT TCAGGGGATT ATGGCGAGCT GACCTTCCCC GGGGGGCACC TGGGCGTATT CATCAGCCGC CGTGCCCACG CGGAACTCCT GCCGCGCATC GTGGCCTGGC TGGCGGAATG A
|
Protein sequence | MAERPSGNGL TDWQRRLIET LDQAAALPVD TRGATPFTHH AEVGPGMHLR RYSPTHGARQ RPVLIVYSLV NRPFILDLTE RRSLIAALTR AGHPVYLLDW GYPKGADRFL GLADYIEGFL AAAADEVAAS EGTTPDLLGV CQGGVFALCL AALQPQRVHR LVNLVTPVDF HTPGDNLSRM AREVDFDQAA RSLGNISAEW LNGVFVALKP YRLLAQRYMD LPELADHPEA LHDFLRLERW MYDSPDQAAT AFAEFGRECY QRNGLIQGTL QLDGQPVRLA NIEHPILNVY AEQDHLVPAD AARALGTHVG SGDYGELTFP GGHLGVFISR RAHAELLPRI VAWLAE
|
| |