Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1819 |
Symbol | |
ID | 4711042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1992514 |
End bp | 1993698 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856289 |
Product | 4-coumarate--CoA ligase |
Protein accession | YP_001003385 |
Protein GI | 121998598 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | [TIGR02372] 4-coumarate--CoA ligase, photoactive yellow protein activation family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0064272 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGGGC TGAATGCCGA TGAGGTGCTG CGCCTGCTGC GCAGCCTGAT TCCCGGCGAA CTGGCCGAGG GGCGCGGTCA CCGCAATGAC CCGCCCGAGG GAACGGACAT CTGCGCGGAC ACCCGGCTGG ACCACACCCC CATCCGCGCC GATTCGCTGG ATCGGCTCCA TCTGGCCAGC GCCCTGAATC GCCTCTTCTG CCTCCACGAG ACCGGGGTGG AGGACCGCCT CTTGACGGTG CGCCGCATCG GCGATATCGC TGAACTGATC GCCGAGGGCA GCCAGCACAC CAGCGGCCTG AGCTTCTCCA CCTCCGGCAG CACGGGCACC CCGCAATCCC ACCATCACAG CTGGTCGGCC CTGACCCAGG AAGCCGAGGC CCTGGCCGCG GCCCTCGGCC ACCACCGACG CGTGATCGCC TGGTTGCCGC TCCACCACCT GTATGGGTTC GTCTTCGGCG TCGCCCTACC CCGCACCCTG GGCAGCACGG TGGTCGAGAG CCACGAGGCT CCCGCCGCCT TGTTCCGCAA CCCCGCACCC GACGACCTCA TCGCCAGCGT CCCGGCGCGA TGGCGCTACC TACTCGACAG CGATCACCGC TTCCCCGGCG GCACCGGCGT GAGTTCGACG GCCCCACTCG AGGCAGCCTG TCGGCACGGG CTGCCGCGGG CCGGACTGGA CGCCCTGGTC GAGGTCTACG GCGCCACGGA GACCGGCGGG ATCGGCTTGC GCTGGGCCCC CGCAGAGGAT TACCGACTCC TGCCCTACTG GCAGTGCAAC GCCGACGGCA ACCTCCGGCG CGCACTGCCC GAAGGGTCGG CCGTGACCAT CACCCCGCTG GATCGGCTCG AGTGGCTGGA CGAGCGGGTC TTCCGGCCCC GCGGGCGCAT CGATGACATC ATCCAGATCG GCGGGGTCAA CGTCTCGCCG CAGCACGTGG CACGCCGCTT CGAGAGCCAC GAGGCGGTTG CCGCCTGCGC GATACGCAGC CACGGAGAGG GCAGCCAGCG GCGGCTGAAG GCCTTCATCG TCCCGGCTCA CCCCGAGACC GACCCCGAGG AGCTACGCCA GGCGCTGGAA ACCTGGGCGT GGGAACACCT GCCGGCAGTG GAGCGGCCGA CCGACCTGCG CATCGGTCCC GAGTTGCCGC GCAACGCCAT GGGCAAGCTG CAGGATTGGG ACTGA
|
Protein sequence | MQGLNADEVL RLLRSLIPGE LAEGRGHRND PPEGTDICAD TRLDHTPIRA DSLDRLHLAS ALNRLFCLHE TGVEDRLLTV RRIGDIAELI AEGSQHTSGL SFSTSGSTGT PQSHHHSWSA LTQEAEALAA ALGHHRRVIA WLPLHHLYGF VFGVALPRTL GSTVVESHEA PAALFRNPAP DDLIASVPAR WRYLLDSDHR FPGGTGVSST APLEAACRHG LPRAGLDALV EVYGATETGG IGLRWAPAED YRLLPYWQCN ADGNLRRALP EGSAVTITPL DRLEWLDERV FRPRGRIDDI IQIGGVNVSP QHVARRFESH EAVAACAIRS HGEGSQRRLK AFIVPAHPET DPEELRQALE TWAWEHLPAV ERPTDLRIGP ELPRNAMGKL QDWD
|
| |