Gene Information |
Locus tag | Haur_1803 |
Symbol | |
ID | 5733705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2092256 |
End bp | 2093884 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641278946 |
Product | 2,3-dihydroxybenzoate-AMP ligase |
Protein accession | YP_001544574 |
Protein GI | 159898327 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
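The coordinate and length fields above are consistent with each other, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2092256
end_bp = 2093884

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1629 bp) and Protein Length (542 aa).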
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAGC TGACGCTTGG CCTTGAAGGC ACAACTGCTT GGCCCGCCGA GTTTGCCGAA CGCTATCGTA AGGCTGGCTA TTGGGTTGAT CAGACCTTTG GCGAAGCACT ACGCGAGTGG GCGCAACGCT CGGGCGATGC CACCGCTGTG GTATGTGGTG AACGCCGTTG GAGCTACCGC GAGCTTGATC AACGGGTTGA TCGCTTGGCA GCGGGCTTGC AACAGCTCGG CATCCAAACA AAACAACGGG TGGTGGTGCA ATTGCCCAAT TGCGCCGAAT GGTTTGTGGT TTGTTTTGCA CTCTTTCGGG TTGGTGCAAT TCCGTTGATG GCACTGCCCG CCCATCGCTT GGCTGAAATT GGCTATTTTT GCCAACACAG CGAAGCCGTG GCCTATGTGA TTGCCGATAA AGTTGGCAGT TTCGATTATC GCAATTTGGC AGCTGAGGTC AAAGCGGTTG CGCCAACCTT GGAACATGTG TTGGTGGTTG GTGAGGCTGG GCCGTTTACT GCTTTGGCTG AGGTCGATGC TGAGCCAAGC GAATTTCCAA CGCTTGACCC TGCCGAGGTG GCTTTGTTCC AGCTTTCGGG CGGTAGCACG GGCGTGCCCA AATTGATTGC CCGCACCCAC GACGATTATT TGTATTCGGT GCGAGCGAGT GCTGAAATCT GCAAGTTGGA TGCCAGCAGC GTGTATTTGT GCGTCTTGCC GATGGCTCAT AACTTTCCGA TGAGTTCGCC TGGAACATTG GGAACTTTGG CGGCGGGCGG CACGGTGGTG CTTGCGCCGC AACCAAGCCC CGATGTGGCT TTTCCCTTGA TTGCTCGCGA AGGTGTGACG ATTACTGGCA TGGTTCCGCC GCTGGCATTG CTGTGGCTCG ATGCTGCGGC AAATCGCAAG GCCGAATTAT CAAGCCTCAA GCAAATTTTG GTTGGTGGTG CGCCCTTTGG TGCTTATACC GCCCGCCGCG TGCAGCCAGA GCTTGGTTGC CAATTGCAAC AAGTCTATGG TATGGCTGAG GGCTTGGTTA ATTACACACG GCTTGATGAT GCGGCTGAGC TGATTTGCCA CACCCAAGGC CGACCAATTT CGCCCTTGGA TGAGGTACGG ATTGTCGATG ATGAAGATAA TGATGTGCCG TTGGGCGAGC TTGGTCACCT GATTACCCGT GGCCCCTACA CGATTCGCGG CTATTATCGC GCTGCTGAAC ATAATCAACG GGCTTTTACC AGCGACGGTT TTTATCGGAC TGGTGATTTG GCACGCTTGA ACGCCACTGG CTATGTTTCA GTCGAAGGCC GCGCCAAAGA TCAAATCAAT CGTGGTGGCG AAAAAGTCGC TGCCGAAGAA ATCGAGCAAC ATTTGCTCAA TCACCCAGCG ATTCACGATG TGGCGCTGGT GGGCTTGCCC GACCGATTTT TGGGCGAACG CACTTGTGCG GTGATTGTCA GCAACGGTGT GAATATCAAC CGCCGCGAAG TGTTGCAATT TTTGCGCAGC CGTGGGCTTG CCGAATACAA ATTGCCAGAT CGGGTTGAAA TCGTGGAGAG TTTGCCCAAA ACTGGGGTTG GCAAGATCAA CAAACGGCTG TTACGTGAGC AATTAAGTGC TGGTCGTGTG CCAGCCTAG
|
Protein sequence | MAELTLGLEG TTAWPAEFAE RYRKAGYWVD QTFGEALREW AQRSGDATAV VCGERRWSYR ELDQRVDRLA AGLQQLGIQT KQRVVVQLPN CAEWFVVCFA LFRVGAIPLM ALPAHRLAEI GYFCQHSEAV AYVIADKVGS FDYRNLAAEV KAVAPTLEHV LVVGEAGPFT ALAEVDAEPS EFPTLDPAEV ALFQLSGGST GVPKLIARTH DDYLYSVRAS AEICKLDASS VYLCVLPMAH NFPMSSPGTL GTLAAGGTVV LAPQPSPDVA FPLIAREGVT ITGMVPPLAL LWLDAAANRK AELSSLKQIL VGGAPFGAYT ARRVQPELGC QLQQVYGMAE GLVNYTRLDD AAELICHTQG RPISPLDEVR IVDDEDNDVP LGELGHLITR GPYTIRGYYR AAEHNQRAFT SDGFYRTGDL ARLNATGYVS VEGRAKDQIN RGGEKVAAEE IEQHLLNHPA IHDVALVGLP DRFLGERTCA VIVSNGVNIN RREVLQFLRS RGLAEYKLPD RVEIVESLPK TGVGKINKRL LREQLSAGRV PA
|
| |
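The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate a fragment. It is an illustration, not the database's own method: the function names are hypothetical, and the codon table is the standard one, which shares its amino-acid assignments with translation table 11 (the two differ only in which start codons are permitted, and this gene starts with ATG).

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = "ATGGCTGAGC TGACGCTTGG CCTTGAAGGC"
print(translate(fragment))    # MAELTLGLEG
print(gc_fraction(fragment))  # GC fraction of this 30-base fragment only
```

The translated fragment matches the first ten residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field in the record.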