Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2091 |
Symbol | |
ID | 5733979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2609700 |
End bp | 2613137 |
Gene Length | 3438 bp |
Protein Length | 1145 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279232 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544859 |
Protein GI | 159898612 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCAAT TGTTGACCCA ATTACGGGCA GCCGATATTC GGATTTGGCT TGAAGGCGAG CGCTTATGCT ACGATGCCCC ACCAGGCGTA CTAAGTGAGC AACTATTAAG CCAATTACGC CAGCATAAGG CCGAACTATG CCAATTTTTG GCGCGAACTC AGCCCAATCA ATTTGCGCCG ATTCCAAGGC TGGAAGATGA TCAACCTGTG GCGCTAAGCT TTGCCCAGCA ACGGTTGTGG TTTTTGCATG AATTTGCCCC CACCAGCACG GCTTATCATC TTCCGGCGGC AATTGAACTC AATGGCGAGT TGGATCGTAC AGCCTTGCAC CAGAGTTTCG AGCAACTGAT CGAACGCCAT ACAATCTTAC GGAGCCATGT GCTGTGGCAG GATGGCCAGC CATTGCAACA GGTCGCCGCC AATTGGCAGT TACCTTGGGC TTTTTATGAT CTGCGTTCAG CCAATGATCC TGCGGCTGAA TGCTATCAGC AGTTGTTGGC AGCCTTTGAA GCACCCTTCG ATCTCGCGAT TGCGCCATCG CTGCGTTGCG TTTTAGTCTG CTTAGCAGAA AAACAGCATA TTCTTTTGGT CACACAGCAC CATTGGATTA GCGATGGCTG GTCGATTGGA ATTATGATCA ACGAGCTAGC CCACATCTAT CGAGCAACAT TGAACCAACA ACCGCATCAG CTACCAACCT TACCGGTGCA ATATCGTGAT TTGGCGGTCT GGCAGCGCCA ACAATTGCAG GGAGCAAAGT TAACGCAATT GTTGACGTAC TGGCGACAAC AATTGGCCGA TTTGCCCAAT TTGGCGTTGC CTTACGAAGC CCAAAAGCCC GCCCAAACGA TGTCAACCAG CATTCCAATC CAACTTGATC GCGCCTTGGT CGAACGCTTA ACCAACCTAA ACCAAACTCA TGGCACGACC ATGTTTATGA GCTTGTTGGC GGGCTTTTTT GCGCTGTTAA GCCGTTATAC GCAGCAACAC GATTTTGCGG TTGGTTCGCC GATTGCTGGG CGTTTGCAGC CTGAAGCTGA GCCTTTAATT GGTTGTTTTA TCAATAGTTT AGTGCTACGA GCTGATTTAA GTGGCCAGCC CAGCTTTCGT CAATTGTTGG AGCGGGTGCG CCAAACCAGC TTGGCGGCCT ATGCACACCA AGAATTGCCG TTTGAATTGC TAGTTGAGGC CTTGCAGCCT GAGCGCAAAC TCGATCAACA ACCCCTATTT CAAGCGCTAT TTGCTCTGCA AAATATGCCA ACTGGCCAGC TTGAAGCCCC AAATCTGGTA ATCAAACCCT ATGCATTTGC CCAACAAGCG CCTCAATTTC CGCTTAGTCT GATCTTGTTT GAGCAAGCGG GTCAGATTAC GGGCGAGCTG AATTATGATC CACAACAGCT ATCGCAGCAG TTTGCCAGCC AATTTGCCGC GCACTATGCC CAATTCTTGA CGCAGTTGCT GGTAGAACCA GATACGGCAA TCGCTCAGCT GAAATTATTG CAGCAACCTG AACAGGATAA GGTGCTCAAT CTACTAAATT CAAATCGGCA AGTTTACCCG ATCAACAATA GTTTAGTTGC ACGTTTTACC CAACAAGCGC TGGCAACGCC CCAAGCAATT GCCTTGAGTT ATGCCGAGCA AGCGATCAGC TACCAACAAT TGGCCGAAGC TGCCGATCAA TTGGTCTATG TGCTTTTGGC CCAAGGTGTA CAACCTGAAC AACCAATCGG CTTGTTGTGT GAGCGTTCGC CGCAATTAAT TATTGGCATT TTGGGAATTC TCAAGGCTGG CGCAGCCTAT CTGCCGCTCG ATCCGCAGTT GCCTACCAGC CGAATCGAAT GGATGCTGGC CGATGCCCAA GTTAATTTAA TTGTTACTCA AAACAGTCTG TTGCACAGTG TTAATTCACA AGCAACCACC ATTCTCAACC TTGATCAACT TCCAACCACC AAACTCACTC AATTACCAAC TATCCATCCC GATCAACTTG CCTATATCAT TTATACCTCT GGCTCAACTG GGCAACCCAA AGGCACGTTG TTGAGCCATG CCAACGTGTT GCGCTTATTC GAGGCGACAG TTGCTACGAT TAAACCTAGT GCCAATGATG TTTGGAGCTT ATTCCATTCA TATGCATTCG ATTTTTCGGT TTGGGAAATT TGGGGGGCAT TGCTGTATGG TGGACGTTTG GTGGTTGTGC CAAGTACCAC AACTCGCTCA CCCGAAGCGT TCAGCCAATT GTTGGCAGAC GAATCGATAA CTGTGCTCAA TCAAACACCC TCGGCGTTTC GTCAACTATT ACCCCAACTC ACGCCAGCGG TGGCCGCGAA TTTGGCGCTG CGCTTGATTA TCTTTGGTGG CGAGGCGCTT GATCTGGCCA GCCTCGCCGC TTGGTATCAA GCCTATCCTG CGCCCGCCCC GCAACTGCTG AATATGTATG GCATCACCGA AACTACGGTG CATGTGACTG AGCGTTGGTT GGAGCTTAAT GACTTGATCG AGGCAAAAGC CAGCCTAATC GGCTTGCCAA TCGCCGACTT AACCATGTAT CTGCTCGATC AGTACGGCCA ACTTGTGCCT CAAGGTGCAG TCGGCGAGAT CTATGTGGGC GGGGCAGGTT TGGCGCGGGG CTATCTCAAG CAAGCGGCGT TGACCGCCCA ACGCTTTGTG CCTGATCCAT GGTCAAGCAC TGGGGCACGG CTCTACCGCA GCGGCGATTT GGCTCGGATT AATCAATTTG GCGAGCTAGA ATACCTTGGG CGTAGTGATC AGCAAGTCAA ATTGCGCGGC TTTCGGATTG AGCTAGGCGA ACTTGAACAA GCGATTTGCC GCCAAGCAGG AGTTGCTGAT TGCTGGGCGT TTGTGCAAAA ACTTGATCAA CACGAGCGCC TAGTAGTTTG GGTTGTGCCA AATCAGCCAG CGCTTAGCGT TGAACAATTG CGCCAAGCGC TGGCTCTCGA GCTGCCCCAT TATCTGCAAC CCAATCTCTG GCTGCTGTGC GAACACTTGC CGCTTACCAA CAACGGCAAA CGCGATTATG CTCATTTATT AGCGCAACTT GACGTAACGC TTGAGCAAGC ATCAAGTATT GCCCCCAATA ATCCAATTCA AGCGCTGATT GCAGCAATTT GGCTTGAGAT TTTGCAGCAA CCAATTGCCA GCATTGATCA AAACTTTTTC GAGGTTGGCG GTAATTCGTT GAATGCGGTT TTGGTGCTAA CCCGCATTCG CGAATTATTA CGGGTCAATC TGAGTTTGCG CAGCCTATTT GCCCAACCAA CCATTCGCGG TGTTGAACAA GCCTTAGTAC AGCAAGAACC CAAGCCTGGC CAAACTGCCA AAATTGCCGA GTTGGTGCAA AAACTCCAGC AAGCTACACC TGAGCAGCGC CAACAAGCCT TAGCATCCAA GCGCCAAGAA CGGACGGTCG GCGAATGA
|
Protein sequence | MWQLLTQLRA ADIRIWLEGE RLCYDAPPGV LSEQLLSQLR QHKAELCQFL ARTQPNQFAP IPRLEDDQPV ALSFAQQRLW FLHEFAPTST AYHLPAAIEL NGELDRTALH QSFEQLIERH TILRSHVLWQ DGQPLQQVAA NWQLPWAFYD LRSANDPAAE CYQQLLAAFE APFDLAIAPS LRCVLVCLAE KQHILLVTQH HWISDGWSIG IMINELAHIY RATLNQQPHQ LPTLPVQYRD LAVWQRQQLQ GAKLTQLLTY WRQQLADLPN LALPYEAQKP AQTMSTSIPI QLDRALVERL TNLNQTHGTT MFMSLLAGFF ALLSRYTQQH DFAVGSPIAG RLQPEAEPLI GCFINSLVLR ADLSGQPSFR QLLERVRQTS LAAYAHQELP FELLVEALQP ERKLDQQPLF QALFALQNMP TGQLEAPNLV IKPYAFAQQA PQFPLSLILF EQAGQITGEL NYDPQQLSQQ FASQFAAHYA QFLTQLLVEP DTAIAQLKLL QQPEQDKVLN LLNSNRQVYP INNSLVARFT QQALATPQAI ALSYAEQAIS YQQLAEAADQ LVYVLLAQGV QPEQPIGLLC ERSPQLIIGI LGILKAGAAY LPLDPQLPTS RIEWMLADAQ VNLIVTQNSL LHSVNSQATT ILNLDQLPTT KLTQLPTIHP DQLAYIIYTS GSTGQPKGTL LSHANVLRLF EATVATIKPS ANDVWSLFHS YAFDFSVWEI WGALLYGGRL VVVPSTTTRS PEAFSQLLAD ESITVLNQTP SAFRQLLPQL TPAVAANLAL RLIIFGGEAL DLASLAAWYQ AYPAPAPQLL NMYGITETTV HVTERWLELN DLIEAKASLI GLPIADLTMY LLDQYGQLVP QGAVGEIYVG GAGLARGYLK QAALTAQRFV PDPWSSTGAR LYRSGDLARI NQFGELEYLG RSDQQVKLRG FRIELGELEQ AICRQAGVAD CWAFVQKLDQ HERLVVWVVP NQPALSVEQL RQALALELPH YLQPNLWLLC EHLPLTNNGK RDYAHLLAQL DVTLEQASSI APNNPIQALI AAIWLEILQQ PIASIDQNFF EVGGNSLNAV LVLTRIRELL RVNLSLRSLF AQPTIRGVEQ ALVQQEPKPG QTAKIAELVQ KLQQATPEQR QQALASKRQE RTVGE
|
| |