Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2413 |
Symbol | |
ID | 5734294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3081398 |
End bp | 3086107 |
Gene Length | 4710 bp |
Protein Length | 1569 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279554 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001545181 |
Protein GI | 159898934 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATT TTGCTGCAAA AATTGCTGCA TTACCACCAG AAAAACAAGC GCTGTTGATT CGCCGACTGC AACAAGCAGC CAACGAGCCA CCAGCCTTGG TTGCCCAACC ACGGACAACC AATACGCTGC CGTTATCGTT TGCCCAAGAG CGCCAATGGG TGTTGTATCA GTGGGACCCA ACCAGCCCGC TGTATAACAT CGTTTATGGG GTGCGCTACC GTGGCAAGCT TGATATTGCC GCCTTGCAAG CAGGCTTCAA CACCATTGCC CAGCGCCATG AGGTGTTGCG CACGACCTTT TTGCTGGTTG ATACAGTGCC CCATCAGCAG ATCCATGCTG AGCTTAAGCC AGGGTTTAGC GTGGTCGATT TACGCGATCT GGCCGAAACT GAGCGCGATA CCGCCATCCA AGCGCAGATT CAAGCCGAAA CCCAGCTGCC GTTTGATTTG CAAACTGGGC CATTATTGCG CGTGCTGTTG CTGCACATCC GCGATCACGA ATATATCAAA TTGGTCAGTG TGCATCATAG TGTTTTTGAC GGCTGGTCAG CTGGAGTGAT CATCTCCGAA TTGAATCATT TGCTGAATGC GGCCTATGCT GGCGAGCCAA GCAGCTTGCC AGCCTTGCCA ATTCAATACG CCGATTATGC GGTTTGGCAA CGCAATTGGC TACAAGGTAA GGTGTTGGAG CAGCAACTGC AATATTGGAA AGAACAACTC GCTGGCGAAT TGCCAATTCT GCAATTACCG ACTGATCGGC CTTATCCACC AGTTGAATCG TCGCGTGGCG CACATTATCG GGTGCAATTG AGTGCCGATT TGGTTGAACG ATTGGTCAAT TGGAGTCGCA GCGAGGGCTA TACACTCAAC ATTATTTTGC TCACCATCTG GAAAACCCTG CTCTTTCGTT ACACCAACCA AACCGATTTG CTGGTCGGCA TGCCGATCGC CAATCGCCAT TACAACGATT TGCAAGCCTT AATCGGCTAT TTCGTCAATA CCTTGGTGAT TCGCACCAAA GCCGCTGGCG ACCTCAGTTT CCGCAGCTTT TTGGATCAAG TGCGAGCGGC GAATTTGGCA GCGCAAGAGC ATCAAGATTT GCCATTTGAG CAATTGGTCG AGGCTTTGCA GCCTGATCGT AATTTGGCAC ATACGCCAAT TTTTCAAAGC TTGTTTGTGT TTCAGCGCGA TACGGTCAAT AGTTTTCAAA TGCCTGAGCT TTCGGTTGAG CCATTGCCAA TTGAAACTGG TACGGCCAAG TTTGCCCTTA GCCTTGAAGC CGTCTCGCTT GATCAGCAAA TCAAGCTCAA TTTCGAGTAT AAAACTGATC TGTTTGATCC AGCGACGATT GAACGCTTGG CCCAACATTA CCAAAATTTA CTCGAAGCAG TGCTTGCTTC GCCCGATTTG GAGCTTTCGC GCTTACCCAT GTTGGGCAAG GCTGAGCTAG CCCAATTGTT GCCAACGCCA ACAGCGCTCG AAGCAAACTT GTTGCCCTTG CACCAACGCT TTGAGCAGCA GGTGCAGGCC AATCCGCAAG CGATTGCGGT ACGCTTTGAG CAAAGCCAAC TAAGCTACGC CGAGCTAAAT AGCCGCGCCA ACCAACTGGC CCATCAACTC AAAACGCTCG ACGTTGGCCC AGATACCTTG GTTGGGCTAT GTGTTGAGCC ATCGCTCGAT ACAATCATTG GAATTTTGGC AATTCTCAAG GCTGGCGGCG CATATCTGCC GATCGATCCT GCACATCCTC AAGAGCGAAT TGTTTGGTTG TTGGCTGATG CCAAGGTTGG CTTGGTGGTT ACCCAAGCTC GTTGTGTCAA CAAATTACCC CAAGCTGGGT TGCAATTGAT TGTGCTTGAT GCCGTCGATT CAGCGCTGAG CAATCAGCCA ACCAGTAATC TGCCAGCTAG CGCCCAGCTC GATGACTTGG CCTATATGAT TTACACCTCA GGCTCGACTG GCACGCCCAA AGGTGCATTG ATCACCCATC GCAACGTGGC GCGACTGTTT AGTTCAACCG AAGCATGGTT CAACTTCAAC AACCACGATG TCTGGAGTTT GTTCCATTCG TTCGCCTTCG ATTTCTCAGT TTGGGAAATT TGGGGAGCCT TGCTGTATGG TGGTCGCGTG GTCGTCGTGC CATTTATGAC CACCCGCAAC CCCGCTGGCT TCTATCAATT GCTGGTTGAT GAAGGCGTGA CGGTGCTCAA CCAAACGCCC TCGGCCTTCC GCCAATTGAT CATCAGCGAT GCTGAACATG ACTTGCCATC GCGCCTAGCC TTGCGTTATG TCATCTTCGG TGGCGAAGCA CTGAACGTTG GGGCATTGCA ACCATGGTTC GAGCGCCATG GCGATCTACG CCCGCAGTTA GTCAATATGT ATGGCATTAC TGAAACCACC GTCCATGTGA CCTACCGACC GCTGAGCATG CACGATGTTG AAAATCCCCA AAGCAGCCCG ATTGGCACGG CAATTCCCGA TTTGGATCTG TATGTGCTTG ATGATCATTG TTTGCCAGTG CCATTGGGAA TTACTGGCGA ACTATATGTG GGCGGCGCGG GCTTGGCTCG CGGCTATTGG AATCGACCCG AACTAACCAA CGAGCGCTTT ATCAAGCATC CCTTTGCTGA AACAGGCCGC CTCTATAAAA CTGGCGATTT AGTGCGGCGC TTGGCTAATA ATGAGATCGA ATATCTGGGG CGGCGTGACA ACCAAGTCAA AATTCGCGGC TTCCGAATTG AGCTAGGCGA AATTCAGGCC ACCCTGATGA GCCACCCCGC GATCACCGAT GCGATTGTGG CGGTCAATAC AATCTCAGCC GATGATCAGC GCTTGGTGGC CTATTTGGTA ACCCAGCCTA ATCAAGTGCC ACGCTTTAGC CAATTGCGCA CATTTCTCAA GCAACGCCTG CCAGAATATA TGGTGCCAAC CTCATTCATT ATGCTTGAGC GTATTCCGCT GACTGCCAAC GGCAAAATCG ATTATCGCGC TTTGCCCAGC CAACAACAGA CCAAACAACT TGAGCGCAGC CAACCGATCG CGGCTCCAAC CAGTGTTACC GAGCAATCGT TGATCGCGAT TTGGAGCAAT TTGCTGGGAG TAACCCAAGT TGGCATTCAG GATAATTTCT TTGATTTGGG TGGGCACTCA TTGTTGGCAA CCCAAGTTAT TTCACGGGTG CGCGAGGTGT TTAATGTTAG CCTCAACTTG CGCGATTTCT TCCTCAACCC AACCATCAAG GGCTTAGCCA GCGTTATCGA GCAGGCCAAC CAAAAACCAG AGCAAACGCC AATTATCGCG ATTCACAAAG CTGATCCGCA GGCACGTTTG CCACTTTCCT ATGCCCAACA GCGCATTTGG TTCTTGGCTC AAATCGATCC AACCAGCAGT TTTTATACTG TCCCTGTGGC CTTAGAAATT ACTGGCCCCT TGCAAGTTGC CGCTTTAGAA CGCACCTTGA GCGAGATTGT GCAGCGCCAT CATGGCTTGC GCACCCTGTT TGTGAGCCAC GAAGGCCAAA CGCTTCAGCA AGTCCAAGCA GCTCAGCCAA TTCAATTGCC AATTCATGAT CTCAGTGATT TAGCTGAGGC AGCCCAAGAA ACAGCGATCA ACCAATTATT GGAGCAGGAG ATTAATCAAC CCTTCCAGCT TGATCGCGAT CAACTGTTGC GCGGGCGCTT GCTCAAACTA GCTCCAACCA AGCATGTGCT CGGCCTAAGC ATTCATCATA TTGCCTTTGA TGATTGGTCG CAGGGCATTT TGTTTGATGA AATGACCAAG CTCTACCAGG CCTTTGCCAA CAATCAAGCC TCGCCATTGC CCGAGCTAGC TTTGCAATAC CCTGATTTTG CCGCTTGGCA ACGTCAATGG TTGCAAGGCG AAGTTTTACA AAACCAATTG AATTACTGGA AACAGCAATT GAGTGGCAAA TTACCATTGC TCGAAATGCC GCTGGATTAC CCACGGCCTA GCGTACAAAG TTTCAATGGC GCACACGAAA TGCTGAGCAT CGAGCCAGAA CTGTATCGCA AACTTAATCA ATTGGCGCGA GATAACGAAG CAACCATGTT TATGTTGCTG ATGGCAGCAT GGGCGATTTT GCTCAATCGT TATAGCAATC AAACTGATAT TGTTATCGGC ACGCCAATCG CCAACCGCAA CCGCCAAGAG CTAGAGGGCA TCATCGGGTT CTTTGCGAAT ACCCTGGCCA TTCGCGCCGA TTTAACTGGC GATCCAACGC TTCAACAAGT GGTGCAACGG ATTCGCGAAA CCGCCCTAAA TGGCTATAGC CACCAAGATT TACCGTTCGA TTTGTTGGTC AGCGAGCTTT TGCCTGAACG ACAAGCCAAT CGCTCACCAA TTTTCCAAGC TATGTTGGTG CTGCAAAACG CCTCACAGAG CAGCAACCTC GATTTGGCCG AGGTGCAGAT TGCGCCGCGC AGCGTTGAAA CCAACTCGGC TAAGTTTGAG CTAAGTTTGG TGCTCTACGA CAACGGCTCA AGCATCGACG CTTGGATTGA ATACAACACC GACCTGTTTA AACCCACCAC GATTGGCCGC ATGGTCGAGC AGTTACAACA ATTGCTCACC AGCATGACCA CCGATCCACA ACAACGCATC AGCCAAGTTT CGTTGGTCAG TTCGGCAGAA AAAGAACGGC TACTCGGCGG CTGGTCGCAA GGAAGCGACG ACGACTACGA TTTGTTCTAG
|
Protein sequence | MTDFAAKIAA LPPEKQALLI RRLQQAANEP PALVAQPRTT NTLPLSFAQE RQWVLYQWDP TSPLYNIVYG VRYRGKLDIA ALQAGFNTIA QRHEVLRTTF LLVDTVPHQQ IHAELKPGFS VVDLRDLAET ERDTAIQAQI QAETQLPFDL QTGPLLRVLL LHIRDHEYIK LVSVHHSVFD GWSAGVIISE LNHLLNAAYA GEPSSLPALP IQYADYAVWQ RNWLQGKVLE QQLQYWKEQL AGELPILQLP TDRPYPPVES SRGAHYRVQL SADLVERLVN WSRSEGYTLN IILLTIWKTL LFRYTNQTDL LVGMPIANRH YNDLQALIGY FVNTLVIRTK AAGDLSFRSF LDQVRAANLA AQEHQDLPFE QLVEALQPDR NLAHTPIFQS LFVFQRDTVN SFQMPELSVE PLPIETGTAK FALSLEAVSL DQQIKLNFEY KTDLFDPATI ERLAQHYQNL LEAVLASPDL ELSRLPMLGK AELAQLLPTP TALEANLLPL HQRFEQQVQA NPQAIAVRFE QSQLSYAELN SRANQLAHQL KTLDVGPDTL VGLCVEPSLD TIIGILAILK AGGAYLPIDP AHPQERIVWL LADAKVGLVV TQARCVNKLP QAGLQLIVLD AVDSALSNQP TSNLPASAQL DDLAYMIYTS GSTGTPKGAL ITHRNVARLF SSTEAWFNFN NHDVWSLFHS FAFDFSVWEI WGALLYGGRV VVVPFMTTRN PAGFYQLLVD EGVTVLNQTP SAFRQLIISD AEHDLPSRLA LRYVIFGGEA LNVGALQPWF ERHGDLRPQL VNMYGITETT VHVTYRPLSM HDVENPQSSP IGTAIPDLDL YVLDDHCLPV PLGITGELYV GGAGLARGYW NRPELTNERF IKHPFAETGR LYKTGDLVRR LANNEIEYLG RRDNQVKIRG FRIELGEIQA TLMSHPAITD AIVAVNTISA DDQRLVAYLV TQPNQVPRFS QLRTFLKQRL PEYMVPTSFI MLERIPLTAN GKIDYRALPS QQQTKQLERS QPIAAPTSVT EQSLIAIWSN LLGVTQVGIQ DNFFDLGGHS LLATQVISRV REVFNVSLNL RDFFLNPTIK GLASVIEQAN QKPEQTPIIA IHKADPQARL PLSYAQQRIW FLAQIDPTSS FYTVPVALEI TGPLQVAALE RTLSEIVQRH HGLRTLFVSH EGQTLQQVQA AQPIQLPIHD LSDLAEAAQE TAINQLLEQE INQPFQLDRD QLLRGRLLKL APTKHVLGLS IHHIAFDDWS QGILFDEMTK LYQAFANNQA SPLPELALQY PDFAAWQRQW LQGEVLQNQL NYWKQQLSGK LPLLEMPLDY PRPSVQSFNG AHEMLSIEPE LYRKLNQLAR DNEATMFMLL MAAWAILLNR YSNQTDIVIG TPIANRNRQE LEGIIGFFAN TLAIRADLTG DPTLQQVVQR IRETALNGYS HQDLPFDLLV SELLPERQAN RSPIFQAMLV LQNASQSSNL DLAEVQIAPR SVETNSAKFE LSLVLYDNGS SIDAWIEYNT DLFKPTTIGR MVEQLQQLLT SMTTDPQQRI SQVSLVSSAE KERLLGGWSQ GSDDDYDLF
|
| |