Gene Information |
Locus tag | Haur_2090 |
Symbol | |
ID | 5733978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2606263 |
End bp | 2609703 |
Gene Length | 3441 bp |
Protein Length | 1146 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279231 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001544858 |
Protein GI | 159898611 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
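The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2606263
end_bp = 2609703

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 3441 1146
```

Both results agree with the Gene Length (3441 bp) and Protein Length (1146 aa) fields.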
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTATT CAAATATTAC CCAACTCGTG ACCGCTCAAG CCAACCAAAC GCCTGCCGCT TGGGCAGTGC AAACGCCTAC AGGTTATGGA TTAACCTTTG CCGATCTTGA GCAACAATCT AGCCAAGCAG CGGCCTATTT GCAACATCTT GGTGTACAAC CAGCGAGTGT TGTGGGCATT TGTTTGCGCC GCACGCCACA GCTCATCGTG TGGATGCTGG CAATTCTCAA GGCTGGTGCG ACCTATCTGC CGCTTGATCC GGCCTATCCA ACCGCGCGGT TGCAATTTAT GCTGGCCGAT GCCAAGGCCT TGCTGGTCGT CAGCGAAACG TCATGCCAAG CAGCTTTACC CCTGAATACC ATTGAGTGGG TGTTGATTGA TCAGCCTTGG TCGAGGGAGT TGGCATGGCG CGAACCCTTC TATCATAGCG CAATCCCTGC TTATATCATC TATACCTCTG GCTCAACCGG CCAACCCAAA GGTGTGCTGA TTAGCCATGC CAATGCGCTC ACCTTTTTAG CATGGGCTGA AACCACGTTT AGCGTAGCCG AACGCGCTGG AATTTTAGCG GCAACCTCGA TCAACTTTGA TCTTTCGATC TTCGAGATTT TTCTGCCATT GATTAGTGGC GGTACGTTGG TGCTGGTGGA AAATCTGCTT GATCCAGCGC TGTTTCACTC GCAGCATCCG ATTTGTTTGA TCAATAGCGT GCCGTCGGCG GTGCAAACGT TGTTGCAACA TACGGCACTT CCATCTAGCG TGCTCACGGT GAATCTTGCC GGCGAGCCGC TGAGCTTGCG ATTAGCGCAG CAACTCTATC AACAGCCAAA CATCCAGCGC GTATTCAATT TATATGGGCC AACCGAGGCC ACGACCTATG CCACCTACCA GCTCGTTGAG CGCACTGCCA GCCGGCCGCC AGCGATTGGT CAGCCGCTCA CTGGCACGAC CTGCGTTATC CTCGATGCGC ACTATCACCC TGTTGCCGCT AAGGATGTTG GCGAATTGTT TATTGCTGGG CTGGGCGTAG CGCAAGGCTA TTTGCAACGC CCCGATTTAA CTGCCGAACG TTTTTTGCCC AATCCGTGGG CTACCACGCC TGGCGAACGA ATGTATAAAA CTGGCGATTT GGCTCATTGG AACGCGGCAA ACGAGCTTTG TTACCTAGGA CGTAACGATC AGCAGGTCAA AATTCGTGGC TTTCGGATCG AGCTTGGTGA GATTGAGGCC CAGATTCTGC GCTTAGCACC ATTGCAAGCG GTTGTGGTTC AGCCAATTAC GCTGGTGGCT GATGATCCGC AGTTGACCGC CTATCTGGTT GCTAATCAGC CGATCGATTG CGAAGCCTTA CGCGCTAGCT TAGCCCACCA TGTGCCAAGC TATATGCTAC CAAGTTTTTG GGTACAGCTG GCCGAACTGC CATTAACACC CAATGGCAAG CTTGATCGAG CGGCCTTGCC ATGCCCTGAT GCCCCAATTA AACAACCATT GCAAAGCTCA ACTGAACAGC GTTTGGCGAT AATCTGGCGC GAAATCTTGG GCGTGGAACA GCTTGGGCGT GAGAGTAATT TTTTGCAGCT TGGTGGTCAT TCGCTCAATG TGATGCAAGT GCTCAAACGA ATTGAGCAGA CGTGGCAGCT TCAGCTTTCG ATTACGCGCT TGTTTGAGCA ACCAACCTTG GCAGCTTGGG CGCGGTTAAT CGATCAGCAG CAGCAAGCCT TTGCTCAGGC TGAACCTCAA TTCTATCAGC GCACTACGCA ATTGCATCAG CTTTCGTTTG GCCAACAACG CCTGTGGTTT 
GCCGAGCAAT TACACCCAAA CACCGCCTAC AACGTCATCC ACGCATGGCG CATCGATGCT CTGCTTGATG CGGTTGCGCT TGAACAAAGC TGGCTAAGGC TGATCGAACG TCATGAAATG TTACGCAGCA GCATTCAGCT CATCGCGGGT ATTCCACAAC AAACGATCAT GCTCAAGCCA GTTTGGCAGC TCCAATCTGC GCCGCAGGCA AGCTTAGAAT ACTTGTTAAG GTTGCTTGAT CGGCCATTCG ATTTGGCGCA AGCCCCATTG TTACGGGTTG GCTTGGCGCA ACACCACGAT CACGCCATCA TGCTGGTAGT TATCCACCAT AGCATTATTG ATGCTTGGTC GTTGGGCGTG CTGTGGGCTG AATTAAGTCA GCTGTATGCA AGCTTCTTCG AAAACCAACC AATTCAGCTG CCAAGCCAAG CCTACGATTA CCTTGATTTT GTGGCTTGGC AGCGCCAGCA GCTTGATTCG GCGTGCTTAG CCCAATTGCA AACCTACTGG CAAACCCAGC TTGCCCAGCT TGATCCGCTC CCAGCTTTGG CGACCGATTA CCCCCGTTCG ACGCACATGC AGGGCTTGGG CATCAGCCAA ACCTATCAAC TTGATCAGCA GGTTATCCAA GCATTACAAG GCTTGGCCAA CGCCAATAAC GCTAGTTTGT TTATGCTGTT ACTGGCGGGA TGGGCGAGCG TGCTCTATCA ACGCACTCAG CGCAGCGATC TGCTGATTGG CACGCTCAGC GCTGGCCGTG AGCATGCAGC GTTTGAGCGT TGCGTTGGCT TTTTTATTAA TATCTTGCCC TTGCGCCTGC ATTGCGCCGC CGAGCAAACG TGGCTTGATC TATTGCAGCA AACCCGCATG GTTGCCTTAC AAGCATACCA ACACCAAGCC TTGCCATTCG AGCAGATTGT GGCCAACGTG GCTCATGAGC GCAACAACCA ACCGCAGATT CCGCTAATTC AAAGTTTGTT GGTGTTGCAA AACGCGCCCA GTCAGCCCTT AGTTTTAGGT GCGCCAGCCC AAGCCCTAGC TACGCCAATT CAGGCCAGCA AAACCGATTT GGTGCTGTTG GTGCAACCCG CTGCAACTGG CTATCAACTG ACCCTAGAAT ATGCGAGCGA ATTGTTCGTC GCTGAATCGA TTGCAGCGTT GGCCTCCGAT TTCCAAGCGG TTTTAGGCCA AATGGCGCAG CATCCTACCA GCACACTTAG CGCTGTTCAG TTGGCTGGGC ATTGGACGGC GGAGCACTAT TCCAACAAAT TACCCACGCT TCAGCCAATG GCCGCCCCGC CGCAAACAGC GCTAGAGCAA ACCCTTGCCG ATATGTGGCA AGAGGTCTTG GGCTTGTCAA TTGATAATAT TCATGCCGAT TTCTTCCGCA TGGGCGGCCA TTCACTCAAC GCCACCCAAG TTGTCTCGCG CATGCAACAG CTTTTACAGG TAACTACAAG TATTCGAATG TTGTTCGATT ATCCAACGAT TGCCCAATTA AGCCAGCATT TGCTGGCGAA TCAAGCTCAG GCAGAGCGAA TCAATAAAAT TGCCACCGCA CTGCAACAGA TCAAAACCAT GAGTGCCAGC ACCAAACAGG CCTTGCAACA AAAGGCCGCA GGAAGGATAA GCCAACCATG A
|
Protein sequence | MSYSNITQLV TAQANQTPAA WAVQTPTGYG LTFADLEQQS SQAAAYLQHL GVQPASVVGI CLRRTPQLIV WMLAILKAGA TYLPLDPAYP TARLQFMLAD AKALLVVSET SCQAALPLNT IEWVLIDQPW SRELAWREPF YHSAIPAYII YTSGSTGQPK GVLISHANAL TFLAWAETTF SVAERAGILA ATSINFDLSI FEIFLPLISG GTLVLVENLL DPALFHSQHP ICLINSVPSA VQTLLQHTAL PSSVLTVNLA GEPLSLRLAQ QLYQQPNIQR VFNLYGPTEA TTYATYQLVE RTASRPPAIG QPLTGTTCVI LDAHYHPVAA KDVGELFIAG LGVAQGYLQR PDLTAERFLP NPWATTPGER MYKTGDLAHW NAANELCYLG RNDQQVKIRG FRIELGEIEA QILRLAPLQA VVVQPITLVA DDPQLTAYLV ANQPIDCEAL RASLAHHVPS YMLPSFWVQL AELPLTPNGK LDRAALPCPD APIKQPLQSS TEQRLAIIWR EILGVEQLGR ESNFLQLGGH SLNVMQVLKR IEQTWQLQLS ITRLFEQPTL AAWARLIDQQ QQAFAQAEPQ FYQRTTQLHQ LSFGQQRLWF AEQLHPNTAY NVIHAWRIDA LLDAVALEQS WLRLIERHEM LRSSIQLIAG IPQQTIMLKP VWQLQSAPQA SLEYLLRLLD RPFDLAQAPL LRVGLAQHHD HAIMLVVIHH SIIDAWSLGV LWAELSQLYA SFFENQPIQL PSQAYDYLDF VAWQRQQLDS ACLAQLQTYW QTQLAQLDPL PALATDYPRS THMQGLGISQ TYQLDQQVIQ ALQGLANANN ASLFMLLLAG WASVLYQRTQ RSDLLIGTLS AGREHAAFER CVGFFINILP LRLHCAAEQT WLDLLQQTRM VALQAYQHQA LPFEQIVANV AHERNNQPQI PLIQSLLVLQ NAPSQPLVLG APAQALATPI QASKTDLVLL VQPAATGYQL TLEYASELFV AESIAALASD FQAVLGQMAQ HPTSTLSAVQ LAGHWTAEHY SNKLPTLQPM AAPPQTALEQ TLADMWQEVL GLSIDNIHAD FFRMGGHSLN ATQVVSRMQQ LLQVTTSIRM LFDYPTIAQL SQHLLANQAQ AERINKIATA LQQIKTMSAS TKQALQQKAA GRISQP
|
| |
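The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-") and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; table 11 uses
# the same assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAGCTATT CAAATATTAC CCAACTCGTG"))  # MSYSNITQLV
print(translate("AGCCAACCATGA"))                      # SQP*
```

The two outputs match the first ten residues and the last three residues of the protein sequence, with the final TGA codon acting as the stop.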