Gene Haur_2089 details


Gene Information

Locus tag: Haur_2089
Symbol:
ID: 5733977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2602175
End bp: 2606266
Gene Length: 4092 bp
Protein Length: 1363 aa
Translation table: 11
GC content: 52%
IMG OID: 641279230
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001544857
Protein GI: 159898610
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
        [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
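
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2602175
END_BP = 2606266
GENE_LENGTH_BP = 4092
PROTEIN_LENGTH_AA = 1363


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_bp = span_length(START_BP, END_BP)
computed_aa = coding_length_aa(computed_bp)
print(computed_bp == GENE_LENGTH_BP)     # True
print(computed_aa == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions, 2606266 - 2602175 + 1 gives 4092 bp, and 4092 / 3 - 1 gives 1363 aa, which agrees with the listed Gene Length and Protein Length.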


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCAAA CCTTGCCCCA AGGCTTTTCG ACCGATGACC TTGAGTTGCT GGCCTATTTG 
CTTGAAGAAG CAGGCATTGA TCATGCCGTG CCAAACCAAA TTCAGCCTCG TCCAGCCAAC
CAGCCAGTTC CACTGTCGTT TGCCCAAGAA CGCTTGTGGT TTATCGACCA GCTAGAACCT
GGCAATCCAG CCTATAATAT TTTGTTTGCG GTGCAGATTG ACGGCCCGTT GCATGTCGCA
TACTTGCAGC AGAGCTTTGA TGCGGTGATT GCCCGCCACG AGAGCCTGCG TACCAGCTTT
CCAGTGCTCA ACGATCAGCC GATTCAGGCG ATTGCTGAAA AACATGATTT TGACTTAACA
ATCGTCGATC TCCGCTATTT AGCAGCAATT GAACAAGCAT CAACAATTGA ACAACTATCA
ATTGAACAAC GCTCAATTAT CGAACAGCAA TTACTGATCG ATAGTGCCCA TCGTTTTGAT
TTGGCGCAAG GCCCATTGCT GTATGGTCGT TTGCTCTGGC TGGCCGAGCA GCAATATGTG
CTGATTCTCA ACTTGCACCA TGCGATTTTT GATGGCTGGT CGTTGGCAAT TTTTATTGAG
GAATTGCGCC ATTGCTATAG CGCCTTGCTT GCAGGTCAAG CGCTCGATTT AACCCCAGCA
GCGTTGCAAT ATGCCGATTT TAGTTATTGG CAGCGTGAAT ATTTGCAAGG CGAGGTTTTG
GCCGCGCAAT TAGCCTTTTG GCAAAACCAA TTTGCTGGAC GTTTGCCCAC CTTGGCCTTA
CCAACCGATC GACCACGCCC AAAACACGAG ACTGGTCGTG GCGCAGCCTT GCCATTTCGT
GTTGATCAGG TATTAACGGA GCAGTTGCAA CACCTCGCCC AAAACGAACA TGCCACGATG
TTTATGCTGT TATTGGCGGC ATTTCAGCTG ATGCTGGCTC GCTATAGCCA ACAGCAAGAG
TTTGTGGTTG GCTCGCCAAT TGCCAATCGT GATCGGGTTG AAATTGAGCA TTTGATCGGC
TTTTTCGTCA ATATGCTGCT GCTGCGTTGT GATGTTCAGC CGCAGCTGAG CTTTCGCGAA
TTTCTGGCGC AAGTGCGCGA AACGACGCTC GAAGCCTATG CCCACCAAGA TTTGCCCTTT
GAGCAGTTGG TCGAGGTGCT TCAGCCCGAC CGTGGCGCAG GCTATGGTTC GTTGTTTCAG
GTGATGTTTG TGCTGCAAAA TACGCCTAAG GTCAACTATG AGATCGCCGA TTTGCAGCTG
AGCTTTCTCG ATACCGAGGC GCATAGTACC AAATATGACC TGACCATGAC CTTGACTGAA
ACCGCAACTG GGCTGGAAGG TTGGTTTGAA TACAACACCG ATTTGTACGA TCAGGCCACG
ATTCAGCGCA TGCTTGGGCA TTATCAACAA GTATTACGGG TGGTTGGCGA AGCGCCTGAT
CAAGCACTTA ATGCGATTAG CGTATTTGAT GAGCAAGCCC AAACGGCTTT ATTCAAGCTA
AGCAATCAAA CTCAGCACGA TTTTGGCCCT GCTGATTTCT TAGAACGCTT TGCCAATCAA
GTGGCAGCGA CACCCAACGC GATTGCGGTG CGCGATGCCC ATCAGCAGTA TTCGTATCAG
GCTTTGCAGC AACGAGCTAT GGCTTTGGCG GCCCAACTGC AACAGCATGG CGTTAGGCAA
GAAACCCTCG TGCCAATTTT GTTGCCGCGT ACCAGCGATT TTGTGGTTGC GGTTTTGGGT
GTTTTTTACG CTGGGGCAGC CTATTTACCG CTTGACCCTG CGTGGCCAGC CCAGCGTAGC
GCCCAGATTT TGCAGGGATT GGCGATTCCT GCCTTGATTT GCGAACCCGA TTTAGCCCGT
TGGTTTGCTA AGCATGTTCA GCCGTTGTTT AGGCTTCACA ATCAGCCGCA GTTAATCGAA
CAATGGAATG ATGCAGCAAC CAAGCTTGTT GTAAGCCAAA CCCATCCGCA GCAATTGGCC
TATACCCTGT TTACCTCTGG CTCAACTGGC ACACCCAAAG GAGTGATGAT CGATCAGGCT
GGGATGCTCA ACCATCTGTT GGTGATGAAT CAGGTGCTGG AAATCCAAGC CCATGATGTG
GTGGCCCAAA CCGCTTCGCA ATGCTTCGAT ATCTCAGTGT GGCAGATGCT GTCGGGCTTG
TTGGTTGGCG CAACGGTGGC AATCATTGAT GATCAGACGA TGCGTGACCC GTTGGCCTTA
GCCCAAACTC TGGCTGAACA GCAAGTCACA ACTTTCGAGC CGGTGCCAAG CCTGTTGCAA
GCCTTGCTTG AAACATTGCA AACCCCTGCT GAACAAGCCT TATTGCACCG CTTGCGCTGG
GTTTTGCCAA CCGGCGAGGC CTTGCAACCA GTTCAGGCCC GTCAATGGTT TGCCACCTAT
CCGCAGATTC CGTTGCTGAA TGCGTATGGG CCTGCCGAAT GTGCCGACGA TGTAACCCTG
CAACGGCTTG ATTCTGCTCC GACCGAAGGC CATAGCACCA TGCCAATCGG CAAGCCTGTC
GCCAATATGC AGGTGTTTGT GCTTGATCCA AACTGGCAAT TGTTGCCATT GGGCGCAGTC
GGCGAATTGT ATATCGGTGG AGTTGGGGTT GGTCGGGGCT ATTTGAATGA TCCAGCCCGC
ACTGCCAGCG CCTTCGTACC CAACCCATTT GCTGATAATG GCAGTCGGCT CTATCGTACT
GGCGATTTGG TGCGCCAAAC TGCTGATGGG GCTTTGCACT TCATTGGTCG CGCTGATCAG
CAAGTTAAAG TGCGCGGCTA TCGGATTGAA CTAGGCGAGA TCGAGGCGGT CTTGGCGGAA
TTGAGCTGGT TGCGCGAGGC GGCGGTGCAC CCTTGGCAGC AACAATTAGT TGCCTATCTG
GTTCCGGTTT CCGATACTCC TGATTTGATC AGCCTTGTGC AGCCTGCGCT CCAACAGCGA
TTGCCCAGCT ATATGCTGCC AAACCAATAT CTGGTTTTAG ATCAATTGCC GCGTAACCGT
AATGGCAAGC TTGATCGTCA ACAACTGCCA GCGCCGAATC CTGCCAACCT TGGCTTTCAA
ACACCGCTGG TTGTGCCACG TACCCAGCAC GAGGCCGAAC TCGCGGCGAT TTGGGCTGAC
GTACTTCAAC TTGACGTAAT CAGTATTGAT GCCAATTTCT TCAGCAGCGG CGGCCACTCG
CTCTTAGCAA CGCGGGTGAT GCTACGCACA CGCCAGCATT ATGGCCGTGA TTTACCATTG
CGCATGATTT TTGAAGCACC AACTATTCGT GAATTTGCTG CCTTATTGGA ACAGCAACAA
GCAGTGTCAG CGCTCCCTAA CCTACTCGTG CCTATTAAAC CCCAAGGCTC ACGCACGCCA
CTGGTGTGTG TCCATGCAAT TGCGGGCACG GTTGGCTGTT ATAGCGAACT AGCAATTGCA
CTTAATCCTG AACAACCCGT GTATGCCTTG CAAGCACCTG GCATTGACGG CGGCACAACC
CATGCCAAGG TTGAAGCAAT TGCCCAAGAC TATTGCCAAG CATTGCGACA ACTGCAACCT
CAGGGGCCAT ATCGTTTGGC TGGTTGGTCA TTTGGAGGTT TGGTTGCGCT TGAAATGGCG
CGACAACTGC AACTTGCTGG CGAGCAGGTA TCTATGCTCA GTTTGATCGA TAGCTTTCTG
GCCGAGCCAA CGCCTGATCC ATTCCCATTG ATTCAGAGCT TCGCCGCCGA TCTGTTTGCT
GATGTTGATC CATTGGCGGC GCAACAGATC GATTGGCCCG CGATTGTAGT GCTGCCTGCT
GAGCAACAAT TGGCGGCGCT CTATCAACAA GCCCAACAGG CTGGCCTAAT TGATCGTGAT
CTGCCGTTCG ATCTGGCACA ACGCTTATAC GCGGTTTTTA CCAGCCATGC CCACGCCATA
CAAGCCTATC AGCCCGCTAT ATATCTTGGG GAAGCTCAAT TGCTGCAAGC TCAAGCCAAC
CCAGCAGCAG CTCGCCGCTG GCAAGCAGTC ATTCCAAATC TGCACATTCA GGTAATCGGC
GGCGATCATA TCAGCATTCT GCGGCAGCCC CATGTGCATG GTTTAGCAAA TGCCATAGAG
CAGAGAACAT AG
 
Protein sequence
MTQTLPQGFS TDDLELLAYL LEEAGIDHAV PNQIQPRPAN QPVPLSFAQE RLWFIDQLEP 
GNPAYNILFA VQIDGPLHVA YLQQSFDAVI ARHESLRTSF PVLNDQPIQA IAEKHDFDLT
IVDLRYLAAI EQASTIEQLS IEQRSIIEQQ LLIDSAHRFD LAQGPLLYGR LLWLAEQQYV
LILNLHHAIF DGWSLAIFIE ELRHCYSALL AGQALDLTPA ALQYADFSYW QREYLQGEVL
AAQLAFWQNQ FAGRLPTLAL PTDRPRPKHE TGRGAALPFR VDQVLTEQLQ HLAQNEHATM
FMLLLAAFQL MLARYSQQQE FVVGSPIANR DRVEIEHLIG FFVNMLLLRC DVQPQLSFRE
FLAQVRETTL EAYAHQDLPF EQLVEVLQPD RGAGYGSLFQ VMFVLQNTPK VNYEIADLQL
SFLDTEAHST KYDLTMTLTE TATGLEGWFE YNTDLYDQAT IQRMLGHYQQ VLRVVGEAPD
QALNAISVFD EQAQTALFKL SNQTQHDFGP ADFLERFANQ VAATPNAIAV RDAHQQYSYQ
ALQQRAMALA AQLQQHGVRQ ETLVPILLPR TSDFVVAVLG VFYAGAAYLP LDPAWPAQRS
AQILQGLAIP ALICEPDLAR WFAKHVQPLF RLHNQPQLIE QWNDAATKLV VSQTHPQQLA
YTLFTSGSTG TPKGVMIDQA GMLNHLLVMN QVLEIQAHDV VAQTASQCFD ISVWQMLSGL
LVGATVAIID DQTMRDPLAL AQTLAEQQVT TFEPVPSLLQ ALLETLQTPA EQALLHRLRW
VLPTGEALQP VQARQWFATY PQIPLLNAYG PAECADDVTL QRLDSAPTEG HSTMPIGKPV
ANMQVFVLDP NWQLLPLGAV GELYIGGVGV GRGYLNDPAR TASAFVPNPF ADNGSRLYRT
GDLVRQTADG ALHFIGRADQ QVKVRGYRIE LGEIEAVLAE LSWLREAAVH PWQQQLVAYL
VPVSDTPDLI SLVQPALQQR LPSYMLPNQY LVLDQLPRNR NGKLDRQQLP APNPANLGFQ
TPLVVPRTQH EAELAAIWAD VLQLDVISID ANFFSSGGHS LLATRVMLRT RQHYGRDLPL
RMIFEAPTIR EFAALLEQQQ AVSALPNLLV PIKPQGSRTP LVCVHAIAGT VGCYSELAIA
LNPEQPVYAL QAPGIDGGTT HAKVEAIAQD YCQALRQLQP QGPYRLAGWS FGGLVALEMA
RQLQLAGEQV SMLSLIDSFL AEPTPDPFPL IQSFAADLFA DVDPLAAQQI DWPAIVVLPA
EQQLAALYQQ AQQAGLIDRD LPFDLAQRLY AVFTSHAHAI QAYQPAIYLG EAQLLQAQAN
PAAARRWQAV IPNLHIQVIG GDHISILRQP HVHGLANAIE QRT
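
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the database's own method. It assumes the 10-character spacing in the listings is purely visual, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in the set of permitted start codons, which this sketch does not handle). The names clean and translate are hypothetical helpers.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence as listed above.
first_line = ("ATGACCCAAA CCTTGCCCCA AGGCTTTTCG "
              "ACCGATGACC TTGAGTTGCT GGCCTATTTG")
print(translate(first_line))  # MTQTLPQGFSTDDLELLAYL
```

The output matches the first 20 residues of the protein sequence (MTQTLPQGFS TDDLELLAYL). The final 12 nucleotides of the gene (CAGAGAACAT AG) translate to QRT followed by the TAG stop codon, matching the end of the protein listing.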