Gene Haur_1805 details

Gene Information

Locus tag: Haur_1805
Symbol:
ID: 5733707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2094746
End bp: 2098987
Gene Length: 4242 bp
Protein Length: 1413 aa
Translation table: 11
GC content: 55%
IMG OID: 641278948
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001544576
Protein GI: 159898329
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
        [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
            [TIGR01746] thioester reductase domain
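
The coordinate and length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the protein length excludes a single terminal stop codon. Both conventions are assumptions here, not something the record states. A minimal Python sketch of that arithmetic follows; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2094746, 2098987)
print(length)                     # 4242
print(protein_length_aa(length))  # 1413
```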


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.323791
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAGC TGGCGCTATC GGCGGCACAA CATGGCATTT GGCTGGGTCA ACAGCTCGAT 
CCGAGTAGTC CGCTGTATAA CACAGCCGAA TATGTGGCTC TGCGCGGTGC GGTTGAGCTT
ACCAATTTGA CCGCTGCGAT TAAGCAGGCC TTTGCTGAAG CCGCAACCCT GCATTTGCGT
TTTGGGCTTG AGCATGATCA GCCGTATGCG CTGGTTGAGC CACAGCCAAT TAACTTGACC
GTGCATGATT TGCGTGATTT ACCCGATGCT GAAGTACGAG CCATAGCTTG GATGCAGCAC
GATTTGGGCA ATGTGGTCGA TCTAGCAACC GGCCCGTTGT TCAACACGGC GATTTTGCAA
CTTGCTGATG ATCAGGTGTG GTGGTATTTG CGAGCGCATC ACATTGCCTT AGATGGCTAT
AGTTTTGCTT TGCTCACCAA GCGCGTCGCC GAAATTTACT CGGCATTGCA AACCAAGGCC
ACGCTCAGCC CAAGCTTTGG CGAATTGGTC CCAGTAATTG CTGAAGATCA CGCCTATCAA
GTCTCAATTC AGGCCACGCT TGATCGCGAA TTTTGGGTTA ATCGCTTTGC AGATAATCCG
CAAGTGGTCA GCCTGACTCA GCAAACTGCC CTATCGCAGC CGCGCAGCAT TCGCTTGAGC
ACGGCTTTAG CGAACGACTT GATCGAGCGA TTGACTGCAA TCGCCAAGCC CAGCCGTAGC
ACATGGCCCG ATGCCTTGAT GGCGGTGGTG GCAGCCTATC TCGCCCGCTG GAACAACAGC
GAGAGCGTCG TTTTGGGCAT GCCCTTGATG AGCCGTTTGG GTTCGGTGGC GTTGCGTGTG
CCGTGTATGG CCATGAATAT TGTGCCGCTG TGTCTTAACG TTGCGGCTGA GCATGATTTG
GCCCAATTAA CTGCGGTAGT GGCAGCCGAA CGCAATGCCT TCCGCAAGCA TGGCCGCTAT
CGCTATGAGC AGTTGCGCCG CGATTTGGGC TTTGTTGGCG CTGGGCGGCG CTTGTTTGGG
CCTGTCGTCA ATATTATGCC CTTCGATCAT CCGCTGAATT TTGGTGATTG CCAAGCCCAG
AGCACTACAC TCACCGCTGG CCCAGTCGAA GATTTGGCCT TCAACGTGAT TTTGCGCGGC
AACCAACTCT ATCTGACGCT TGAGGCCAAT CCGGCTTGCT ACAGCCAAGC AGCGCTTGAA
TATCATTTTG CGGCAATTCA ACACCTATTA AATGGATGGC TGGCAAATCC AAGCATACCT
GTGGCTGAGC AGCAGGTTTT GCCAGCGCCG CTTGTGCTTG ATGGCGGCGA GTTGCGCTTG
CCGCTGACCA GCGTGATCGA GCGAATTTTA CATAATGCCA GGCAACAGCC GCACGCTTTG
GCTTTGGTTA CCGACACTGA GCAACTGAGC TATGCCGAGT TGGCGAGCCA CGTCCATGCG
TGGGCAGGCC AATTGGTGCA GCGCGGGGTA ACTGCTGGCA GCGTGGTTGG CGTGGCTTTG
CCGCGTAGCC GCGAGGCAAT TGTCGCAATT TTGGCGACGC TTTGTTGTGG GGCAGCCTAT
CTGCCACTTG ACCCGCAATG GCCGCAAAGC CGCTTGGCGA GTGTCGTGGC GCAAGCCCAA
CCAGTGCTAG TTTTGGCACA GCAAGCTTTT GATCTGCCCA ATTTGTTGTT GGTCGAGCAG
TTGAGCAAGG CCAATGCATG GTTCGAGGCA CGGGTCGATT TGGCCCAACC AGCCTACATC
ATGTATACCT CTGGCTCGAC TGGCGAGCCA AAAGGTGTGG TGATTAGCCA TCAAGCCTTG
GCGGGTTTTG TGCAGGCTGC GGCTGAGCGT TACGCAATCA GCGCCGCTGA TCGGGTGCTG
CAATTTGCCC CATTAGCTTT TGATGCTAGC GTTGAAGAAA TTTTTGTGAC GCTTTGCCAA
GGCGCGACCT TGGTGTTGCG CAACGATGCC ATGCTCGAAT CGTTACAGCG CTTTGTGGCC
GCCTGCCAAG CGCATGCGAT TAGTGTGCTC GATTTGCCAA CCGCCTTTTG GCATGAATTA
GCCGATAGTG TGGCCCAAGG CGCGGTGCAG TTGCCCGAAT GTTTGCGGGT GGTAATTATC
GGGGGCGAGG CGGCTCTGCC AGAGCGGGTT CAAGGCTGGT TGAACGTGGT TGCGCCGAAT
GTGCGTTTGT TCAACACCTA TGGCCCAACC GAGGCGACCG TGGTGGCGAC CGTGGCCGAA
TTGAGCGACC CCAACCAGCC AATTACGATT GGCCGACCAT TGGCTGGGGT GCAAGCAGCC
ATTTTGGGCA GCGACCAGCG GCCAATTTTT GCAGGCGATG TTGGCGATTT ATATCTGCTG
GGCAATGGCT TAGCAACTGG CTACTATCAA CGCCCCGATT TGGATGCGCT GAATTTTGGC
CAACTTAGCC AATTGCCGCA TGCGCCCCGC GCCTATCGCA CTGGCGATCG AGTGCGTTTG
TTCGCAGGTC AGTTACAGTT TGTGGGTCGC AGCGACGACG AATTCAAAAT TAGCGGCCAG
CGTGTTACGC CTGCCGAAAT TGAATCGGTC TTTTTGCGGC ATACAGCGGT GCGCGAAGTA
GCGGTGATTG GCCAGCAGCT TGGCAATGCG AGCAAGCGCT TGTTTGCAGC AGTCGTTGTC
AGCGATGCTA GTTTGAGCGT GGCTGAATTG CGCAATCACG CCAGCCAACA TCTGCCAGCG
GCGGTCATTC CGGCGGCGAT CACGATTGTT GAACGCTTGC CGCGCAGCAG TGCAGGCAAG
ATCGATCGCA AGGCTGTGGC GGCCTTAGCG CCAGCACCAG TGATGGTGAA TGCTGCGATC
AACGATACGC CAGCATTAAT TCGTCAAGTT TGGGCCGAAG TTTTGGGCCA AACTGAATTC
AACGATGAAG CCGATTTCTT TGGCTTGGGC GGTCAATCGC TGCAAACCAT TCAGGTTGCC
AATCGTTTGG GTATGGCCTT GGGTCGCGAA GTAACCGCCG CCTTGATCTT CCGCTATCCC
ACGATTGCGG GCTTGAGCCA AGCGCTCGAC CCTGAATTTG AGCAGGCTCC TGAGGCAGCG
CCGCAATTTT TGAGCGATGC CAATTTGCCT GAGCAGATTG TGCCCAAACA ACTGAATGCC
CAGCCACGGC CAATCCAAAC CGTGCTGTTG ACTGGGGCAA CTGGCTTTGT CGGGGCACAT
CTGTTGGCCG AATTGCTTAG CACAACCACC ACCAACGTGA TTTGTGTGGT GCGAGCTGGC
TCGAATGCGG CAGCCTTTGA GCGGTTGCAA GCAAGTTTGC AACACTACGA ATTGCCAAGC
GAGCAGCTTG CCGAGCAGGT TGAAGCTTGG CAGGGCGATT TGGCTCAGCC CCAATTTGGG
CTTGACGATC AGCAATGGCA AAGCTTGATC GAACGTTGCG ATCTGATTTA TCACAATGCG
GCGATGGTCA GCGTGGTTCG CGAGTATAGC AGCTTGCGGG CGGTCAACGT CAACGCCACC
AGCGAAATTT TGCGTTTGGC AGCGGTGCAT TGCACCCCAG TGCATTACGT TTCGACCTTG
GCAGTTTCAC CACCGCAAAG CGTGATGCAC CGCGTGCCCG AAGATTTTGT GGCGGCGCAT
GCTGGCCTAC GCGATGGCTA TAGCCAAAGC AAATGGGTTG CCGAACGCTT GCTCGAACAA
GCGGCTACCC GTGGTTTGCC GGTTGCTGTT TATCGTTTGG GGCGGGTAGT TGGCCCAAAT
CAAAGCAATT TCGTCAATCA AGATGATTTA TTTTGGCGGA TTGTCCAAGC AGGTGTGCCG
CGTGGCTTAT TGCCCAGCCT GCCTGTCGAG GAAATCTGGA ATCCAGTTGA TTTTGCTGCA
CAGACAATCG TGCAATTTAG CCATAATCAT CGCGGCGTGC GCGTGTATAA CCTTGCTCCC
AACGAACCAA TCAGCTTTGC CCAACTTTTG GGCTGGGTTG GCGAGTATGG CTATGCCGTG
CAATTGTGCA GGGTTGAGCA ATGGTATCAA GCGTTGCGTA ACGCCGACGA TGCGATGAGT
CAGGCGACTC TGACCTTCTT TGAGCGCCAG GCTGATGGTG GGGAACTGCC CAGCGCAATT
GGTACGATTG AAAACAAACG CTTGCTGCAA ACGCTTGCAG CGCATGGCAT TGCTGTGCCT
GTGATCGATC GCGAGCGCTT CTTTGGCTAT CTTGAGCGGT GTATTCGAAC GGGTTTATTG
CCCGCACCCG ATTTACGCCA GACTAGTATT GGTATTCGCT AA
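
The listing above is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any calculation. The Python sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field in the record. The helper names are hypothetical, and only the first line of the listing is used as input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases; returns 0.0 for an empty sequence."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGGCTAAGC TGGCGCTATC GGCGGCACAA CATGGCATTT GGCTGGGTCA ACAGCTCGAT"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_fraction(first_line):.1%}")
```

Passing the whole listing as one multi-line string works the same way, because `str.split()` with no arguments splits on any run of whitespace.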
 
Protein sequence
MAKLALSAAQ HGIWLGQQLD PSSPLYNTAE YVALRGAVEL TNLTAAIKQA FAEAATLHLR 
FGLEHDQPYA LVEPQPINLT VHDLRDLPDA EVRAIAWMQH DLGNVVDLAT GPLFNTAILQ
LADDQVWWYL RAHHIALDGY SFALLTKRVA EIYSALQTKA TLSPSFGELV PVIAEDHAYQ
VSIQATLDRE FWVNRFADNP QVVSLTQQTA LSQPRSIRLS TALANDLIER LTAIAKPSRS
TWPDALMAVV AAYLARWNNS ESVVLGMPLM SRLGSVALRV PCMAMNIVPL CLNVAAEHDL
AQLTAVVAAE RNAFRKHGRY RYEQLRRDLG FVGAGRRLFG PVVNIMPFDH PLNFGDCQAQ
STTLTAGPVE DLAFNVILRG NQLYLTLEAN PACYSQAALE YHFAAIQHLL NGWLANPSIP
VAEQQVLPAP LVLDGGELRL PLTSVIERIL HNARQQPHAL ALVTDTEQLS YAELASHVHA
WAGQLVQRGV TAGSVVGVAL PRSREAIVAI LATLCCGAAY LPLDPQWPQS RLASVVAQAQ
PVLVLAQQAF DLPNLLLVEQ LSKANAWFEA RVDLAQPAYI MYTSGSTGEP KGVVISHQAL
AGFVQAAAER YAISAADRVL QFAPLAFDAS VEEIFVTLCQ GATLVLRNDA MLESLQRFVA
ACQAHAISVL DLPTAFWHEL ADSVAQGAVQ LPECLRVVII GGEAALPERV QGWLNVVAPN
VRLFNTYGPT EATVVATVAE LSDPNQPITI GRPLAGVQAA ILGSDQRPIF AGDVGDLYLL
GNGLATGYYQ RPDLDALNFG QLSQLPHAPR AYRTGDRVRL FAGQLQFVGR SDDEFKISGQ
RVTPAEIESV FLRHTAVREV AVIGQQLGNA SKRLFAAVVV SDASLSVAEL RNHASQHLPA
AVIPAAITIV ERLPRSSAGK IDRKAVAALA PAPVMVNAAI NDTPALIRQV WAEVLGQTEF
NDEADFFGLG GQSLQTIQVA NRLGMALGRE VTAALIFRYP TIAGLSQALD PEFEQAPEAA
PQFLSDANLP EQIVPKQLNA QPRPIQTVLL TGATGFVGAH LLAELLSTTT TNVICVVRAG
SNAAAFERLQ ASLQHYELPS EQLAEQVEAW QGDLAQPQFG LDDQQWQSLI ERCDLIYHNA
AMVSVVREYS SLRAVNVNAT SEILRLAAVH CTPVHYVSTL AVSPPQSVMH RVPEDFVAAH
AGLRDGYSQS KWVAERLLEQ AATRGLPVAV YRLGRVVGPN QSNFVNQDDL FWRIVQAGVP
RGLLPSLPVE EIWNPVDFAA QTIVQFSHNH RGVRVYNLAP NEPISFAQLL GWVGEYGYAV
QLCRVEQWYQ ALRNADDAMS QATLTFFERQ ADGGELPSAI GTIENKRLLQ TLAAHGIAVP
VIDRERFFGY LERCIRTGLL PAPDLRQTSI GIR
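
The protein listing can be checked against the gene listing by translating the coding sequence. The sketch below builds the codon table from the standard NCBI amino-acid string. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons are permitted; since this gene begins with ATG, that difference is ignored here. The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI standard code; same assignments in table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Start and end of the gene sequence listing, copied as printed.
print(translate("ATGGCTAAGC TGGCGCTATC GGCGGCACAA"))  # MAKLALSAAQ
print(translate("GGTATTCGCT AA"))                     # GIR
```

The two outputs match the first ten and last three residues of the protein sequence listed above.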