Gene Haur_2091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2091 
Symbol 
ID5733979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2609700 
End bp2613137 
Gene Length3438 bp 
Protein Length1145 aa 
Translation table11 
GC content50% 
IMG OID641279232 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001544859 
Protein GI159898612 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGCAAT TGTTGACCCA ATTACGGGCA GCCGATATTC GGATTTGGCT TGAAGGCGAG 
CGCTTATGCT ACGATGCCCC ACCAGGCGTA CTAAGTGAGC AACTATTAAG CCAATTACGC
CAGCATAAGG CCGAACTATG CCAATTTTTG GCGCGAACTC AGCCCAATCA ATTTGCGCCG
ATTCCAAGGC TGGAAGATGA TCAACCTGTG GCGCTAAGCT TTGCCCAGCA ACGGTTGTGG
TTTTTGCATG AATTTGCCCC CACCAGCACG GCTTATCATC TTCCGGCGGC AATTGAACTC
AATGGCGAGT TGGATCGTAC AGCCTTGCAC CAGAGTTTCG AGCAACTGAT CGAACGCCAT
ACAATCTTAC GGAGCCATGT GCTGTGGCAG GATGGCCAGC CATTGCAACA GGTCGCCGCC
AATTGGCAGT TACCTTGGGC TTTTTATGAT CTGCGTTCAG CCAATGATCC TGCGGCTGAA
TGCTATCAGC AGTTGTTGGC AGCCTTTGAA GCACCCTTCG ATCTCGCGAT TGCGCCATCG
CTGCGTTGCG TTTTAGTCTG CTTAGCAGAA AAACAGCATA TTCTTTTGGT CACACAGCAC
CATTGGATTA GCGATGGCTG GTCGATTGGA ATTATGATCA ACGAGCTAGC CCACATCTAT
CGAGCAACAT TGAACCAACA ACCGCATCAG CTACCAACCT TACCGGTGCA ATATCGTGAT
TTGGCGGTCT GGCAGCGCCA ACAATTGCAG GGAGCAAAGT TAACGCAATT GTTGACGTAC
TGGCGACAAC AATTGGCCGA TTTGCCCAAT TTGGCGTTGC CTTACGAAGC CCAAAAGCCC
GCCCAAACGA TGTCAACCAG CATTCCAATC CAACTTGATC GCGCCTTGGT CGAACGCTTA
ACCAACCTAA ACCAAACTCA TGGCACGACC ATGTTTATGA GCTTGTTGGC GGGCTTTTTT
GCGCTGTTAA GCCGTTATAC GCAGCAACAC GATTTTGCGG TTGGTTCGCC GATTGCTGGG
CGTTTGCAGC CTGAAGCTGA GCCTTTAATT GGTTGTTTTA TCAATAGTTT AGTGCTACGA
GCTGATTTAA GTGGCCAGCC CAGCTTTCGT CAATTGTTGG AGCGGGTGCG CCAAACCAGC
TTGGCGGCCT ATGCACACCA AGAATTGCCG TTTGAATTGC TAGTTGAGGC CTTGCAGCCT
GAGCGCAAAC TCGATCAACA ACCCCTATTT CAAGCGCTAT TTGCTCTGCA AAATATGCCA
ACTGGCCAGC TTGAAGCCCC AAATCTGGTA ATCAAACCCT ATGCATTTGC CCAACAAGCG
CCTCAATTTC CGCTTAGTCT GATCTTGTTT GAGCAAGCGG GTCAGATTAC GGGCGAGCTG
AATTATGATC CACAACAGCT ATCGCAGCAG TTTGCCAGCC AATTTGCCGC GCACTATGCC
CAATTCTTGA CGCAGTTGCT GGTAGAACCA GATACGGCAA TCGCTCAGCT GAAATTATTG
CAGCAACCTG AACAGGATAA GGTGCTCAAT CTACTAAATT CAAATCGGCA AGTTTACCCG
ATCAACAATA GTTTAGTTGC ACGTTTTACC CAACAAGCGC TGGCAACGCC CCAAGCAATT
GCCTTGAGTT ATGCCGAGCA AGCGATCAGC TACCAACAAT TGGCCGAAGC TGCCGATCAA
TTGGTCTATG TGCTTTTGGC CCAAGGTGTA CAACCTGAAC AACCAATCGG CTTGTTGTGT
GAGCGTTCGC CGCAATTAAT TATTGGCATT TTGGGAATTC TCAAGGCTGG CGCAGCCTAT
CTGCCGCTCG ATCCGCAGTT GCCTACCAGC CGAATCGAAT GGATGCTGGC CGATGCCCAA
GTTAATTTAA TTGTTACTCA AAACAGTCTG TTGCACAGTG TTAATTCACA AGCAACCACC
ATTCTCAACC TTGATCAACT TCCAACCACC AAACTCACTC AATTACCAAC TATCCATCCC
GATCAACTTG CCTATATCAT TTATACCTCT GGCTCAACTG GGCAACCCAA AGGCACGTTG
TTGAGCCATG CCAACGTGTT GCGCTTATTC GAGGCGACAG TTGCTACGAT TAAACCTAGT
GCCAATGATG TTTGGAGCTT ATTCCATTCA TATGCATTCG ATTTTTCGGT TTGGGAAATT
TGGGGGGCAT TGCTGTATGG TGGACGTTTG GTGGTTGTGC CAAGTACCAC AACTCGCTCA
CCCGAAGCGT TCAGCCAATT GTTGGCAGAC GAATCGATAA CTGTGCTCAA TCAAACACCC
TCGGCGTTTC GTCAACTATT ACCCCAACTC ACGCCAGCGG TGGCCGCGAA TTTGGCGCTG
CGCTTGATTA TCTTTGGTGG CGAGGCGCTT GATCTGGCCA GCCTCGCCGC TTGGTATCAA
GCCTATCCTG CGCCCGCCCC GCAACTGCTG AATATGTATG GCATCACCGA AACTACGGTG
CATGTGACTG AGCGTTGGTT GGAGCTTAAT GACTTGATCG AGGCAAAAGC CAGCCTAATC
GGCTTGCCAA TCGCCGACTT AACCATGTAT CTGCTCGATC AGTACGGCCA ACTTGTGCCT
CAAGGTGCAG TCGGCGAGAT CTATGTGGGC GGGGCAGGTT TGGCGCGGGG CTATCTCAAG
CAAGCGGCGT TGACCGCCCA ACGCTTTGTG CCTGATCCAT GGTCAAGCAC TGGGGCACGG
CTCTACCGCA GCGGCGATTT GGCTCGGATT AATCAATTTG GCGAGCTAGA ATACCTTGGG
CGTAGTGATC AGCAAGTCAA ATTGCGCGGC TTTCGGATTG AGCTAGGCGA ACTTGAACAA
GCGATTTGCC GCCAAGCAGG AGTTGCTGAT TGCTGGGCGT TTGTGCAAAA ACTTGATCAA
CACGAGCGCC TAGTAGTTTG GGTTGTGCCA AATCAGCCAG CGCTTAGCGT TGAACAATTG
CGCCAAGCGC TGGCTCTCGA GCTGCCCCAT TATCTGCAAC CCAATCTCTG GCTGCTGTGC
GAACACTTGC CGCTTACCAA CAACGGCAAA CGCGATTATG CTCATTTATT AGCGCAACTT
GACGTAACGC TTGAGCAAGC ATCAAGTATT GCCCCCAATA ATCCAATTCA AGCGCTGATT
GCAGCAATTT GGCTTGAGAT TTTGCAGCAA CCAATTGCCA GCATTGATCA AAACTTTTTC
GAGGTTGGCG GTAATTCGTT GAATGCGGTT TTGGTGCTAA CCCGCATTCG CGAATTATTA
CGGGTCAATC TGAGTTTGCG CAGCCTATTT GCCCAACCAA CCATTCGCGG TGTTGAACAA
GCCTTAGTAC AGCAAGAACC CAAGCCTGGC CAAACTGCCA AAATTGCCGA GTTGGTGCAA
AAACTCCAGC AAGCTACACC TGAGCAGCGC CAACAAGCCT TAGCATCCAA GCGCCAAGAA
CGGACGGTCG GCGAATGA
 
Protein sequence
MWQLLTQLRA ADIRIWLEGE RLCYDAPPGV LSEQLLSQLR QHKAELCQFL ARTQPNQFAP 
IPRLEDDQPV ALSFAQQRLW FLHEFAPTST AYHLPAAIEL NGELDRTALH QSFEQLIERH
TILRSHVLWQ DGQPLQQVAA NWQLPWAFYD LRSANDPAAE CYQQLLAAFE APFDLAIAPS
LRCVLVCLAE KQHILLVTQH HWISDGWSIG IMINELAHIY RATLNQQPHQ LPTLPVQYRD
LAVWQRQQLQ GAKLTQLLTY WRQQLADLPN LALPYEAQKP AQTMSTSIPI QLDRALVERL
TNLNQTHGTT MFMSLLAGFF ALLSRYTQQH DFAVGSPIAG RLQPEAEPLI GCFINSLVLR
ADLSGQPSFR QLLERVRQTS LAAYAHQELP FELLVEALQP ERKLDQQPLF QALFALQNMP
TGQLEAPNLV IKPYAFAQQA PQFPLSLILF EQAGQITGEL NYDPQQLSQQ FASQFAAHYA
QFLTQLLVEP DTAIAQLKLL QQPEQDKVLN LLNSNRQVYP INNSLVARFT QQALATPQAI
ALSYAEQAIS YQQLAEAADQ LVYVLLAQGV QPEQPIGLLC ERSPQLIIGI LGILKAGAAY
LPLDPQLPTS RIEWMLADAQ VNLIVTQNSL LHSVNSQATT ILNLDQLPTT KLTQLPTIHP
DQLAYIIYTS GSTGQPKGTL LSHANVLRLF EATVATIKPS ANDVWSLFHS YAFDFSVWEI
WGALLYGGRL VVVPSTTTRS PEAFSQLLAD ESITVLNQTP SAFRQLLPQL TPAVAANLAL
RLIIFGGEAL DLASLAAWYQ AYPAPAPQLL NMYGITETTV HVTERWLELN DLIEAKASLI
GLPIADLTMY LLDQYGQLVP QGAVGEIYVG GAGLARGYLK QAALTAQRFV PDPWSSTGAR
LYRSGDLARI NQFGELEYLG RSDQQVKLRG FRIELGELEQ AICRQAGVAD CWAFVQKLDQ
HERLVVWVVP NQPALSVEQL RQALALELPH YLQPNLWLLC EHLPLTNNGK RDYAHLLAQL
DVTLEQASSI APNNPIQALI AAIWLEILQQ PIASIDQNFF EVGGNSLNAV LVLTRIRELL
RVNLSLRSLF AQPTIRGVEQ ALVQQEPKPG QTAKIAELVQ KLQQATPEQR QQALASKRQE
RTVGE