Gene Haur_1875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1875 
Symbol 
ID5733764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2226333 
End bp2232305 
Gene Length5973 bp 
Protein Length1990 aa 
Translation table11 
GC content50% 
IMG OID641279019 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001544646 
Protein GI159898399 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01720] non-ribosomal peptide synthase domain TIGR01720
[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCAG ATATTGAGGA CATCTACCCA CTTTCGCCGT TACAACAAGG GTTACTCTTT 
CATAGCCTGT ATGACCCAGA TTCGGGAGCC TATTTTGAGC AATTTACCTG TCAATTAAGG
GGAATCCTTC AGCTTGACGC GTTTCGACGG GCTTGGCAAC ACGTGCTTGA GCGCCATGCT
GCTTTGCGCA CAGTCTTCGT ATGGGAAGAT TTGGCAGAAC CGTTACAAGT GGTCTATCGG
GCGGTGCAGT TGCCCTTGGA TTACCACGAT TGGCGCGAAT TAGATCCGCA GACCCATACG
GCCCAGCTTG AAGCCTATTT CCAAACTGAG CGCCAACGTG GGTTTGATCT TAGCCAAGCG
CCGTTATTAC GGGCTAGTTT AATTCAGTTG AGCGATGATT GCTATCAATT TGTTTGGTGT
AATCATCACT TATTGCTCGA TGGCTGGAGC ATGGCGCTGT TATTAAAGGA AGCATTTAGT
TACTACAGCG CTTTCTGCGA GGGTCGGGCA TTGCGTTTAG CCCAAACGCG ACCGTATCGC
GATTATATTG CATGGCTACA AAAGCAAGAT CAAGCCAAGG CCGAGCAATT TTGGCGAGCC
AACTTAAGCC CTATTTCAGC CCCAACGCCG TTGGTGATCG AGCGCCCGAA TTATGCCTTG
CTTGGGCCAG AACAACCATG CGAACAGCGG ATTGTGCTCG ATCTTGCGGC AACGGAAACG
CTACAGATCA TGGCCCGCCA GCATAAACTC ACGATCAATA CCATCTTGCA AGGAGCTTGG
GCCTTATTGC TTGGGCGGTA TAGTGGCGAA CGCACGATCG TTTTTGGGAG TCCAGTTTCC
AGTCGTCCTG CCCAACTGGC TGGTAGCAAT GCGATGGTTG GTTTATTTAT CAATACCATC
CCTGTGTGCA TAACCATCAA ACCTCAGGCA GCGGTCAGTG AATGGCTCCA AGCATTACAA
CAACAGCAGG TCGAGGCCCA ACAATACCAC TATACCGGCT TGAATCAGCT TCAAACGTGG
AGCGCAGTGC CACGTGGTAT ACCACTGTTT GAGAGTATTT TGGTGTTTGA AAATTACCCC
TTGGCCGCAT GGCAACAATC GGGCAATGCA ACCTTGCAAC TGCAAGATGT GCGCTTTATT
GAGCAAACCA ATTATCCACT TTCAATTGAG GCAGTGCTCG CACCAAATCT GGTGATTCAA
GTGTTTTATG ATCAACGGCG CTTTGAGCCA GCGACGATCA CACGCTTGTT GGAGCATCTT
CAAAATCTAT TGCTAAGCTT TGCCACCGCG CCACAAGCTC GTTTAGCCTC AATTGATCTA
TTGACCGCTG CTGAACGACA GGTGATGCTG CGGGACTGGA ATACGACGAA TGTGCCGTTA
CCTAGCTCAA TCTATTTACA TAAGATTGTT GCGGCCCATG CCCAAGCTAC TCCAGATGCA
GTTGCATTGC GATTTGGTCA ACAACACCTG AGTTATGGCG AGTTGAATCG GCGGGCCAAT
CAATTGGCGG CCTATCTTCG GGCGCAGGGA GTTCCGCCTG GTAGCTTGGT TGGGCTGTGT
GTTGAGCGTT CGCTTGAGTT GGTGCTTGGA ATTTTGGCAA TTCTTAAAGC CGGGGCCGCC
TATGTACCGC TTGATCCGCG CTATCCGCTT GAGCGATTGC ACTATATGCT CAACGATAGC
CAAGCTCAGG TTTTGCTGAC TCAGCATTCG CTTAGCCAAC AAATTCGCAC TGAGCAACAA
CGGGTAATCT ATCTTGATCA CGATTGGCCA ACGATTGCTC AATATCCCTC GTTTGAGCTA
GCCGTACCAC TTTGGCCTGA GAGTTTGGTC TATCTGATTT ACACTTCTGG CTCAACTGGG
CGACCTAAGG CAGTGCCAAT TACCCATCGG GGTTTGGCTA ACTTGGCCTA TGCCCAAATT
CAAGCCTTTG AACTTGATGC ACAGCAGCGG ATTTTGCAAT TTGCCTCGTT GAGTTTCGAT
GCCTCGATTT TTGAAATCGT TATGGCACTC TGGTCGGGGG CAACCTTAGT GCTGGCTGAT
CAAGAGACTT TGTTGCCTGG CCCAAGTTTG ATTGAATTAT TGCAACAGCA AGCGATTACT
CATATTACTG TGCCGCCGTC GGCATTAAAA GTGCTGCCCG AGGCAGAATT ACCAGCATTA
TCTACGGTCA TCGTGGCGGG CGAGGCTTGT CCGGCTGAGT TGGTGGCGCG TTGGGGTTTG
GATCGACGCT TTTTCAATGC CTATGGCCCA ACCGAAGCGA CAGTTTGGTC GAGCCTCGCC
TTGTGCGACG ATCCAAACCA AAAACCCTCA ATTGGCCGAC CAATTGCCAA TACTCAACTA
TATATTCTTG ATCAATACCT GCAACCTGTG CCAGTTGGGA TTGCTGGCGA GTTGTATATT
GCTGGGCCTG GTTTGGCATG GGGCTATCTC AATCGGCCTG AATTAACTGC CCAGATGTTT
GTGCCAAATC CCTTTAGTGC TGAGCCTGGC CAACGGCTGT ATCGTTCGGG TGATTTGGCT
TGTTTCTTAC CCGATGGCTC GATTAACCAC CTTGGGCGGG TTGATCATCA GGTTAAAATT
CGGGGCTTTC GGATTGAAAC AGGCGAGATT GAGCAATGCT TGTGTGAGCA TCCTTTGGTT
CATGAAGCGG TGGCGATTGC CCGCGATGAG CCAAATGGCC AGAAACGACT GGTGGCCTAT
GTGGTTGCCA CGCCTGATAA TCAACCAAGC AGCGCCGAAT TGCGCACGTT TTTGCAAACG
CGCTTACCAG AACATATGCT ACCAGCGGTA TTTGTGCTGC TGGCTAGCTT ACCGCTAACT
CCCAATGGCA AACTTGATCG CCATGCCTTG CCTGCACCGA AAACGACGCG CCATGCTGAA
CAAGCCTTGT TTGATGCACC CCAAACCGCC AATCAACAAA TCTTGGCTGA GATTTGGGCC
GATGTTTTGG GGCTAGCACA GGTTGGGATT CACGATAATT TCTTCGAGTT GGGCGGCGAT
TCAATTATTT GTATTCAAAT TGTGGCCCGC GCCAACCAAG CTGGTTTGCG GCTAACCCCC
AAGCAGGTTT TTGAACAACG CACGATTGCC AATTTGGCGA CCGTGGTTGG CACTGGCCCC
CAAATTCAGG CTGAACAAGG TTTAGTTAGC GGAGCCGTAC CATTAACCCC GATTCAACAG
TGGTTTTTTG CGCAAAACTT GCCAAATTTT CACCATTGGA ATCAATCGGC CTTGCTCGAA
GTCCGCCAGC CGCTTGATCT AACCTTGCTT AGCCAAGTGT TGTATCAATT GCACATTCAG
CACGATGCAC TGCGCTTACG CTTTCAGTTC GGTACAGATG GGTGGCAGCA AATAAACCTC
GACCATGCTG CCACGCCCAG CATTAGCTTG ATTGATTTAG CTGATTTGCC GCTTGAACAA
CAAAGCGTTG CAATTACTGA GCATGCTAAT CAGCTGCAAG CGTGTTTGAA TCTTAGCACT
GGACCAGTGT TACAGGTTGC TTTATTCAAT TTGGGAGCCG ATCGGTCTGG ACGCTTGTTG
GTGGTGGCTC ATCACTTAAT TTTCGATGGG GTTTCGTGGC GGATCTTTTT TGAAGATTTA
GCGACGGCCT ACCAACAAAT TGCTCAGGCC AAGCCGATTC AGCTGCCTGC GAAAACCAGC
TCATACAAGG CTTGGGCCGA GCGATTGGTT GAGTATAGTC AATCAACAAC CCTACAAGCC
GAATTAACCT ATTGGAATCA GCAAATTGGC GAGTTGCCAA GCTTGCCGAT TGATTTCCCC
GAGGCATTGG CTGACAATAG CGAAGCCTCG CAGGCCTTGG TGACGGTCGC CCTTGATGCG
CCAACGACTG CCTTATTGCT CCACGAGGTG CCCAAAGCCT ATCATACCCA GATCAACGAT
ATATTGTTAA CAGCCTTAGC CCGCTGTTTG AGTCAATGGA GCGGCCAAGC TGCCCTGCTG
ATCGATTTGG AAAGCCATGG CCGCGAAGAT CTGTTTGACG ATCTTGATCT ATCGCGGACG
ATTGGCTGGT TTACAGCAAT TGCGCCCTTG CGCTTAACCC TCGCAGAAAG CGGTGAACTT
GGTGCTGATC TTCAATCAAT CAAAGAGCAG CTTCGTCAAG TTCCACAGCA TGGTGTTGGT
TATGGTATTT TGCGCTATCT TGGGCAACAA CCGATTCAGG CTCAGCCACA GGTTGGCTTT
AATTATCTTG GTCAATTTGG CTATGGCTTG AGTGCTGATT CGCCGTTGGC ATGGGCCTAC
GAATCGAGCG GAGCCGACCA CGACCCAGCT GGGCTGCGAC CACACCTGCT CGAAGTGGGC
GGCAGTATTG TTGATGCCCA ATTAACGATC CAATGGATGT ATAGTACCAA TCTGTATCGC
TCCACGACGA TTGAGCAATT GGCGCATAGC TTGATGCACG AACTACGGGC AATCATTGCG
CATTGTTTGC AACCAGATGT TGGTGGCTAT ACGCCTTCAG ATTTTCCTTT GGCCACATTG
CCAGCAGCAG ATTTGGCTCA GTTGAATGCG CAATATCGCC AAATCGACGA TCTGTATCCG
TTGACTCCAA CCCAACAAGG TATGCTCTTT CATGCCTTAT ATGAGCCTGA ATCGACTGTC
TACTTTATGC AGATTAGTTG GCTCTTTGAG GGCAAACTTG ATCTTGCGGC GTTTCAAGCT
GCTTGGAATC ACACCCTCAA TCAGCATACA ATTTTGCGCA GTTGCTTTGT CTGGCAAGGC
TTAAGCCAAG CCTATCAGTT GGTGCATCCA ACCGTGGAGA TGCCGTGGGA GTATCTGGAT
TGGCGCGAGC TTGAACCTGA GCAACAAGCG ATTAATCTGG CAGGGTTACT TGAGGCCGAT
AAAACTAAGG TTTTTGATCT CTCCCAAGCG CCGTTGATGC GGGTTACGTT GGTGCACTTA
GCTGAGCATA GCTACCATTT TATTTGGAGC CAACACCATA TTTTGCTTGA TGGCTGGTGT
ACCAACATTC TGCTCAAAGA GGTGTTTCGC GCCTACGAGG CCTTGGTGCA GGGTTTGCCA
ATTCCGCTCA GTCAGCCAGC AATTCGGCCT TATCGTGAGT ATATTGCTTG GCTGCAACGC
CAAGATTTAG CCCAAGCCGA GGCCTATTGG CGTAAACGAT TGCAGGGCTT TGCTAAAACT
ACACCACTGC CACCAGCCAG CGGAGCCCAA CAAGCTGGCG TTGATTACGC TGTCCAGAAG
TTGCCGCTTG ATCCAGCGCT CACAACCGCG ATCTATACGC TGCTGCGTCA ACATCAACTG
ACGATGAACA CGCTGTTGCA AGGGCTTTGG GCCTGTGTTT TGGCGCATTA CAGTGGCCAG
CATGATCTTG TTTTTGGCAG CACGGTTTCT GGCCGTCCAG TCGATTTGGC TGGAGCTGAG
AATATGTTGG GATTATTTAT CAACACCCTG CCGGTACGAG TTCGCATCCA ACCAACCTTG
TCAATCATTG AATGGTTACA GGATGTGCAA GCTCAGCAGG TTGAAATGCG TCAATATGAA
TATACACCAG TGGCGCAGGT TCAGCGCTGG AGCGAATTGC CGCCCCGTCA ACCATTATTT
GAAAGCGCTG TGGTGTTTGA AAATCTGCCG ATGGATAGTA GCAATCAGGG TCAGTTTAAT
GACCTAACGA TGAGCAATAT TCAATCGTTT ATTCAAAATA ACTTCCCATT GACGATTCGC
GGTGCGCCGA GTGCCACAAC CTTCGAGTTG CATGTGCTCT ACGATCGCCA GCGTTTTGCC
ACAACCACCG TTTTAGCGTT GCTAGGCCAA CTTGAAGCCT TGTTCAAGGC CGTGCAACAC
CAACCAAGCG CCTCATTGGC CGATTTGGCC CAGCGATTAG AGGATTTTGA TCACCATAAC
CAAAAAGCGC AAGCTCAACA GAGCGAAACC AGCAGTCTGC AAAAACTAAA ACACGTCAAA
CGTAAGGCTA TCCGTGGGCA ACAATCTGAA TAA
 
Protein sequence
MNADIEDIYP LSPLQQGLLF HSLYDPDSGA YFEQFTCQLR GILQLDAFRR AWQHVLERHA 
ALRTVFVWED LAEPLQVVYR AVQLPLDYHD WRELDPQTHT AQLEAYFQTE RQRGFDLSQA
PLLRASLIQL SDDCYQFVWC NHHLLLDGWS MALLLKEAFS YYSAFCEGRA LRLAQTRPYR
DYIAWLQKQD QAKAEQFWRA NLSPISAPTP LVIERPNYAL LGPEQPCEQR IVLDLAATET
LQIMARQHKL TINTILQGAW ALLLGRYSGE RTIVFGSPVS SRPAQLAGSN AMVGLFINTI
PVCITIKPQA AVSEWLQALQ QQQVEAQQYH YTGLNQLQTW SAVPRGIPLF ESILVFENYP
LAAWQQSGNA TLQLQDVRFI EQTNYPLSIE AVLAPNLVIQ VFYDQRRFEP ATITRLLEHL
QNLLLSFATA PQARLASIDL LTAAERQVML RDWNTTNVPL PSSIYLHKIV AAHAQATPDA
VALRFGQQHL SYGELNRRAN QLAAYLRAQG VPPGSLVGLC VERSLELVLG ILAILKAGAA
YVPLDPRYPL ERLHYMLNDS QAQVLLTQHS LSQQIRTEQQ RVIYLDHDWP TIAQYPSFEL
AVPLWPESLV YLIYTSGSTG RPKAVPITHR GLANLAYAQI QAFELDAQQR ILQFASLSFD
ASIFEIVMAL WSGATLVLAD QETLLPGPSL IELLQQQAIT HITVPPSALK VLPEAELPAL
STVIVAGEAC PAELVARWGL DRRFFNAYGP TEATVWSSLA LCDDPNQKPS IGRPIANTQL
YILDQYLQPV PVGIAGELYI AGPGLAWGYL NRPELTAQMF VPNPFSAEPG QRLYRSGDLA
CFLPDGSINH LGRVDHQVKI RGFRIETGEI EQCLCEHPLV HEAVAIARDE PNGQKRLVAY
VVATPDNQPS SAELRTFLQT RLPEHMLPAV FVLLASLPLT PNGKLDRHAL PAPKTTRHAE
QALFDAPQTA NQQILAEIWA DVLGLAQVGI HDNFFELGGD SIICIQIVAR ANQAGLRLTP
KQVFEQRTIA NLATVVGTGP QIQAEQGLVS GAVPLTPIQQ WFFAQNLPNF HHWNQSALLE
VRQPLDLTLL SQVLYQLHIQ HDALRLRFQF GTDGWQQINL DHAATPSISL IDLADLPLEQ
QSVAITEHAN QLQACLNLST GPVLQVALFN LGADRSGRLL VVAHHLIFDG VSWRIFFEDL
ATAYQQIAQA KPIQLPAKTS SYKAWAERLV EYSQSTTLQA ELTYWNQQIG ELPSLPIDFP
EALADNSEAS QALVTVALDA PTTALLLHEV PKAYHTQIND ILLTALARCL SQWSGQAALL
IDLESHGRED LFDDLDLSRT IGWFTAIAPL RLTLAESGEL GADLQSIKEQ LRQVPQHGVG
YGILRYLGQQ PIQAQPQVGF NYLGQFGYGL SADSPLAWAY ESSGADHDPA GLRPHLLEVG
GSIVDAQLTI QWMYSTNLYR STTIEQLAHS LMHELRAIIA HCLQPDVGGY TPSDFPLATL
PAADLAQLNA QYRQIDDLYP LTPTQQGMLF HALYEPESTV YFMQISWLFE GKLDLAAFQA
AWNHTLNQHT ILRSCFVWQG LSQAYQLVHP TVEMPWEYLD WRELEPEQQA INLAGLLEAD
KTKVFDLSQA PLMRVTLVHL AEHSYHFIWS QHHILLDGWC TNILLKEVFR AYEALVQGLP
IPLSQPAIRP YREYIAWLQR QDLAQAEAYW RKRLQGFAKT TPLPPASGAQ QAGVDYAVQK
LPLDPALTTA IYTLLRQHQL TMNTLLQGLW ACVLAHYSGQ HDLVFGSTVS GRPVDLAGAE
NMLGLFINTL PVRVRIQPTL SIIEWLQDVQ AQQVEMRQYE YTPVAQVQRW SELPPRQPLF
ESAVVFENLP MDSSNQGQFN DLTMSNIQSF IQNNFPLTIR GAPSATTFEL HVLYDRQRFA
TTTVLALLGQ LEALFKAVQH QPSASLADLA QRLEDFDHHN QKAQAQQSET SSLQKLKHVK
RKAIRGQQSE