Gene Haur_1881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1881 
Symbol 
ID5733770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2248321 
End bp2253390 
Gene Length5070 bp 
Protein Length1689 aa 
Translation table11 
GC content53% 
IMG OID641279025 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001544652 
Protein GI159898405 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAGC ATTCAGATGA TTCGTTGCGC AACCAGACTG ATCCTGCATC TCGGCGAGTT 
CCGCTTTCGG CGGCCAAAAA AGCGCGGCTC GAAAAACGCC TGCGCGGTGA TAGCGACAAT
CAGCAGGCTG AATCAAGCAT TCCTCGCCTC GCGACCTATG ATCAGGTGCC ATTATCGTTT
GCGCAAGAAC GGCTGTGGTT TCTCAGCCAA TACGAACCAA CGAGCAGCAA CGCCTATATT
ATTCCCTTGG CCGTGCGGAT CGATGGCCCC ATTGATCCAT CGCTGTTGGA GCAGGCTTTG
CAATTGGTGG TGGATCGCCA TGCCAGCTTC CGCACCACGT TTCATGCGCA GAATGGGGTG
CCGTTTCAAC GGGTTGCCCC ACAGCTGCCG CTCAGGTTGC CCGTGCTTGG GCTGGACGTG
GCTGATGCGA GCGATGAGGC GGCGGTGTTG CAGGTCGTGC TGGATCAGCT TGTACCGCTG
TTGCAGCTAC CGTTTGATCT TGAGCATGGG CCGTTGTTGC GAGCGACCTT ACTGCGTTTA
GCTGCCGAAT CGCATGTGCT GCTGCTGATT TGCCATCATA TTATTAGTGA TGGCTGGTCG
ATGGGGGTGC TGCTGCGCGA TTTTGCCAGC TTTTTAGGCG CGTTACGCAC CAATACTGCG
CCGGATGTAC CGCCGTTGGT GGTGCAAGCC CCGGATGTCG CGGTGTGGCA ACGGCAACGC
TTGCAAGGCC ACTATCTGAC GACACTCCAA GATTTTTGGA AGCAGCAGTT GGCGGATCTC
GAACCATTGA ATGTGCCTAC CGATTTTGTC CGACCGGCCC AGCAATCCTA TCGTGGGGCG
ACATTGAGTT TTCAGCTCCC CGCTGCGCTC AGCACCCAGC TTCAGCGTAT GGCGCAACAG
CATGATGTGA CCCCATTTAT GCTGCTGCTT GCCGCATTTC AGGCTTTTTT GGCGCGACTG
AGTGGGCAAC AGGATCTGGC GATTGGTTCC GTGCTAGCGA GTCGGGCCGA TGCTGACCTT
GATCCGGTGA TTGGTTTTTT GGTGAATACC TGGACGTTGC GGAACAACAT AGATGTAGCG
CAGCCATTGG CGCAGCTCTT GCCCACGGTG CGACGTACCG TCCTGGCAGC GTTTGAGCAC
CGCGATTTAC CGTTTGAGCA GGTGGTGCAA CTGGTGCAAC CGGAGCGTGA TCTGAGCCGT
TCGCCCGTGT TTCAGGTGAT GATGACCTAT CAAAACGTGC CGCAACGGCA GATGGAATGG
GGGGATGTTC GGTTGACCCC GATTAGCCTG CCCAGCACGG TGGCGAAATT CGACCTAACT
CTGGCGCTGA GCGAAACCCC CGAGGGTTTT CGTGGGGTGA TGGAATATCG GAGCGATTTA
TTTCGGCGCA GCACGATTGC CACGATGGTT GCGCGTTGGG AAATGTTTTT ACACGGGATT
GTGGCAGACT TTACCACGTC CATTGCCCGA TTGCCGTTGG TCTTGCCTGC GGAACGCAGC
TTATTGCTTG ATACGTTGAA TGCGACCACA ACCGCCTACC CACACGATCA AAGCGTAGCA
AGTTTGTTTG CCGAACAAGC CCGCCTGTGG CCGGAGCGGA TTGCGCTTCG TTTTGGTGAG
CACAGCCTCA GCTATCACGC GCTTGAGCAA CGGGCCAACC AGCTAGCGCA CCATCTGCAA
CTGCTGGGTG TTGGGCCAGA GCATGTGGTT GGTTTGTGTG TTGAGCGCTC GTTGGACTTA
GTGGTGGCGA TTCTGGCGAT TCTCAAGGCT GGCGCAGCCT ATGCCCCGGT CGATCCGAGT
TATCCCGTTG AGCGTTTGGC CTGGATGCTG AGTGATTTAC AGCCAACGGT GGTGATTGCA
CAGCACGGCG TGCTCGACCG CTTACCGTCG GTTGCGTGTT CCGTGGTTGT GCTTGAAACC
ATAGCCGCGC ACCTCGCAGC GTATCCCACG ACTGCGCCAA CCGTGGACAT CAGCCCCGAA
AATTTGGCCT ATGTGATGTA TACCTCTGGT TCAACAGGCC GACCCAAAGG GATTATGATC
AATCAGCGGA ACATTGTGCG ATTGGTCCGC AACACCACGT ATGCGGCATT TGGGCCAGAC
CAGGTTGGGT TATTGCTGGC AACAGTGGCA TTTGATGCTT CGACGTTCGA ACTTTGGGGG
TGTTTGCTGA ATGGTGGACG CTTAGTGATC GCCCCACCGC AGCAACTCAG CCTTGCCGAA
TTGGGCCACT TGGTGGAGCG CGAACAGATT ACGACGCTCT GGTTGACCGC CGGATTGTTC
CATCAAATGG TGGATCATGC GCTGGATCGA TTGGGTTCGT TGCGTCAATT ACTGGCCGGT
GGCGATCGAC TGTCGCCCGT GCATGTACAC AAAGTGCTGG AACGCTGGCC GCAGTGTCGC
CTGATTAATG GGTATGGCCC AACGGAAAAC ACCACATTTA GCTGTTGTCA GCAGCTTAGT
GCAACCACTG ACCTGGCGCA GGGCGTGCCG ATTGGGCAGC CGATTGCGAA CAGCACGGCC
TATATTCTTG ACCGGTTGTT GCAACTGGTT CCCATAGGGG TTGTAGGCGA ACTGTATTTG
GGTGGCGCAG GCTTAGCGCG AGGGTATTTA GCGCGTCCAG ACCAGACGGC GGCGGCATTT
ATCCCGAACC CCATGAGCCA AACGGCGGGC GAACGCCTGT ATCGCTCGGG GGATCTGGCG
CGGTATCGCG ATGATGGGAC GATCGAATTT ATTGGACGAC GGGATCAGCA AGTCAAGGTA
CGCGGGTATC GGATTGAGCT GGAAGAAATC GTTGGCGTGT TGCTGGCACA ACCACAGGTG
GATGATGCGG TGGTGGTGGT GCGGGAGGAT CGGGTTGGTG ATCAGCGCTT GGTGGCCTAT
CTGGTGGGTG ACAATCCGGC GATTGAGCTG ATTGAACAAG CGGTGCAAGG CCAGGTCCCG
AGCTATATGC TCCCGAGTGC CTATGTTGTG CTTGATGCCT TGCCGTTGAC GGCGAATGGC
AAGGTTGATC GGCGGCGGTT GCCAGCGCCG AGCTATGCCG CCATCGCGAA CGATGATCCG
CCACAAACCG ATTTAGAGCA GGCGATAGCG GCGATTTGGG CCGAGGTCTT GGCGGTGCCG
AGCATTCAAC GCCAGACCAA CTTTTTCCAA GTAGGTGGAC ATTCGTTGAG TGCAACTCAG
TTGATTGTGC GGTTACGCCA AATGCTGAAT CGCGATCTGC CCTTGCAATT ATTGTTCGAT
TATCCTTATT TATATCAGCT TGCCGAGCAG CTTGAGCAGC AACCAACAGC CCTGCCAACC
GCGATTCAAC CAATTCCACG TCATCAGCGC CTGCCGTTAA CGTCTGCCCA GCAACGGGTT
TGGTTTTTTG AGCAATTAGT GCCCAATACG GCGATGTATA CGATTGCTTT GCAACTACGA
TTAAGTGGCA AGCTTGAGCC AGCCTTGTTG CAACAGGCGA TTAATCTGCT GATTGCTCGC
CATGAGATCT TGCGAGCGTC GTTTCATGGG CAGGCTGGGC AGCCTTGGCT ACACATTGCT
AACCACTTAA CGCTTGATCT AGAGTTGATT GATCTTCGGA AAATAGATAT TGAACAGCAA
TCGATCGATG TTCAGAAAAT ATGCCAACGT TTAGCATATG CCAGCTATCA ACTTGAGTAT
GCGCCCTTGC TGCGTTTTTG CTTAATCCAG CTTGGCGTAC AAAGCGCGAT ACTATTATTC
ACTATTCAGC ATAGTATTAC CGATGCTTGG TCGATTGATC TGCTTTTGCA GGAGTTATGG
CAAATCTATG CTGATTTAAT CCACCAGCGC CCGCTGAGCC TTGCTGAATT ATCAGTGCAA
TATGTTGATT TTGCTGCTTG GCAAGCTGCT TGGTTGGCTA CGCCCGCAGC GCAACGCCAG
TTAAGCTACT GGCTTGAACA GCTCGCCGAA GCTCCACGCT TGTTGGCCTT ACCAACCGAT
TATCCCCGCC CCGACAACCA AACCTTTGCT GGCTCGGTGG TGAGCCTTGA ATTACCTCAG
CCTTTGACCC ACGAGTTGCG TAGTTTGAGT CAACATCAGC ATGTAACCTT GTTTATGTTG
TTGTTGGCGG CTTTCAAGAG CTTGCTTTAT CGCTACACTG GGCAAACTGA TCTCTGCGTT
GGTTCGCCGA TTGCCAATCG TGGCCAGCCT GAGCTGCAAA CAATGCTTGG CTTTTTTATC
AATACCTTGG TGCTGCGTAG TCGCATTCAG CCAGCTTGGT CTTTTCTTGA GCTCTTGGCT
ACGGTGCGAG CCACAACCTT GGCTGCCTAT GCCCAGCAAG ATCTCCCCTT AGAAAAGATT
ATTGAACAGC TCAAACTAGA GCGCGATTTG AGTTATAACC CATTGTTTCA GGTGATGTTT
AATTTTCGTC ACGATTTTGC GATCAATCGC CAGCATTATG AGCTAGCGAT TGAGGCTGAA
ATGTTGGCGA ATGGTACATC AAAATTTGAT TTAACTTTAG ATGTAGCCGA TCGTGGCTCC
ACGTTATTGC TATGGGTCGA ATATAACTCG GCCTTGTTTG CCCCAGCGAC GATTGAGCGG
ATGCTGGCAC AATTTCAGGT CTTACTGAAT AGCATTTGTG CGCATCCTGG GCAACAATTA
AGTACACTGG AGCTGCGAAC ACCCAATCAA ATTGCTCAGT TAGAACAAGC GCGAGTTGCG
CTATTACAGC AACTTGAGCA GCATCCAGCG ATTAGCCAAG CAGTTGTATT GGTTCGGCCC
GATTTGCCGC AAGCCAACTG GGTCGTTGGC TATGTTGTCA AACAGCCAAA TCAGCAGCTT
GAGCTTGGTC AATTGCAGGC AAGTATCCAA ACAACCTACC CATTGGTACA GTTGGCACTT
TATGAAGTGC CCGATATGCC CTTGGATGCG GCTGGCACGG TTGACCACGC TGGATTAAGC
AAGTATGGTC AGCCATTGCT TAACCAGCAA TTACAGCAGC CCGCCGTGCA ATCCCCGACT
GAAGCAATGC TTGCCGAAAA ACGGGCTAAA CTTTCCGCAG ATAAACGGTC ACTCCTAGAG
AAGCGTTTGA AACAATTACG AGGTGAATAA
 
Protein sequence
MSEHSDDSLR NQTDPASRRV PLSAAKKARL EKRLRGDSDN QQAESSIPRL ATYDQVPLSF 
AQERLWFLSQ YEPTSSNAYI IPLAVRIDGP IDPSLLEQAL QLVVDRHASF RTTFHAQNGV
PFQRVAPQLP LRLPVLGLDV ADASDEAAVL QVVLDQLVPL LQLPFDLEHG PLLRATLLRL
AAESHVLLLI CHHIISDGWS MGVLLRDFAS FLGALRTNTA PDVPPLVVQA PDVAVWQRQR
LQGHYLTTLQ DFWKQQLADL EPLNVPTDFV RPAQQSYRGA TLSFQLPAAL STQLQRMAQQ
HDVTPFMLLL AAFQAFLARL SGQQDLAIGS VLASRADADL DPVIGFLVNT WTLRNNIDVA
QPLAQLLPTV RRTVLAAFEH RDLPFEQVVQ LVQPERDLSR SPVFQVMMTY QNVPQRQMEW
GDVRLTPISL PSTVAKFDLT LALSETPEGF RGVMEYRSDL FRRSTIATMV ARWEMFLHGI
VADFTTSIAR LPLVLPAERS LLLDTLNATT TAYPHDQSVA SLFAEQARLW PERIALRFGE
HSLSYHALEQ RANQLAHHLQ LLGVGPEHVV GLCVERSLDL VVAILAILKA GAAYAPVDPS
YPVERLAWML SDLQPTVVIA QHGVLDRLPS VACSVVVLET IAAHLAAYPT TAPTVDISPE
NLAYVMYTSG STGRPKGIMI NQRNIVRLVR NTTYAAFGPD QVGLLLATVA FDASTFELWG
CLLNGGRLVI APPQQLSLAE LGHLVEREQI TTLWLTAGLF HQMVDHALDR LGSLRQLLAG
GDRLSPVHVH KVLERWPQCR LINGYGPTEN TTFSCCQQLS ATTDLAQGVP IGQPIANSTA
YILDRLLQLV PIGVVGELYL GGAGLARGYL ARPDQTAAAF IPNPMSQTAG ERLYRSGDLA
RYRDDGTIEF IGRRDQQVKV RGYRIELEEI VGVLLAQPQV DDAVVVVRED RVGDQRLVAY
LVGDNPAIEL IEQAVQGQVP SYMLPSAYVV LDALPLTANG KVDRRRLPAP SYAAIANDDP
PQTDLEQAIA AIWAEVLAVP SIQRQTNFFQ VGGHSLSATQ LIVRLRQMLN RDLPLQLLFD
YPYLYQLAEQ LEQQPTALPT AIQPIPRHQR LPLTSAQQRV WFFEQLVPNT AMYTIALQLR
LSGKLEPALL QQAINLLIAR HEILRASFHG QAGQPWLHIA NHLTLDLELI DLRKIDIEQQ
SIDVQKICQR LAYASYQLEY APLLRFCLIQ LGVQSAILLF TIQHSITDAW SIDLLLQELW
QIYADLIHQR PLSLAELSVQ YVDFAAWQAA WLATPAAQRQ LSYWLEQLAE APRLLALPTD
YPRPDNQTFA GSVVSLELPQ PLTHELRSLS QHQHVTLFML LLAAFKSLLY RYTGQTDLCV
GSPIANRGQP ELQTMLGFFI NTLVLRSRIQ PAWSFLELLA TVRATTLAAY AQQDLPLEKI
IEQLKLERDL SYNPLFQVMF NFRHDFAINR QHYELAIEAE MLANGTSKFD LTLDVADRGS
TLLLWVEYNS ALFAPATIER MLAQFQVLLN SICAHPGQQL STLELRTPNQ IAQLEQARVA
LLQQLEQHPA ISQAVVLVRP DLPQANWVVG YVVKQPNQQL ELGQLQASIQ TTYPLVQLAL
YEVPDMPLDA AGTVDHAGLS KYGQPLLNQQ LQQPAVQSPT EAMLAEKRAK LSADKRSLLE
KRLKQLRGE