Gene Haur_1874 details


Gene Information

Locus tag: Haur_1874
Symbol:
ID: 5733763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2221657
End bp: 2226318
Gene Length: 4662 bp
Protein Length: 1553 aa
Translation table: 11
GC content: 49%
IMG OID: 641279018
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001544645
Protein GI: 159898398
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
            [TIGR01733] amino acid adenylation domain
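The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2221657
end_bp = 2226318
gene_length_bp = 4662
protein_length_aa = 1553

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop.
codons = span // 3

print(span == gene_length_bp)           # coordinates agree with the stated gene length
print(span % 3 == 0)                    # length is a multiple of three
print(codons - 1 == protein_length_aa)  # codons minus the stop codon give the protein length
```

Under these assumptions all three checks print `True`, so the listed coordinates, gene length, and protein length are mutually consistent.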


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGATT TTGCCAAACG CCTAGCTGCT CTTTCGCCAG AGCAACGAGC CTTGCTTGAA 
AAACGGCTCA ACCAGAAAAA AGCCGAACCA AACAAACAGC AGATTCCCCA CGTCGAACGG
CAAGGTGCAG CCTATGCCTT ATCGTTTGCC CAAGAACGTT TGTGGTTGAT GTACCAACTT
GATCCGCAGA GTGCGCTGTA TAACGTGCCG GTCGTGATTC GCTTTGGGCC AAACTTTGAC
GTGGCCTTAG AGCAACGGGT TTTTCAGGCG ATTATTGAGC GCCATGAAAT TCTGCGTACC
ACGTTTAAAA CCGTCGATCA ACAACCAGTC CAAATGCTTG AGCCTGTGCC AGATTTTAGG
CTCCCAGTGA TCGATTTGCG GTCTTTTCCC GAGGAGCAGC AAGCGTCCGA GGCCCAAAAA
CACACCCTTG CTGAAGCTCA AAAGCCCTTT GATTTGAGCA ATGGGCCATT GTTGCGCGTG
GTTTTGCTGC GCTTAACTCA TGAACATCAC TTAATTATTA ATTTGCACCA TATTGTTTCT
GACGATTGGT CGCCTGGGGT TTTGGTCCAA GAGATTGGGG CTTTGCATCA GGCATTCAGT
CAAGGCCAGC CTTCACCATT GGCTCCGTTG GCAATACAGT ATCTTGATTT TGCCGTATGG
CAGCGCCAAC GCCTAAACCA GGCCAGTGTC TCCCAGCAAT TGGAATATTG GCAAAACCAA
CTCTCGGGTT CATTACCCTT GTTGGCCTTG CCAACCGATC GCCCACGCCC ACGGATTCAA
ACCTTCAACG GGGCAAAAAT CAGTCGGCGA ATGCCTGCCA GCCTACTTAA ATCGCTCAAA
CAGTTGACTG CCCAAACTGG CGCAACCTTG TTTATGAGCG TTTTGGCGGC CTATAAGATC
TTGTTGCAGC ACTATACAGG CCAAACAGAT TTGCTGATTG GTACGCCAAT TACCAGTCGA
ACCCGCCCTG AATTAGAGCC ATTGATCGGC TGTTTTATTA ATATGTTGGT CTTGCGTAGC
CAACTTAATC TTGAGCAAAC CTCCCGCGAG TTTATTCAGC AGGTGCGCCA AATGGCCTTG
GATGCTTATG CCAATGCTGA AGTGCCGTTT GAGCGCTTGG TGCAGGTGTT GCGGCCTGAG
CGTAATCCGA GCTATACCCC AATTTTTCAG GCAGCCTATA TTCTGCAAGA TCCCCAAGAA
GCTCGTCAGA AGATGCCTGA AAATGTACTT GGTTATGCCG AGGTTGATAC TGGGACTTCT
AAGTTTGATT TGACCTTGGA GATCGATGAA GTTGATGGTG AATTAGCTAC AGCTTTTGTG
TATAGTACTG ATCTCTTCGA TCAAGCAACC GTCGAACGCA TGCTTGGGCA TCTGCAACAG
ATTCTTGAAA CCATGGTTGC TCAGCCGGAT TTGCCGATTG CTGCGATTGA ATTGGTTACC
AGTGCCGAAC GCCAGCAATT GTTGATCGAG TGGAATGCTA CTGAAACAGC CTTTGAAACA
GATTTTGTTC AGCATGCAAT TGCCCGTCAC GCTCAAACCC AACCAGATCA ACTGGCCTTG
CGCTATGGTG ATCAGCAGTA TTCCTATGCT GAATTGAACC AGCATGCTGA GCGCTTGGCG
ACCTATTTAC AGCAATTAGG CGTAAAACCA GAATGTGTTG TTGGTTTGTG CGTTGAGCGC
ACTCCTGCAA TGGTGATTGC GATTCTAGCG ATTTTTAAAG CTGGCGGCCT GTTTTTGCCA
CTTGACCCCA GCTTTCCTGC TGATCGTTTG GCTTATATTG TTGCCGATGC CAAGCCTTTG
GTCGTGCTAA CAACTGCTGC CTTAGCTGCT GAATTGCCCT TAGAAGCTCC CCATATTGTT
GCACTTGATC AAGCTTGGCA TGCCCATATT CAGCAGGTTG ATGCGCCAAA TCATCAACTG
CAACCCAGCA ATTTGGCCTA TATGATTTAT ACCTCGGGCA CGACTGGCAC GCCTAAAGCG
GTTTTGGTTA CCCATCAAAA TCTGTTGAAT GTGTTGTTGG CAAGTCAGCA AGCGTTTGGC
TTCAATCCTC GTGATGTGAT GCCGTGTATT GCCCCATTCT CATTTGATAT TTTTCTGTTT
GAATTGCTCA ACCCATTGCT TGCTGGCGGC ACATCGTGGA TGCTTACCCG CGAAGAAATT
TTGGATATTG CTGGCTTGAT CGAGTCGCTG GCTTCGATGA GTGTGATTCA CACTGTCCCA
AGTTTGATGC GTCAATTGGT CAATGCCTTG GAAACTGAAG GCTATACTGC CGCTGCTTGT
CAAAGCATTC GGATGATTTT TATTGGTGGC GATTTAGTGC CGCCTGAATT GTTAAATGCG
ATGCGGCTCG CCTTTCCGCA AGCCGCAATT CATGTGTTGT ATGGCCCAAC CGAAGCCACG
ATTATTTGTA CCAGCTATCG TGTGCCCCAA CAGGGCTTGC TTGAGCGCCA TTTGATTGGG
CGACCACTAC CAAATATGGC GATTCGCTTG TATGACCCAC AGCAAAACCT CGTACCAATT
GGTATGCCAG GCGAACTGTA CATCGGCGGG GCTGGGGTTA GTCGGGGCTA TTTGAATCGC
TCGGAATTGA CTGACGAGAA ATTTGTCGAG CTTGATCAGC AGCGTTGGTA TCGTACTGGC
GATTTGGCGC GTTATCAGGT TGATGGAAAT CTCGAATTTT TAGGCCGCAT CGATCAACAA
GTTAAAATTC GTGGCTTTCG GATTGAGCTT GGCGAAATTG AAGCAGTACT GGCGCAACAT
CCTAGCATTC GGGAAGCGGT GGTGGTTGCC CGTGAAGATC TGCCTGGCGA TAAGCGACTC
GTAGCTTATT TGATTGCGGA ATCAGAACAA ATGCCGCATA TTGGTGAATT ACGGGCATTT
TTGCAAACCA AACTGCCTGA ATATATGCTG CCTGCAGCGT TTATGGTGTT GGAGAGCCTG
CCGCTGACCC GCAATGGGAA GGTTGATCGT CAAGCGTTGC CTGTGCCGCC CACCACCCGT
GAGCATTTGG CCAATCAATT TGCTGCACCA ACCAATCAAC TCGAAACGCT GCTGAGCACG
ATTTGGGCCG AGGTGTTGGG GCGCGAACAG GTTGGCATTC ACGACAATTT CTTTGAACTC
GGCGGCGATT CGATTCTGAG TTTGCAAATT GTAGCGCGAC TCAATCAGGC TGGCTATCAT
GTGCTAACCA AAGATATATT TCAGTATCAA ACGATTGCTG AATTGGCACA GGTGGTTTCG
AGCACCACGC TGGTTGTGGC CGATCAAGGC TTGATCGAAG GTGCCGTGCC GCTTACGCCC
ATTCAACAGT GGTTCTTTTG CCAAAATCTA CCTAATCCAC ACTATTTTAA TAGTATGCCA
GTGTTGCTTG AAGCGCCTGC CGAGCTTACT CAGGCTGATT TGCACTCGAT CGTTGCCCAA
CTTTTGCAGC ATCACGATGG CCTGCGCTTG CGCTTCGAGT TGGTTGCTGA CCAATGGCAA
CAAACCCACG GCTCGCTTGA GGCTGATTTA CCACTAGCGA TAATTGATCT CAGGGGCTTG
AATCAAGGAA CCCAAACCCA AACGATTGAA GCAACCGCGA TTGAATACCA AACCAAACTC
GATTTGAGCA CTGGCCCGCT GATTTGGTTT GTGCTATTCG AAGCAGAGTT GAGCAAGCGC
CTATTAATTG TGGCGCATCA CCTTGTATTT GATGGAATTT CGCTGCGGAT TCTGTTGGAA
GATTTACAAA CGGCCTATGC TCAACTACAG GCAGGGCAAT CGATTAATTT ACCGCTCAAA
ACCAGTTCGT TCAAAACTTG GGCTTTGGCG CTTCAGGAGT ATGCCCAATC ACCAGAAGTC
GCCCAACAAG CCAGCTATTG GCAGACGATT CAGCATACTC AATCACCTTT GCCGCTTGAT
CATAGCGGTC AGGCCAATAC TGAAGCCTCA AGCAGTATTG TCTTGGCCCG ACTTGAGGTC
GCAGAAACCG ACGTATTGCT GAACCAATTG CCAACGCTCT ATCATGCCAG CCTTGAAGAA
GTATTGCTAA CCGCTTTAGC CCAAACGATT GGCGAATGGA CGTATAGCCA AAGCCTTGTG
GTTGATTTAG AAAGCCATGG CCGCGCCGAA TCGATTGCTG AAAATCTCAA TCTTTCGCGC
ACGATTGGCT GGTTTACCAG TCTTTATCCA GTAATTTTGG ATTGGACTGG CTTTGATGGG
CCGCTTGAAA TGCTGAAAGT GATTAAAGAA ACCCTGCGCC AAGTGCCTGA ATATGGGCTT
AGTTATGGCT TATGGCAGTT CAATCAACCT AATCCTAGCG CCAATTCACA CGCTGAACTA
CGCTTTAATT ATCTTGGTCA ATTAGGGGGT GCGGCTCAAA AAGCAGCCTT TGAGTTATTG
CCCCAACTTG AAGTGCCACT GCGTGACCCT GCTAGCACGC GATCGCATGT TTTAGACGTT
GATGTGGTGG TGGTGCAACA GCAATTGTGG GTGCGCTGGA CGTATAGCAA TCACTTACAT
GAGCCAGCCA CGATTACCCA GCTTGCCGAG CGCTTTATGG CGGCATTACG CGATTTATTG
CAAGGCGATA GCGCAACTAG CGCCGCGTAT GTTCCATCCG ATTTCCCGAT GGCAAACCTT
GATCAGCAGA CATTGAACTC ACTTATGAAG AAACTTCGCT AG
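The gene sequence above is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is analysed. The following is a minimal sketch of how the blocks could be joined and how a GC fraction could be computed; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first displayed line is used as a short demonstration rather than the full 4662 bp sequence.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First displayed line of the gene sequence, used here only as a demonstration.
first_line = "ATGAGCGATT TTGCCAAACG CCTAGCTGCT CTTTCGCCAG AGCAACGAGC CTTGCTTGAA"
dna = join_blocks(first_line)

print(len(dna))                    # six blocks of ten bases
print(round(gc_fraction(dna), 2))  # GC fraction of this one line only
```

Applying the same two functions to the complete gene sequence would give a value that can be compared with the GC content reported in the record.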
 
Protein sequence
MSDFAKRLAA LSPEQRALLE KRLNQKKAEP NKQQIPHVER QGAAYALSFA QERLWLMYQL 
DPQSALYNVP VVIRFGPNFD VALEQRVFQA IIERHEILRT TFKTVDQQPV QMLEPVPDFR
LPVIDLRSFP EEQQASEAQK HTLAEAQKPF DLSNGPLLRV VLLRLTHEHH LIINLHHIVS
DDWSPGVLVQ EIGALHQAFS QGQPSPLAPL AIQYLDFAVW QRQRLNQASV SQQLEYWQNQ
LSGSLPLLAL PTDRPRPRIQ TFNGAKISRR MPASLLKSLK QLTAQTGATL FMSVLAAYKI
LLQHYTGQTD LLIGTPITSR TRPELEPLIG CFINMLVLRS QLNLEQTSRE FIQQVRQMAL
DAYANAEVPF ERLVQVLRPE RNPSYTPIFQ AAYILQDPQE ARQKMPENVL GYAEVDTGTS
KFDLTLEIDE VDGELATAFV YSTDLFDQAT VERMLGHLQQ ILETMVAQPD LPIAAIELVT
SAERQQLLIE WNATETAFET DFVQHAIARH AQTQPDQLAL RYGDQQYSYA ELNQHAERLA
TYLQQLGVKP ECVVGLCVER TPAMVIAILA IFKAGGLFLP LDPSFPADRL AYIVADAKPL
VVLTTAALAA ELPLEAPHIV ALDQAWHAHI QQVDAPNHQL QPSNLAYMIY TSGTTGTPKA
VLVTHQNLLN VLLASQQAFG FNPRDVMPCI APFSFDIFLF ELLNPLLAGG TSWMLTREEI
LDIAGLIESL ASMSVIHTVP SLMRQLVNAL ETEGYTAAAC QSIRMIFIGG DLVPPELLNA
MRLAFPQAAI HVLYGPTEAT IICTSYRVPQ QGLLERHLIG RPLPNMAIRL YDPQQNLVPI
GMPGELYIGG AGVSRGYLNR SELTDEKFVE LDQQRWYRTG DLARYQVDGN LEFLGRIDQQ
VKIRGFRIEL GEIEAVLAQH PSIREAVVVA REDLPGDKRL VAYLIAESEQ MPHIGELRAF
LQTKLPEYML PAAFMVLESL PLTRNGKVDR QALPVPPTTR EHLANQFAAP TNQLETLLST
IWAEVLGREQ VGIHDNFFEL GGDSILSLQI VARLNQAGYH VLTKDIFQYQ TIAELAQVVS
STTLVVADQG LIEGAVPLTP IQQWFFCQNL PNPHYFNSMP VLLEAPAELT QADLHSIVAQ
LLQHHDGLRL RFELVADQWQ QTHGSLEADL PLAIIDLRGL NQGTQTQTIE ATAIEYQTKL
DLSTGPLIWF VLFEAELSKR LLIVAHHLVF DGISLRILLE DLQTAYAQLQ AGQSINLPLK
TSSFKTWALA LQEYAQSPEV AQQASYWQTI QHTQSPLPLD HSGQANTEAS SSIVLARLEV
AETDVLLNQL PTLYHASLEE VLLTALAQTI GEWTYSQSLV VDLESHGRAE SIAENLNLSR
TIGWFTSLYP VILDWTGFDG PLEMLKVIKE TLRQVPEYGL SYGLWQFNQP NPSANSHAEL
RFNYLGQLGG AAQKAAFELL PQLEVPLRDP ASTRSHVLDV DVVVVQQQLW VRWTYSNHLH
EPATITQLAE RFMAALRDLL QGDSATSAAY VPSDFPMANL DQQTLNSLMK KLR
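The relationship between the gene sequence and the protein sequence can be illustrated by translating codons one at a time. The sketch below builds a codon table from the compact amino acid string that NCBI uses for translation table 11 (its codon-to-amino-acid assignments are the same as the standard code; alternative start codons are not modelled here). The function name `translate` is hypothetical, and only the first 30 bases and the last displayed line of the gene are translated.

```python
from itertools import product

# Codon order T, C, A, G for each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last displayed line of the gene sequence above.
gene_start = "ATGAGCGATT TTGCCAAACG CCTAGCTGCT"
gene_end = "GATCAGCAGA CATTGAACTC ACTTATGAAG AAACTTCGCT AG"

print(translate(gene_start))  # MSDFAKRLAA
print(translate(gene_end))    # DQQTLNSLMKKLR
```

Both results match the corresponding ends of the protein sequence above: the first block `MSDFAKRLAA` and the final residues `DQQTLNSLMK KLR`, with the terminal `TAG` read as a stop codon.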