Gene Haur_1877 details


Gene Information

Locus tag: Haur_1877
Symbol:
ID: 5733766
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2233390
End bp: 2237967
Gene Length: 4578 bp
Protein Length: 1525 aa
Translation table: 11
GC content: 51%
IMG OID: 641279021
Product: amino acid adenylation domain-containing protein
Protein accession: YP_001544648
Protein GI: 159898401
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01720] non-ribosomal peptide synthase domain TIGR01720
            [TIGR01733] amino acid adenylation domain
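
The start/end coordinates, gene length, and protein length listed above are mutually consistent if the coordinates are 1-based and inclusive and the CDS ends in a single stop codon. Both conditions are assumptions here; the record does not state them. A minimal Python sketch of that check, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2233390, 2237967)
print(length, protein_length_aa(length))
```

With the values from this record, the two helpers reproduce the listed 4578 bp and 1525 aa.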


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGATG CGATGATCGA GGGATTTCGG CTTTCCCCAC AGCAACAGCA TGTTTGGGCG 
CTCCAGCAAT TGGATTCAGC CCAGCCTTAT CGTACTCAGG GAACTATCGT GATTGAAGGC
TCCCTCGATA TTGCTCGCTT GCAGGCTAGT TTGTTGCAGG TTGTACAGCG CTACGAAATT
TTGCGCACAA CCTTTCACTA TCTCCAAGGG ATGGCCTTGC CGTTGCAGGT GGTTACCGAA
TTAAGCGAAC TAGCATTGCC AAGCTATGAT CTGAGCGAAC ATAGTTTGAC TGATTTGCAA
AGCCAATTGG CTCAGCAAGC CTTTGATTTT GCTGCTGGCC CATTGCTGCA TGCCTGGCTT
GGGCGTGAAC ATGCGCTAAA ACATTATTTG CTGTTGAGTG TGCCAACGCT CTGTGCTGAT
AACCTGAGTT TGGTCAACTT AACCAGCGAG CTGGCGGCGA TTTATGCTGC CGAGCCAACT
ACCGATGAGC CAATGCAATA TATCGATATT GCTGAGTGGC AGCACGAACT ATTGGAAGCT
GAAGAAACCG CTGCTGGCCG CAGTTACTGG CAACAACAAA CCTGGCACGA TGCGATTACC
GTGCGGCCAG CGTTTGTTGC CAATCAAGCA GCGGCCCAAA CGTTTCAACC GCAGCAATTG
CCAATTCCGC TGACTGAACA ACTAATTGAG CTGCTTAATC AGGCAAGTCA AACTTTAGCT
GTGCCTGCTT CAGCGCTTGC CTTGGCTGCT TGGCGAACCT TGTTGTGGCG CTTGAGTGAT
CAAACCAATG GTGTGGTTGG CGTGGTTTGT GATGGGCGCA AATATGCCGA GCTTGAAACC
ACGCTTGGTT TATTTGCCAA AGCCGTACCG CTGGCTAGCC CGTTGGCAGC AGACCAGCAG
TTTGGTCAAT TGGTCAAACA AGTGCAGCAA GAATATGTGG AGGCCTATGC TTGGCAAGAA
AGTTTTCATT GGCCTGCAGA AGTGTCCAGT CAATTGGCCT TTTTCCCCTT TGGCTTTGAA
GCGCTGACCA CACCCAAACC GCTGGTGATG GCTAATCTCA GCTTTCAAGT CGTGCAGCAA
CGGGCTACGC TCAATCCATA TGCAGCCCAC TTAACTTGGT TCAACCAACC CCAAGGATTT
GCCGCTAGCC TGAATTTTGA TGCTGGGTTG ATTGCGTCAT CGAGTGCTGA ACGGTTGATT
GAGCAATATC AAACCCTTTT GACTGCTGCA CTGAGCAACT TGAACACAAG CTTGGCTCAA
CTGCCAATTG TGGGTACAAA CGAACGTCAA CAGCTATTAA TTGAATTTAA TCAAACAGCC
GAGCCATTTG ACGCTGCACG TTGCTTTCAT GAGTTGTTTA GCGCTCAAGC AGCGATTACA
CCTGATCACC CAGCGGTTGT GGTTGAAGAC CAACAACTGA GCTATGCAGA ACTTGAGGCA
CGTTCCAATC AACTGGCGCG AGAGTTGCTG GCCCGGGGAG TTCGCCCTGA TCAACCAGTT
GCTTTAGCCT TGGATCGCTC GCTCAACCTG CTGGTGGGTA TTTTGGGTAT TCTCAAGGCT
GGCGGAGCTT ATGTACCGCT GGATTTGGGA TTGCCCAAAG AGCGTTTGGG CTTTATGCTG
GGCGATATTC AAGCCTCAAT TGTGGTGAGT GAAACCAGCC TGCAAGCCCA ATTGCCTGAG
CATGCTGCCG ACTATCTTTG GCTGGATCAA GCTTGGCCGA CGATTGCCGA GCATTCAAGC
GAACCTGTTG CTGCTTCGGC GGTTCCTTCT AACTTGATGT ATATCATTTA TACCTCTGGT
TCGACTGGCC AACCCAAAGG CGTTGGGGTC AGCCATCAAA GTCTTTATAA CTATATTTCA
AGTATTAGCC AACGGCTTAA TCTGCCGCCA CAGGCTAGTT TTGCCAGTGT CTCGACCTTT
GCCGCTGATC TTGGACATAC GGCGATTTTT CCAACCCTAA CAAATGGTGG CACGCTGCAC
CTTATTACTG CTGAGCGAGC GAGCAATGCT AGCCAATTGG CCGATTATAT GCAGCAGCAT
GCCGTTGATT GTCTTAAAAT TGTGCCATCG CATTTGGCGG CGCTCTTGGC TGTGGCTGAA
CCTGCGCGAG TCTTGCCACG TCAGCGCTTG ATTCTGGGCG GCGAGGCGGT TAGTTGGAAG
TTGCTACAAA CCTTGGCGCT ACTTGCACCT GATTGCCAAG TATTCAACCA CTACGGACCG
ACCGAAACAA CGGTGGGTGT GCTAACCAAT CCACTGAGTG CGAATTTGCC AAGTGCTCAA
TCGGCAATAC CAGCCTTGGG TCGTCCCATC GCTAACACCC AAATCTATCT GCTCGATGTT
CACGGTCAGC CTGTGCCATT GGGTATGACT GGCGAGCTGT ATGTGGGGGG CGCGGCGCTG
GCACGTGGTT ATTGGCAGCG ACCTGCGATT ACGGCTGAGC GGTTTGTGCC CGATGGCTTA
AGTGGCCAAA CTGGCAGTCG CTTGTATCGT ACTGGCGATG TAGCTCGCTA TTTGCCTGAT
GGCAAACTTG AGTTTTTAGG CCGCGCTGAT GATCAGGTGA AAATTCGCGG CTTCCGGATT
GAATTGGGTG AAATTGAAGC AGCGTTGCGT AACCACACGG CGATTGAACA AGCAGCGGTG
ATAGTGCGTG ATGATCCTGC TGGCGATAAG CGTTTGGTAG CTTATTTGGT TGCAGGCCAA
CAACGCCCAC TTTCGTTACG CGAGTTGCGT AACTTTTTGA AACAGAGCCT GCCCGATTAC
ATGGTTCCGG CGGCATTTGT GATGTTGGAA CGACTGCCAT TGAACGCTAA CGGCAAACTT
GATCGTCAAG CCTTACCAGC GCCTGAACAG CAACAAACCA AGGCTAGCAC TCAGATTGTG
GCTCCACGAA CCCCCGTTGA GACAACATTG GTTGACATTT GGAGCCAAGT GTTGGGTGGC
AAGTCGGTGG GCATTAACAA TAACTTTTTT GAGCTGGGCG GCGATTCAAT TCTGAGTATT
CAAATTATTG CCCGCGCCAG CCAAGCAGGC CTTAAGTTGA CACCCAAGCA ATTATTTGAT
CATCCGACAA TTGCCGATTT GGCGCAAGTG GTGGCCACCA CAACAGCAGA TCAACAAGCT
CAGCAGCAAT TGATAACTGG CCCTGTGCCG TTGACTCCGA TTCAACATTG GTTTTTTGAG
CAAGCCCTCG CCGAGCCGCA GCATTATAAT CAAGCAGTCT TTTTCGAAGT GCGCTTTGAT
CTCGATCCGG CGATTTTGGC TCAGGTGTTG CCTGAACTTG TGCGCCATCA CGATGCCTTG
CGCCTACGAT TTAGCCCGAG CGAACAAGGC TGGCAACAAG TTAATAGTGC CGATGTAGCG
GTCGAGTTGC TGCACATTAA TCTGGCCGCT GCGCCAGCCG AGCAGCAGCG CCAGTTGATG
GAGCAAAAAG CCACTGAACT GCAAACTAGC CTTGATCTGA TCAATGGCCC GTTGTTGCGT
ATGGCCTTGT TTGAGCTTGG ATCCAATCAA CCAAGTCGCT TGTTGGTGAT TGTGCATCAC
TTGGCGATTG ATACTGTCTC GTGGCAGATT TTGTTTGCCG ATTTACCGCT GGTATACGAG
CAAATTCGTC AGCAACAACC AATTAATTTA CCAGCCAAAA CCAACTCATT CAAAGATTGG
GCTGAGCGCT TGCAACGCTA TGCAGGTTCG GCTGAACTTG AGCGCGAAGT TGCCTATTGG
CTTGATCCTA CCCGCCAACA GGTTCGCCCA CTACCAGTCG ATTATGCTGC TGAAGCCCAT
GCCAATACGG TTGCAAGCAC CCAAAATCTG AGCCTACATT TGAGTGTTGA GGAAACGAAG
GCCTTATTGG AAGTGGTTCC TCCGGTCTAT AACACCCAAA TTAACGATGC GCTATTGGCA
GCCTTAACCC AAAGCATCAG TCAATGGCAG GGCAATCCAA GCGTGTTAGT CGAGCTCGAA
GGCCATGGTC GTGAAGATAT CTTGGATGAT TTGGATATTT CGCGCACGGT TGGATGGTTT
ACTAGTCGCT TTCCGGTGTT GTTGCAAGCG AGCAAATCAG CCAATGCTGG CGATAGCCTA
CGCGCAACTA AAGAACAGTT GCGCCAAATT CCACAGCGCG GGATTGGCTA TGGTTTATTG
CGCTATTTGC GTGGCGATGC CCAGCTGAGC CAGCAATTAG CCAATCTGCC CCAACCGCAA
CTGAGCTTTA ATTATTTGGG CACGGTCGCC CACGATGTTT CGCAAACTGG TCCATTGGCT
TGGACGAGCG AATCGAGTGG GCCAACCCGT AGCCCCGCAG CCTTACGCCG CCATTACCTT
GATCTGACGA TCTTGGTAAC CGATCACATG TTACAGATGA ATTGGACATA TAGCCAAGCA
TTGCATAGTG CAGCGACGAT TCAGCGCTTG GCTGAACGTT TTGTGTCCGC CTTACAAGCG
ATTATTCAGC ATTGCCAACA ACCCAATGCT GGTGGTTATA CCCCTTCGGA TTTTCCATCG
GCCAACTTAA ACCAAAAGAA TTTGGATAGC TTTATCGCCA AATTACGCAA CAGCGAGAAC
AGCACTCATG AAAGTTGA
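
The gene sequence above is printed in space-separated blocks of 10 bases. A minimal Python sketch of how such block-formatted text might be normalized and its GC fraction computed is given here. The function names are hypothetical, and the record does not say how its own GC content figure was calculated or rounded, so this is only one plausible way to derive a comparable number.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, in the range 0.0 to 1.0."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used as a small demonstration input.
first_line = "ATGTCGGATG CGATGATCGA GGGATTTCGG CTTTCCCCAC AGCAACAGCA TGTTTGGGCG"
seq = normalize_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `normalize_sequence` should give a string whose length matches the listed gene length, which is a quick way to confirm that nothing was lost when copying.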
 
Protein sequence
MSDAMIEGFR LSPQQQHVWA LQQLDSAQPY RTQGTIVIEG SLDIARLQAS LLQVVQRYEI 
LRTTFHYLQG MALPLQVVTE LSELALPSYD LSEHSLTDLQ SQLAQQAFDF AAGPLLHAWL
GREHALKHYL LLSVPTLCAD NLSLVNLTSE LAAIYAAEPT TDEPMQYIDI AEWQHELLEA
EETAAGRSYW QQQTWHDAIT VRPAFVANQA AAQTFQPQQL PIPLTEQLIE LLNQASQTLA
VPASALALAA WRTLLWRLSD QTNGVVGVVC DGRKYAELET TLGLFAKAVP LASPLAADQQ
FGQLVKQVQQ EYVEAYAWQE SFHWPAEVSS QLAFFPFGFE ALTTPKPLVM ANLSFQVVQQ
RATLNPYAAH LTWFNQPQGF AASLNFDAGL IASSSAERLI EQYQTLLTAA LSNLNTSLAQ
LPIVGTNERQ QLLIEFNQTA EPFDAARCFH ELFSAQAAIT PDHPAVVVED QQLSYAELEA
RSNQLARELL ARGVRPDQPV ALALDRSLNL LVGILGILKA GGAYVPLDLG LPKERLGFML
GDIQASIVVS ETSLQAQLPE HAADYLWLDQ AWPTIAEHSS EPVAASAVPS NLMYIIYTSG
STGQPKGVGV SHQSLYNYIS SISQRLNLPP QASFASVSTF AADLGHTAIF PTLTNGGTLH
LITAERASNA SQLADYMQQH AVDCLKIVPS HLAALLAVAE PARVLPRQRL ILGGEAVSWK
LLQTLALLAP DCQVFNHYGP TETTVGVLTN PLSANLPSAQ SAIPALGRPI ANTQIYLLDV
HGQPVPLGMT GELYVGGAAL ARGYWQRPAI TAERFVPDGL SGQTGSRLYR TGDVARYLPD
GKLEFLGRAD DQVKIRGFRI ELGEIEAALR NHTAIEQAAV IVRDDPAGDK RLVAYLVAGQ
QRPLSLRELR NFLKQSLPDY MVPAAFVMLE RLPLNANGKL DRQALPAPEQ QQTKASTQIV
APRTPVETTL VDIWSQVLGG KSVGINNNFF ELGGDSILSI QIIARASQAG LKLTPKQLFD
HPTIADLAQV VATTTADQQA QQQLITGPVP LTPIQHWFFE QALAEPQHYN QAVFFEVRFD
LDPAILAQVL PELVRHHDAL RLRFSPSEQG WQQVNSADVA VELLHINLAA APAEQQRQLM
EQKATELQTS LDLINGPLLR MALFELGSNQ PSRLLVIVHH LAIDTVSWQI LFADLPLVYE
QIRQQQPINL PAKTNSFKDW AERLQRYAGS AELEREVAYW LDPTRQQVRP LPVDYAAEAH
ANTVASTQNL SLHLSVEETK ALLEVVPPVY NTQINDALLA ALTQSISQWQ GNPSVLVELE
GHGREDILDD LDISRTVGWF TSRFPVLLQA SKSANAGDSL RATKEQLRQI PQRGIGYGLL
RYLRGDAQLS QQLANLPQPQ LSFNYLGTVA HDVSQTGPLA WTSESSGPTR SPAALRRHYL
DLTILVTDHM LQMNWTYSQA LHSAATIQRL AERFVSALQA IIQHCQQPNA GGYTPSDFPS
ANLNQKNLDS FIAKLRNSEN STHES