Gene Haur_2413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2413 
Symbol 
ID5734294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3081398 
End bp3086107 
Gene Length4710 bp 
Protein Length1569 aa 
Translation table11 
GC content51% 
IMG OID641279554 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001545181 
Protein GI159898934 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATT TTGCTGCAAA AATTGCTGCA TTACCACCAG AAAAACAAGC GCTGTTGATT 
CGCCGACTGC AACAAGCAGC CAACGAGCCA CCAGCCTTGG TTGCCCAACC ACGGACAACC
AATACGCTGC CGTTATCGTT TGCCCAAGAG CGCCAATGGG TGTTGTATCA GTGGGACCCA
ACCAGCCCGC TGTATAACAT CGTTTATGGG GTGCGCTACC GTGGCAAGCT TGATATTGCC
GCCTTGCAAG CAGGCTTCAA CACCATTGCC CAGCGCCATG AGGTGTTGCG CACGACCTTT
TTGCTGGTTG ATACAGTGCC CCATCAGCAG ATCCATGCTG AGCTTAAGCC AGGGTTTAGC
GTGGTCGATT TACGCGATCT GGCCGAAACT GAGCGCGATA CCGCCATCCA AGCGCAGATT
CAAGCCGAAA CCCAGCTGCC GTTTGATTTG CAAACTGGGC CATTATTGCG CGTGCTGTTG
CTGCACATCC GCGATCACGA ATATATCAAA TTGGTCAGTG TGCATCATAG TGTTTTTGAC
GGCTGGTCAG CTGGAGTGAT CATCTCCGAA TTGAATCATT TGCTGAATGC GGCCTATGCT
GGCGAGCCAA GCAGCTTGCC AGCCTTGCCA ATTCAATACG CCGATTATGC GGTTTGGCAA
CGCAATTGGC TACAAGGTAA GGTGTTGGAG CAGCAACTGC AATATTGGAA AGAACAACTC
GCTGGCGAAT TGCCAATTCT GCAATTACCG ACTGATCGGC CTTATCCACC AGTTGAATCG
TCGCGTGGCG CACATTATCG GGTGCAATTG AGTGCCGATT TGGTTGAACG ATTGGTCAAT
TGGAGTCGCA GCGAGGGCTA TACACTCAAC ATTATTTTGC TCACCATCTG GAAAACCCTG
CTCTTTCGTT ACACCAACCA AACCGATTTG CTGGTCGGCA TGCCGATCGC CAATCGCCAT
TACAACGATT TGCAAGCCTT AATCGGCTAT TTCGTCAATA CCTTGGTGAT TCGCACCAAA
GCCGCTGGCG ACCTCAGTTT CCGCAGCTTT TTGGATCAAG TGCGAGCGGC GAATTTGGCA
GCGCAAGAGC ATCAAGATTT GCCATTTGAG CAATTGGTCG AGGCTTTGCA GCCTGATCGT
AATTTGGCAC ATACGCCAAT TTTTCAAAGC TTGTTTGTGT TTCAGCGCGA TACGGTCAAT
AGTTTTCAAA TGCCTGAGCT TTCGGTTGAG CCATTGCCAA TTGAAACTGG TACGGCCAAG
TTTGCCCTTA GCCTTGAAGC CGTCTCGCTT GATCAGCAAA TCAAGCTCAA TTTCGAGTAT
AAAACTGATC TGTTTGATCC AGCGACGATT GAACGCTTGG CCCAACATTA CCAAAATTTA
CTCGAAGCAG TGCTTGCTTC GCCCGATTTG GAGCTTTCGC GCTTACCCAT GTTGGGCAAG
GCTGAGCTAG CCCAATTGTT GCCAACGCCA ACAGCGCTCG AAGCAAACTT GTTGCCCTTG
CACCAACGCT TTGAGCAGCA GGTGCAGGCC AATCCGCAAG CGATTGCGGT ACGCTTTGAG
CAAAGCCAAC TAAGCTACGC CGAGCTAAAT AGCCGCGCCA ACCAACTGGC CCATCAACTC
AAAACGCTCG ACGTTGGCCC AGATACCTTG GTTGGGCTAT GTGTTGAGCC ATCGCTCGAT
ACAATCATTG GAATTTTGGC AATTCTCAAG GCTGGCGGCG CATATCTGCC GATCGATCCT
GCACATCCTC AAGAGCGAAT TGTTTGGTTG TTGGCTGATG CCAAGGTTGG CTTGGTGGTT
ACCCAAGCTC GTTGTGTCAA CAAATTACCC CAAGCTGGGT TGCAATTGAT TGTGCTTGAT
GCCGTCGATT CAGCGCTGAG CAATCAGCCA ACCAGTAATC TGCCAGCTAG CGCCCAGCTC
GATGACTTGG CCTATATGAT TTACACCTCA GGCTCGACTG GCACGCCCAA AGGTGCATTG
ATCACCCATC GCAACGTGGC GCGACTGTTT AGTTCAACCG AAGCATGGTT CAACTTCAAC
AACCACGATG TCTGGAGTTT GTTCCATTCG TTCGCCTTCG ATTTCTCAGT TTGGGAAATT
TGGGGAGCCT TGCTGTATGG TGGTCGCGTG GTCGTCGTGC CATTTATGAC CACCCGCAAC
CCCGCTGGCT TCTATCAATT GCTGGTTGAT GAAGGCGTGA CGGTGCTCAA CCAAACGCCC
TCGGCCTTCC GCCAATTGAT CATCAGCGAT GCTGAACATG ACTTGCCATC GCGCCTAGCC
TTGCGTTATG TCATCTTCGG TGGCGAAGCA CTGAACGTTG GGGCATTGCA ACCATGGTTC
GAGCGCCATG GCGATCTACG CCCGCAGTTA GTCAATATGT ATGGCATTAC TGAAACCACC
GTCCATGTGA CCTACCGACC GCTGAGCATG CACGATGTTG AAAATCCCCA AAGCAGCCCG
ATTGGCACGG CAATTCCCGA TTTGGATCTG TATGTGCTTG ATGATCATTG TTTGCCAGTG
CCATTGGGAA TTACTGGCGA ACTATATGTG GGCGGCGCGG GCTTGGCTCG CGGCTATTGG
AATCGACCCG AACTAACCAA CGAGCGCTTT ATCAAGCATC CCTTTGCTGA AACAGGCCGC
CTCTATAAAA CTGGCGATTT AGTGCGGCGC TTGGCTAATA ATGAGATCGA ATATCTGGGG
CGGCGTGACA ACCAAGTCAA AATTCGCGGC TTCCGAATTG AGCTAGGCGA AATTCAGGCC
ACCCTGATGA GCCACCCCGC GATCACCGAT GCGATTGTGG CGGTCAATAC AATCTCAGCC
GATGATCAGC GCTTGGTGGC CTATTTGGTA ACCCAGCCTA ATCAAGTGCC ACGCTTTAGC
CAATTGCGCA CATTTCTCAA GCAACGCCTG CCAGAATATA TGGTGCCAAC CTCATTCATT
ATGCTTGAGC GTATTCCGCT GACTGCCAAC GGCAAAATCG ATTATCGCGC TTTGCCCAGC
CAACAACAGA CCAAACAACT TGAGCGCAGC CAACCGATCG CGGCTCCAAC CAGTGTTACC
GAGCAATCGT TGATCGCGAT TTGGAGCAAT TTGCTGGGAG TAACCCAAGT TGGCATTCAG
GATAATTTCT TTGATTTGGG TGGGCACTCA TTGTTGGCAA CCCAAGTTAT TTCACGGGTG
CGCGAGGTGT TTAATGTTAG CCTCAACTTG CGCGATTTCT TCCTCAACCC AACCATCAAG
GGCTTAGCCA GCGTTATCGA GCAGGCCAAC CAAAAACCAG AGCAAACGCC AATTATCGCG
ATTCACAAAG CTGATCCGCA GGCACGTTTG CCACTTTCCT ATGCCCAACA GCGCATTTGG
TTCTTGGCTC AAATCGATCC AACCAGCAGT TTTTATACTG TCCCTGTGGC CTTAGAAATT
ACTGGCCCCT TGCAAGTTGC CGCTTTAGAA CGCACCTTGA GCGAGATTGT GCAGCGCCAT
CATGGCTTGC GCACCCTGTT TGTGAGCCAC GAAGGCCAAA CGCTTCAGCA AGTCCAAGCA
GCTCAGCCAA TTCAATTGCC AATTCATGAT CTCAGTGATT TAGCTGAGGC AGCCCAAGAA
ACAGCGATCA ACCAATTATT GGAGCAGGAG ATTAATCAAC CCTTCCAGCT TGATCGCGAT
CAACTGTTGC GCGGGCGCTT GCTCAAACTA GCTCCAACCA AGCATGTGCT CGGCCTAAGC
ATTCATCATA TTGCCTTTGA TGATTGGTCG CAGGGCATTT TGTTTGATGA AATGACCAAG
CTCTACCAGG CCTTTGCCAA CAATCAAGCC TCGCCATTGC CCGAGCTAGC TTTGCAATAC
CCTGATTTTG CCGCTTGGCA ACGTCAATGG TTGCAAGGCG AAGTTTTACA AAACCAATTG
AATTACTGGA AACAGCAATT GAGTGGCAAA TTACCATTGC TCGAAATGCC GCTGGATTAC
CCACGGCCTA GCGTACAAAG TTTCAATGGC GCACACGAAA TGCTGAGCAT CGAGCCAGAA
CTGTATCGCA AACTTAATCA ATTGGCGCGA GATAACGAAG CAACCATGTT TATGTTGCTG
ATGGCAGCAT GGGCGATTTT GCTCAATCGT TATAGCAATC AAACTGATAT TGTTATCGGC
ACGCCAATCG CCAACCGCAA CCGCCAAGAG CTAGAGGGCA TCATCGGGTT CTTTGCGAAT
ACCCTGGCCA TTCGCGCCGA TTTAACTGGC GATCCAACGC TTCAACAAGT GGTGCAACGG
ATTCGCGAAA CCGCCCTAAA TGGCTATAGC CACCAAGATT TACCGTTCGA TTTGTTGGTC
AGCGAGCTTT TGCCTGAACG ACAAGCCAAT CGCTCACCAA TTTTCCAAGC TATGTTGGTG
CTGCAAAACG CCTCACAGAG CAGCAACCTC GATTTGGCCG AGGTGCAGAT TGCGCCGCGC
AGCGTTGAAA CCAACTCGGC TAAGTTTGAG CTAAGTTTGG TGCTCTACGA CAACGGCTCA
AGCATCGACG CTTGGATTGA ATACAACACC GACCTGTTTA AACCCACCAC GATTGGCCGC
ATGGTCGAGC AGTTACAACA ATTGCTCACC AGCATGACCA CCGATCCACA ACAACGCATC
AGCCAAGTTT CGTTGGTCAG TTCGGCAGAA AAAGAACGGC TACTCGGCGG CTGGTCGCAA
GGAAGCGACG ACGACTACGA TTTGTTCTAG
 
Protein sequence
MTDFAAKIAA LPPEKQALLI RRLQQAANEP PALVAQPRTT NTLPLSFAQE RQWVLYQWDP 
TSPLYNIVYG VRYRGKLDIA ALQAGFNTIA QRHEVLRTTF LLVDTVPHQQ IHAELKPGFS
VVDLRDLAET ERDTAIQAQI QAETQLPFDL QTGPLLRVLL LHIRDHEYIK LVSVHHSVFD
GWSAGVIISE LNHLLNAAYA GEPSSLPALP IQYADYAVWQ RNWLQGKVLE QQLQYWKEQL
AGELPILQLP TDRPYPPVES SRGAHYRVQL SADLVERLVN WSRSEGYTLN IILLTIWKTL
LFRYTNQTDL LVGMPIANRH YNDLQALIGY FVNTLVIRTK AAGDLSFRSF LDQVRAANLA
AQEHQDLPFE QLVEALQPDR NLAHTPIFQS LFVFQRDTVN SFQMPELSVE PLPIETGTAK
FALSLEAVSL DQQIKLNFEY KTDLFDPATI ERLAQHYQNL LEAVLASPDL ELSRLPMLGK
AELAQLLPTP TALEANLLPL HQRFEQQVQA NPQAIAVRFE QSQLSYAELN SRANQLAHQL
KTLDVGPDTL VGLCVEPSLD TIIGILAILK AGGAYLPIDP AHPQERIVWL LADAKVGLVV
TQARCVNKLP QAGLQLIVLD AVDSALSNQP TSNLPASAQL DDLAYMIYTS GSTGTPKGAL
ITHRNVARLF SSTEAWFNFN NHDVWSLFHS FAFDFSVWEI WGALLYGGRV VVVPFMTTRN
PAGFYQLLVD EGVTVLNQTP SAFRQLIISD AEHDLPSRLA LRYVIFGGEA LNVGALQPWF
ERHGDLRPQL VNMYGITETT VHVTYRPLSM HDVENPQSSP IGTAIPDLDL YVLDDHCLPV
PLGITGELYV GGAGLARGYW NRPELTNERF IKHPFAETGR LYKTGDLVRR LANNEIEYLG
RRDNQVKIRG FRIELGEIQA TLMSHPAITD AIVAVNTISA DDQRLVAYLV TQPNQVPRFS
QLRTFLKQRL PEYMVPTSFI MLERIPLTAN GKIDYRALPS QQQTKQLERS QPIAAPTSVT
EQSLIAIWSN LLGVTQVGIQ DNFFDLGGHS LLATQVISRV REVFNVSLNL RDFFLNPTIK
GLASVIEQAN QKPEQTPIIA IHKADPQARL PLSYAQQRIW FLAQIDPTSS FYTVPVALEI
TGPLQVAALE RTLSEIVQRH HGLRTLFVSH EGQTLQQVQA AQPIQLPIHD LSDLAEAAQE
TAINQLLEQE INQPFQLDRD QLLRGRLLKL APTKHVLGLS IHHIAFDDWS QGILFDEMTK
LYQAFANNQA SPLPELALQY PDFAAWQRQW LQGEVLQNQL NYWKQQLSGK LPLLEMPLDY
PRPSVQSFNG AHEMLSIEPE LYRKLNQLAR DNEATMFMLL MAAWAILLNR YSNQTDIVIG
TPIANRNRQE LEGIIGFFAN TLAIRADLTG DPTLQQVVQR IRETALNGYS HQDLPFDLLV
SELLPERQAN RSPIFQAMLV LQNASQSSNL DLAEVQIAPR SVETNSAKFE LSLVLYDNGS
SIDAWIEYNT DLFKPTTIGR MVEQLQQLLT SMTTDPQQRI SQVSLVSSAE KERLLGGWSQ
GSDDDYDLF