Gene Haur_2415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2415 
Symbol 
ID5734296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3091811 
End bp3095194 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table11 
GC content51% 
IMG OID641279556 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001545183 
Protein GI159898936 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGATT ATTCTGAGCG TTTAGCGGCA CTTTCGCCAG CCAAACGGGC CTTACTTTTG 
CAAAAAATTC AAGACAAAGC CAATAAAGCC GCCCAACAGA TTCAGCCAAG GCTTGATCAA
AACAGCTATC CGTTGTCGTT TGCTCAACAA CGGCTCTGGC TGATCGAGCA ACTTCAGCCC
AACGCGGCTT TATACAACAT TTCAATTCCA ATTCTGATTC GCGGCGTTTA CCCTGAATTG
CCGAGCATTT TGACCCAGTG TTTGACCACG ATTTTCCAAC GCCATGAGCC ATTACGTGCC
AGTTTCAGCA TGGTCGATGG CCTTCCCCAG CAATCAATTA TGGCCTTGGA AAGCGTCGAA
TTACCGATTG TTGATCTACG TGAATTAACC CCAAGCGAGC GTGAAGCCGC CGTACATCAA
TTTATGCAAG CTGAAACTGA ACGCCCGTTT GATTTGCGTC ATGATCAACT GTTTCGGGCC
AATTTGCTAC GCATCAGTCC CGACGAGCAT CTTTTATTGC TAATCATTCA TCACATTGCC
TTTGATGCGT GGTCAGTCAA GATCTTTATC CAAGAATTTG AGGCGATTTT TACGGCCTTG
GAAAATCAAC AGCCGTTGCC GCAATTCGAG CCATTGGCCT TGCAATATGC CGATTTTGCT
GCTTGGCAAC GCGAATGGTT GCAAGGTGCA ACTCTCAATC AACAACTCAG CTATTGGAAA
AACCAGCTTG CTGGTGAATT GCCAGTGCTG GCCTTGCCAA CCGATCGCCC ACGCCCAGCC
ATTCAAACCT TCAAAGGCGG TCGCCATACC TTTTGGATCA GCACCGAACT AACCAATCAA
CTCAATCAGC TCAGCCAACA GCACCAAGCA ACTCTATTTA TGCTGTTGCT CGGCGCATGG
GTCAGCTTAC TCCATCGCTA CAGCGGCCAA AACGACATCA TTGTTGGTTC ACCGATCGCC
AACCGCAATC GGCGTGAGCT AGAAGATCTG ATCGGCTTTT TCGTCAATAC CTTGGTGCTG
CGAGTCAAAT GTGCTGATGA TCCAAGCTTC CTCGATTTGC TGGAGCATGT GCGTGCAGTA
GCGCTGGGAG CCTACGCCCA CCAAGACTTG CCGTTTGAAA TGTTGGTCGA TGCCCTGCAA
CCAACTCGCG ATCTGAGCCG TTCAGCGCTC TTTCAGGTGT TGTTTGTGCT GCAAAATGTT
TCGATCGGCG GTCAGCATTC AGCCTTCGAT ATTTTGGAAG ATGTGGCTAG CCCTTCAAAA
TTCGATCTCA CACTGTCGAT GTTTGAGTTT GAGCAGGGTT TACGCGCAAC AATTGAATAC
AATGTTGATC TGTTTGATGC CAGCACGATC GACCGCATGA GTCAACATTA TGTCACATTG
CTCAGCAGCA TCAATCAACA ACCGCAAACC AAACTTTCAC AATTGGCAAT GCTCACGGCG
GCTGAGCAAA CCCAAATCAT CAAAACTTGG AACAACACCA GCCAAACCTA CCCTGAGCAG
CTTACATTTG CCCAACTATT TGAGGCTCAA GTTGCCAAAA CACCTGAAGC AACTGCCCTG
ATTGGCGAAG ATCAGGTGCT CAGCTATCAC GAACTTAATC GCCGTGCCAA TCAATTGGCC
TATCGGTTGC AAGCCCAAGG TGTTGGCCCC GAAAGCTTGG TTGGCATCTG TTGCGATCGC
TCAATTGCGA TGGTGGTTGC GTTATTGGCC ACGCTCAAAT CCGGCGGCGC GTATATCCCG
CTTGATCCAG CTTACCCCAA CGAGCGTTTG GCTTGGATGC TCAACGATTC GCAAGCAGCA
CTAGTTTTGA CCCAAAGCCA TTTGCTTGAA AAAGTACAAC AACTCAAGCA GGCTGATTTA
ACCGTGCTTG ATCTGGCGAA GATTTGCGAT GGCAATGAGC CAAGCCAAAA TCTTGTCAGT
GCTGTTCAGC CCGCCAACTT AGCCTATATC GTCTATACCT CTGGCTCAAC TGGCCAGCCC
AAAGGCGTGA TGGCTAGCCA GCAAGGCTTA ACCAACTTGG TAACAGCCCA AATTGCCGGC
TTTGGGGTAA CCTCAGCAAG TCGGGTGTTG CAATTTGCTT CGTTCAGTTT CGATGCGGCG
ATCTCCGAGA TTGGCATGGC GTTGGCTTCA GGCGCAAGTT TGGTGCTCAT GCCAGCAGGT
GGCCTAGCCG CCGGAACCGA TGTGCTAGCA TTGATTCGCC AACACAACAT TACGGTAGCG
ACCTTGCCAC CATCATTATT AGCAGTGCTT TCAGCGGATC AAGCTCCCAG TTTGACCACA
GTAATTGCTG CTGGCGAGGC TTCGAGCAAT GAGGTTGTGC AACGCTGGGC AGTTGAGCGC
AATTTAATCA ACGCCTATGG GCCAAGTGAA ACCACAGTTT GCGCGAGCCT GACTCGGCTT
GAGCCAAATC TGGCAGGTAC GCCACCGATT GGCCGACCAC TAGCCAATTT ACAAGTCTAT
TTGCTCGATC AACAGCAGCA GATCGTGCCC GTTGGCGTGA TTGGCGAAAT CTATGTTGGC
GGGGTTGGTG TAGCCCGTGG CTATCTGAAA CGGCCCGCTC TGACCGCCGA GCGCTTTATT
CCCAACCAAT TTAGCTCAAC TCCAGGCCAA CGTTTGTATC GCACTGGCGA TTTAGGCCGC
TATCGGGTTG ATGGCCAAAT TGAATTTGTT GGGCGGATCG ATCAGCAGAT CAAATTACGC
GGCCATCGGA TCGAGCTAGG CGAAATTAGT AGCCTGCTGA ATGCTCACCC AGCAGTCGAG
CAAAGTGTGG TGTTGGTGCA CGATCACGCT AGCAGCACGG CGCGGCTGAT TGCCTATGTT
GTGGCAAATA GCACCAGCAG CAATGCTCTC AGCGAATTGC AAATCGCCGC AGGAACCAGC
CAAGCACCAG CAAGCTACGA TTTAGCGGCA GATTTACAAG CCTATGCCAA GCAAAAACTG
CCAGCATTTG CCGTGCCTAG TGCCTTCGTG GTGTTGCCAA GCATGCCACT AACGCCCAAC
GGCAAGATCG ATCAGCGCAA ATTGGCCGAG CACACCCCCA ACCCAAATCC AGCTACGAGT
GTTGATGTTG TACCCCAAAC CAACCTTGAA CAAACCTTGA CCACACTTTG GCAAGAAGTA
TTAAATGTGC CAACCCTAAG CACCCAAGCC AATTTCTTCG ACCTTGGCGG CAATTCGCTG
GCAATGGTGC AGGTGCATAG CCGCTTGCAA GAGCTTTTGG GGCGCGAGTT GGTGCTTTTA
GATTTATTCA AATACCCAAC CATTCAAAGT TTAGCGGTTT ACTTGAACAG CGAGCAATCA
ACTCAAGCGT CAACCTTTGT TGATCGCGAC AATCGAGCGC AGCAACAGCG CCAAGCTATG
CAACGTCAAC GTCGCCGCCG CTAG
 
Protein sequence
MSDYSERLAA LSPAKRALLL QKIQDKANKA AQQIQPRLDQ NSYPLSFAQQ RLWLIEQLQP 
NAALYNISIP ILIRGVYPEL PSILTQCLTT IFQRHEPLRA SFSMVDGLPQ QSIMALESVE
LPIVDLRELT PSEREAAVHQ FMQAETERPF DLRHDQLFRA NLLRISPDEH LLLLIIHHIA
FDAWSVKIFI QEFEAIFTAL ENQQPLPQFE PLALQYADFA AWQREWLQGA TLNQQLSYWK
NQLAGELPVL ALPTDRPRPA IQTFKGGRHT FWISTELTNQ LNQLSQQHQA TLFMLLLGAW
VSLLHRYSGQ NDIIVGSPIA NRNRRELEDL IGFFVNTLVL RVKCADDPSF LDLLEHVRAV
ALGAYAHQDL PFEMLVDALQ PTRDLSRSAL FQVLFVLQNV SIGGQHSAFD ILEDVASPSK
FDLTLSMFEF EQGLRATIEY NVDLFDASTI DRMSQHYVTL LSSINQQPQT KLSQLAMLTA
AEQTQIIKTW NNTSQTYPEQ LTFAQLFEAQ VAKTPEATAL IGEDQVLSYH ELNRRANQLA
YRLQAQGVGP ESLVGICCDR SIAMVVALLA TLKSGGAYIP LDPAYPNERL AWMLNDSQAA
LVLTQSHLLE KVQQLKQADL TVLDLAKICD GNEPSQNLVS AVQPANLAYI VYTSGSTGQP
KGVMASQQGL TNLVTAQIAG FGVTSASRVL QFASFSFDAA ISEIGMALAS GASLVLMPAG
GLAAGTDVLA LIRQHNITVA TLPPSLLAVL SADQAPSLTT VIAAGEASSN EVVQRWAVER
NLINAYGPSE TTVCASLTRL EPNLAGTPPI GRPLANLQVY LLDQQQQIVP VGVIGEIYVG
GVGVARGYLK RPALTAERFI PNQFSSTPGQ RLYRTGDLGR YRVDGQIEFV GRIDQQIKLR
GHRIELGEIS SLLNAHPAVE QSVVLVHDHA SSTARLIAYV VANSTSSNAL SELQIAAGTS
QAPASYDLAA DLQAYAKQKL PAFAVPSAFV VLPSMPLTPN GKIDQRKLAE HTPNPNPATS
VDVVPQTNLE QTLTTLWQEV LNVPTLSTQA NFFDLGGNSL AMVQVHSRLQ ELLGRELVLL
DLFKYPTIQS LAVYLNSEQS TQASTFVDRD NRAQQQRQAM QRQRRRR