Gene Haur_2090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2090 
Symbol 
ID5733978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2606263 
End bp2609703 
Gene Length3441 bp 
Protein Length1146 aa 
Translation table11 
GC content52% 
IMG OID641279231 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001544858 
Protein GI159898611 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTATT CAAATATTAC CCAACTCGTG ACCGCTCAAG CCAACCAAAC GCCTGCCGCT 
TGGGCAGTGC AAACGCCTAC AGGTTATGGA TTAACCTTTG CCGATCTTGA GCAACAATCT
AGCCAAGCAG CGGCCTATTT GCAACATCTT GGTGTACAAC CAGCGAGTGT TGTGGGCATT
TGTTTGCGCC GCACGCCACA GCTCATCGTG TGGATGCTGG CAATTCTCAA GGCTGGTGCG
ACCTATCTGC CGCTTGATCC GGCCTATCCA ACCGCGCGGT TGCAATTTAT GCTGGCCGAT
GCCAAGGCCT TGCTGGTCGT CAGCGAAACG TCATGCCAAG CAGCTTTACC CCTGAATACC
ATTGAGTGGG TGTTGATTGA TCAGCCTTGG TCGAGGGAGT TGGCATGGCG CGAACCCTTC
TATCATAGCG CAATCCCTGC TTATATCATC TATACCTCTG GCTCAACCGG CCAACCCAAA
GGTGTGCTGA TTAGCCATGC CAATGCGCTC ACCTTTTTAG CATGGGCTGA AACCACGTTT
AGCGTAGCCG AACGCGCTGG AATTTTAGCG GCAACCTCGA TCAACTTTGA TCTTTCGATC
TTCGAGATTT TTCTGCCATT GATTAGTGGC GGTACGTTGG TGCTGGTGGA AAATCTGCTT
GATCCAGCGC TGTTTCACTC GCAGCATCCG ATTTGTTTGA TCAATAGCGT GCCGTCGGCG
GTGCAAACGT TGTTGCAACA TACGGCACTT CCATCTAGCG TGCTCACGGT GAATCTTGCC
GGCGAGCCGC TGAGCTTGCG ATTAGCGCAG CAACTCTATC AACAGCCAAA CATCCAGCGC
GTATTCAATT TATATGGGCC AACCGAGGCC ACGACCTATG CCACCTACCA GCTCGTTGAG
CGCACTGCCA GCCGGCCGCC AGCGATTGGT CAGCCGCTCA CTGGCACGAC CTGCGTTATC
CTCGATGCGC ACTATCACCC TGTTGCCGCT AAGGATGTTG GCGAATTGTT TATTGCTGGG
CTGGGCGTAG CGCAAGGCTA TTTGCAACGC CCCGATTTAA CTGCCGAACG TTTTTTGCCC
AATCCGTGGG CTACCACGCC TGGCGAACGA ATGTATAAAA CTGGCGATTT GGCTCATTGG
AACGCGGCAA ACGAGCTTTG TTACCTAGGA CGTAACGATC AGCAGGTCAA AATTCGTGGC
TTTCGGATCG AGCTTGGTGA GATTGAGGCC CAGATTCTGC GCTTAGCACC ATTGCAAGCG
GTTGTGGTTC AGCCAATTAC GCTGGTGGCT GATGATCCGC AGTTGACCGC CTATCTGGTT
GCTAATCAGC CGATCGATTG CGAAGCCTTA CGCGCTAGCT TAGCCCACCA TGTGCCAAGC
TATATGCTAC CAAGTTTTTG GGTACAGCTG GCCGAACTGC CATTAACACC CAATGGCAAG
CTTGATCGAG CGGCCTTGCC ATGCCCTGAT GCCCCAATTA AACAACCATT GCAAAGCTCA
ACTGAACAGC GTTTGGCGAT AATCTGGCGC GAAATCTTGG GCGTGGAACA GCTTGGGCGT
GAGAGTAATT TTTTGCAGCT TGGTGGTCAT TCGCTCAATG TGATGCAAGT GCTCAAACGA
ATTGAGCAGA CGTGGCAGCT TCAGCTTTCG ATTACGCGCT TGTTTGAGCA ACCAACCTTG
GCAGCTTGGG CGCGGTTAAT CGATCAGCAG CAGCAAGCCT TTGCTCAGGC TGAACCTCAA
TTCTATCAGC GCACTACGCA ATTGCATCAG CTTTCGTTTG GCCAACAACG CCTGTGGTTT
GCCGAGCAAT TACACCCAAA CACCGCCTAC AACGTCATCC ACGCATGGCG CATCGATGCT
CTGCTTGATG CGGTTGCGCT TGAACAAAGC TGGCTAAGGC TGATCGAACG TCATGAAATG
TTACGCAGCA GCATTCAGCT CATCGCGGGT ATTCCACAAC AAACGATCAT GCTCAAGCCA
GTTTGGCAGC TCCAATCTGC GCCGCAGGCA AGCTTAGAAT ACTTGTTAAG GTTGCTTGAT
CGGCCATTCG ATTTGGCGCA AGCCCCATTG TTACGGGTTG GCTTGGCGCA ACACCACGAT
CACGCCATCA TGCTGGTAGT TATCCACCAT AGCATTATTG ATGCTTGGTC GTTGGGCGTG
CTGTGGGCTG AATTAAGTCA GCTGTATGCA AGCTTCTTCG AAAACCAACC AATTCAGCTG
CCAAGCCAAG CCTACGATTA CCTTGATTTT GTGGCTTGGC AGCGCCAGCA GCTTGATTCG
GCGTGCTTAG CCCAATTGCA AACCTACTGG CAAACCCAGC TTGCCCAGCT TGATCCGCTC
CCAGCTTTGG CGACCGATTA CCCCCGTTCG ACGCACATGC AGGGCTTGGG CATCAGCCAA
ACCTATCAAC TTGATCAGCA GGTTATCCAA GCATTACAAG GCTTGGCCAA CGCCAATAAC
GCTAGTTTGT TTATGCTGTT ACTGGCGGGA TGGGCGAGCG TGCTCTATCA ACGCACTCAG
CGCAGCGATC TGCTGATTGG CACGCTCAGC GCTGGCCGTG AGCATGCAGC GTTTGAGCGT
TGCGTTGGCT TTTTTATTAA TATCTTGCCC TTGCGCCTGC ATTGCGCCGC CGAGCAAACG
TGGCTTGATC TATTGCAGCA AACCCGCATG GTTGCCTTAC AAGCATACCA ACACCAAGCC
TTGCCATTCG AGCAGATTGT GGCCAACGTG GCTCATGAGC GCAACAACCA ACCGCAGATT
CCGCTAATTC AAAGTTTGTT GGTGTTGCAA AACGCGCCCA GTCAGCCCTT AGTTTTAGGT
GCGCCAGCCC AAGCCCTAGC TACGCCAATT CAGGCCAGCA AAACCGATTT GGTGCTGTTG
GTGCAACCCG CTGCAACTGG CTATCAACTG ACCCTAGAAT ATGCGAGCGA ATTGTTCGTC
GCTGAATCGA TTGCAGCGTT GGCCTCCGAT TTCCAAGCGG TTTTAGGCCA AATGGCGCAG
CATCCTACCA GCACACTTAG CGCTGTTCAG TTGGCTGGGC ATTGGACGGC GGAGCACTAT
TCCAACAAAT TACCCACGCT TCAGCCAATG GCCGCCCCGC CGCAAACAGC GCTAGAGCAA
ACCCTTGCCG ATATGTGGCA AGAGGTCTTG GGCTTGTCAA TTGATAATAT TCATGCCGAT
TTCTTCCGCA TGGGCGGCCA TTCACTCAAC GCCACCCAAG TTGTCTCGCG CATGCAACAG
CTTTTACAGG TAACTACAAG TATTCGAATG TTGTTCGATT ATCCAACGAT TGCCCAATTA
AGCCAGCATT TGCTGGCGAA TCAAGCTCAG GCAGAGCGAA TCAATAAAAT TGCCACCGCA
CTGCAACAGA TCAAAACCAT GAGTGCCAGC ACCAAACAGG CCTTGCAACA AAAGGCCGCA
GGAAGGATAA GCCAACCATG A
 
Protein sequence
MSYSNITQLV TAQANQTPAA WAVQTPTGYG LTFADLEQQS SQAAAYLQHL GVQPASVVGI 
CLRRTPQLIV WMLAILKAGA TYLPLDPAYP TARLQFMLAD AKALLVVSET SCQAALPLNT
IEWVLIDQPW SRELAWREPF YHSAIPAYII YTSGSTGQPK GVLISHANAL TFLAWAETTF
SVAERAGILA ATSINFDLSI FEIFLPLISG GTLVLVENLL DPALFHSQHP ICLINSVPSA
VQTLLQHTAL PSSVLTVNLA GEPLSLRLAQ QLYQQPNIQR VFNLYGPTEA TTYATYQLVE
RTASRPPAIG QPLTGTTCVI LDAHYHPVAA KDVGELFIAG LGVAQGYLQR PDLTAERFLP
NPWATTPGER MYKTGDLAHW NAANELCYLG RNDQQVKIRG FRIELGEIEA QILRLAPLQA
VVVQPITLVA DDPQLTAYLV ANQPIDCEAL RASLAHHVPS YMLPSFWVQL AELPLTPNGK
LDRAALPCPD APIKQPLQSS TEQRLAIIWR EILGVEQLGR ESNFLQLGGH SLNVMQVLKR
IEQTWQLQLS ITRLFEQPTL AAWARLIDQQ QQAFAQAEPQ FYQRTTQLHQ LSFGQQRLWF
AEQLHPNTAY NVIHAWRIDA LLDAVALEQS WLRLIERHEM LRSSIQLIAG IPQQTIMLKP
VWQLQSAPQA SLEYLLRLLD RPFDLAQAPL LRVGLAQHHD HAIMLVVIHH SIIDAWSLGV
LWAELSQLYA SFFENQPIQL PSQAYDYLDF VAWQRQQLDS ACLAQLQTYW QTQLAQLDPL
PALATDYPRS THMQGLGISQ TYQLDQQVIQ ALQGLANANN ASLFMLLLAG WASVLYQRTQ
RSDLLIGTLS AGREHAAFER CVGFFINILP LRLHCAAEQT WLDLLQQTRM VALQAYQHQA
LPFEQIVANV AHERNNQPQI PLIQSLLVLQ NAPSQPLVLG APAQALATPI QASKTDLVLL
VQPAATGYQL TLEYASELFV AESIAALASD FQAVLGQMAQ HPTSTLSAVQ LAGHWTAEHY
SNKLPTLQPM AAPPQTALEQ TLADMWQEVL GLSIDNIHAD FFRMGGHSLN ATQVVSRMQQ
LLQVTTSIRM LFDYPTIAQL SQHLLANQAQ AERINKIATA LQQIKTMSAS TKQALQQKAA
GRISQP