Gene Haur_1803 details

Gene Information

Locus tag: Haur_1803
Symbol:
ID: 5733705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2092256
End bp: 2093884
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 55%
IMG OID: 641278946
Product: 2,3-dihydroxybenzoate-AMP ligase
Protein accession: YP_001544574
Protein GI: 159898327
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1021] Peptide arylation enzymes
TIGRFAM ID: [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase
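
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2092256
END_BP = 2093884


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1629
print(protein_length(length_bp))  # 542
```

Both values match the Gene Length and Protein Length fields in the table.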


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAGC TGACGCTTGG CCTTGAAGGC ACAACTGCTT GGCCCGCCGA GTTTGCCGAA 
CGCTATCGTA AGGCTGGCTA TTGGGTTGAT CAGACCTTTG GCGAAGCACT ACGCGAGTGG
GCGCAACGCT CGGGCGATGC CACCGCTGTG GTATGTGGTG AACGCCGTTG GAGCTACCGC
GAGCTTGATC AACGGGTTGA TCGCTTGGCA GCGGGCTTGC AACAGCTCGG CATCCAAACA
AAACAACGGG TGGTGGTGCA ATTGCCCAAT TGCGCCGAAT GGTTTGTGGT TTGTTTTGCA
CTCTTTCGGG TTGGTGCAAT TCCGTTGATG GCACTGCCCG CCCATCGCTT GGCTGAAATT
GGCTATTTTT GCCAACACAG CGAAGCCGTG GCCTATGTGA TTGCCGATAA AGTTGGCAGT
TTCGATTATC GCAATTTGGC AGCTGAGGTC AAAGCGGTTG CGCCAACCTT GGAACATGTG
TTGGTGGTTG GTGAGGCTGG GCCGTTTACT GCTTTGGCTG AGGTCGATGC TGAGCCAAGC
GAATTTCCAA CGCTTGACCC TGCCGAGGTG GCTTTGTTCC AGCTTTCGGG CGGTAGCACG
GGCGTGCCCA AATTGATTGC CCGCACCCAC GACGATTATT TGTATTCGGT GCGAGCGAGT
GCTGAAATCT GCAAGTTGGA TGCCAGCAGC GTGTATTTGT GCGTCTTGCC GATGGCTCAT
AACTTTCCGA TGAGTTCGCC TGGAACATTG GGAACTTTGG CGGCGGGCGG CACGGTGGTG
CTTGCGCCGC AACCAAGCCC CGATGTGGCT TTTCCCTTGA TTGCTCGCGA AGGTGTGACG
ATTACTGGCA TGGTTCCGCC GCTGGCATTG CTGTGGCTCG ATGCTGCGGC AAATCGCAAG
GCCGAATTAT CAAGCCTCAA GCAAATTTTG GTTGGTGGTG CGCCCTTTGG TGCTTATACC
GCCCGCCGCG TGCAGCCAGA GCTTGGTTGC CAATTGCAAC AAGTCTATGG TATGGCTGAG
GGCTTGGTTA ATTACACACG GCTTGATGAT GCGGCTGAGC TGATTTGCCA CACCCAAGGC
CGACCAATTT CGCCCTTGGA TGAGGTACGG ATTGTCGATG ATGAAGATAA TGATGTGCCG
TTGGGCGAGC TTGGTCACCT GATTACCCGT GGCCCCTACA CGATTCGCGG CTATTATCGC
GCTGCTGAAC ATAATCAACG GGCTTTTACC AGCGACGGTT TTTATCGGAC TGGTGATTTG
GCACGCTTGA ACGCCACTGG CTATGTTTCA GTCGAAGGCC GCGCCAAAGA TCAAATCAAT
CGTGGTGGCG AAAAAGTCGC TGCCGAAGAA ATCGAGCAAC ATTTGCTCAA TCACCCAGCG
ATTCACGATG TGGCGCTGGT GGGCTTGCCC GACCGATTTT TGGGCGAACG CACTTGTGCG
GTGATTGTCA GCAACGGTGT GAATATCAAC CGCCGCGAAG TGTTGCAATT TTTGCGCAGC
CGTGGGCTTG CCGAATACAA ATTGCCAGAT CGGGTTGAAA TCGTGGAGAG TTTGCCCAAA
ACTGGGGTTG GCAAGATCAA CAAACGGCTG TTACGTGAGC AATTAAGTGC TGGTCGTGTG
CCAGCCTAG
 
Protein sequence
MAELTLGLEG TTAWPAEFAE RYRKAGYWVD QTFGEALREW AQRSGDATAV VCGERRWSYR 
ELDQRVDRLA AGLQQLGIQT KQRVVVQLPN CAEWFVVCFA LFRVGAIPLM ALPAHRLAEI
GYFCQHSEAV AYVIADKVGS FDYRNLAAEV KAVAPTLEHV LVVGEAGPFT ALAEVDAEPS
EFPTLDPAEV ALFQLSGGST GVPKLIARTH DDYLYSVRAS AEICKLDASS VYLCVLPMAH
NFPMSSPGTL GTLAAGGTVV LAPQPSPDVA FPLIAREGVT ITGMVPPLAL LWLDAAANRK
AELSSLKQIL VGGAPFGAYT ARRVQPELGC QLQQVYGMAE GLVNYTRLDD AAELICHTQG
RPISPLDEVR IVDDEDNDVP LGELGHLITR GPYTIRGYYR AAEHNQRAFT SDGFYRTGDL
ARLNATGYVS VEGRAKDQIN RGGEKVAAEE IEQHLLNHPA IHDVALVGLP DRFLGERTCA
VIVSNGVNIN RREVLQFLRS RGLAEYKLPD RVEIVESLPK TGVGKINKRL LREQLSAGRV
PA
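
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for joining such a display back into one contiguous string and computing its GC fraction might look like the following. The helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return gc_count / len(seq)


# First two blocks of the gene sequence shown above.
sample = clean_sequence("ATGGCTGAGC TGACGCTTGG")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.6
```

Applying the same two functions to the full gene sequence would give a value to compare against the GC content field reported in the Gene Information table.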