Gene Haur_1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1802 
Symbol 
ID5733704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2091069 
End bp2092256 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content55% 
IMG OID641278945 
Productisochorismate synthase 
Protein accessionYP_001544573 
Protein GI159898326 
COG category[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1169] Isochorismate synthase 
TIGRFAM ID[TIGR00543] isochorismate synthases 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGTT TTGAACAAGA ACGGCTACGC GAATCGGCGT GGCAATTGCT AGACAACTAT 
CAAGCAGAAT CGGCCTTTTT CTTTGCATCG CCCAATCATA CCTTGTTGGG CCAATTAAGC
TATGTTGATT TGATCAGCCA AACCGCCTTG CTTGAGCTTG AGCAACGAGT TAACGAGGCG
CTGCAACGGG CTGAACGTGG TGGTGAAGTT AATCCAGTCG TGGTTGGGGC ATTGCCTTTT
GCTCCCGATG CGGCGGCTTA TTTGGCCTTG CCATCGCGGG TGGTTTGGGC TGGCCCATTG
CACGCCGAAG CCCAACCCTA TTGGCATAAC CAGCGTTTGC CGCATTGCAG CATCGAGCCA
ATGCCAGCGC CCGAACACTA CAAACAGGGT GTGGCTCAAG CCTTGGCCAA AATGCAGGCT
GGCGATTTGC AAAAAGTTGT GCTCTCACGC TCGTTGCAAT TGACCGCCGA AGCACCGCTT
GATGTGAATT TGATTCTGGC GAATTTGGCA CGTAACAACA AAACTGGCTA TACCTTTGCG
GTGCCGTTGC CAACCCGCCG CGCGTTGGTT GGGGCTAGCC CTGAATTGTT GCTGGCGCGT
AATGGCAATC AAGTGATCGC CAATCCCTTA GCTGGTTCGA TTCCGCGCAG CGCCGACCCT
GAAGAAGATG CGCGGCGGGC AGCAGGTTTG CTCGAATCGC CCAAAGATTT GCATGAACAT
AAGGTTGTAA TTGAGGCGGT TGCGGCGGCC TTAGCGCCAT TCTGTCTGAG CCTTGATGTG
CCGCAACCAA CCGTTATTTC CACCGCGACG ATGTGGCATC TCTCAACAAC CTTGGTTGGC
GAATTAAAGC CTGATGCACC TTCATCGTTG GGTTTGGCAT TGGCCTTGCA CCCAACTCCA
GCGGTCTGTG GTACGCCTAC CGAGGTCGCC CGCGCCGCCA TCCGCGAAAT CGAGCCGTTT
GATCGCGGCT TTTTCACGGG GATGGTTGGT TGGTGCAACG CCCAAGGCGA TGGCGAATGG
ATTGTGACGA TTCGTTGTGC CGAAGTTGTT GATCAATCGT TGCGTTTATT TGCTGGTGCT
GGGGTGGTAC TAGGCTCGAC TCCTGAAGCC GAGTTGGCCG AAACTGCGGC GAAATTCCGC
ACGATGTTGT TGGCGATGGG CATCGATAGC GAAGGCGAGG TGGCCTAA
 
Protein sequence
MSSFEQERLR ESAWQLLDNY QAESAFFFAS PNHTLLGQLS YVDLISQTAL LELEQRVNEA 
LQRAERGGEV NPVVVGALPF APDAAAYLAL PSRVVWAGPL HAEAQPYWHN QRLPHCSIEP
MPAPEHYKQG VAQALAKMQA GDLQKVVLSR SLQLTAEAPL DVNLILANLA RNNKTGYTFA
VPLPTRRALV GASPELLLAR NGNQVIANPL AGSIPRSADP EEDARRAAGL LESPKDLHEH
KVVIEAVAAA LAPFCLSLDV PQPTVISTAT MWHLSTTLVG ELKPDAPSSL GLALALHPTP
AVCGTPTEVA RAAIREIEPF DRGFFTGMVG WCNAQGDGEW IVTIRCAEVV DQSLRLFAGA
GVVLGSTPEA ELAETAAKFR TMLLAMGIDS EGEVA