Gene Haur_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0021 
Symbol 
ID5736855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp24708 
End bp25772 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content53% 
IMG OID641277142 
Product3-oxoacyl-(ACP) synthase III 
Protein accessionYP_001542801 
Protein GI159896554 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3424] Predicted naringenin-chalcone synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACGAA TCCAAGCAAT TGGGCTAGGT GTGCCCCAGT ATTGCGTACC ACAGCGGGCG 
GTGCAAGAGT TGGTGGCCCA GCGCTTTGCT CCAGCGTTTC CCGAAATTCA GCGCTATCTG
ACGATTTTTC GCCATGCCCA AATCGATACA CGCTACTTGG TACGGCCCTT GGAGTGGTGG
CATGAACCGC GCAGTTTTGC TGAATGCAAT GCTATTTTTA TCGAAGAAGC CCTCAACTTG
AGTTGCCAAG CGATTGAGGC CTGTTTACAA CCACTTGATC TCACGCCCAA CGATATTGAT
CATTTGGTGG TGGTGACCAC AACTGGTTTG GCCGCGCCCA GCCTTGATGC GCGTTTGATG
CAGCGGCTTG GCATGCAACC TCAAACGCGC CGCACGCCAA TTTGGGGCTT GGGCTGCGCT
GGCGGTTTGG GCGGTCTCAA CACGGCCTTC GATTATCTCC GCGCTGAGCC AAATCAGCGG
GTCTTGTTAA TAAATGTTGA ATTTTGCTCG CTGACCTATC TCGCCGATGA TTTTTCCAAA
CGCAACTTGA TTGCCACTTC ACTGTTTGGG GATGGAGTGA CCGCCGTGCT GCTCGAAGGC
GATCAGGTGG CTCCGCGTGG GTCGGGCTTG GGCCAGATTG TTGGCACACT GAGTCATCTT
TATCCCGACA GCGCTGAAAT TATGGGCTGG AATGTGGTTA ATTCAGGGTT TGAGGTCGTA
TTCTCTTCGC GGATTCCCTC GATTGTGCGC GAAGAATTTC GCCCTTTGTT GGAACAATTT
TTGGCCCAAC ATGGCCTGAG TCAAGCCGAT CTTGGGCGCT ATTTATTGCA TCCTGGTGGG
GCAAAAGTCG TGCAAGCCTA CCAAGAATCG TTGCATCTCG CCGATGCCGA TTTGGCGGTT
TCGCGAGCAG TCTTGCGCGA ATATGGCAAC ATGTCGTCGG CTACCATTTT CTTTGTAATG
CAGCAAGCCT TAGCCGCCCA ACCTCTAGCC AAGGATGAAT ATGGCTTATT GGCGGTGTTC
GGGCCTGGTT TTAGTGCCGA ATTGGGATTA ATTCGCGGCG AATAA
 
Protein sequence
MPRIQAIGLG VPQYCVPQRA VQELVAQRFA PAFPEIQRYL TIFRHAQIDT RYLVRPLEWW 
HEPRSFAECN AIFIEEALNL SCQAIEACLQ PLDLTPNDID HLVVVTTTGL AAPSLDARLM
QRLGMQPQTR RTPIWGLGCA GGLGGLNTAF DYLRAEPNQR VLLINVEFCS LTYLADDFSK
RNLIATSLFG DGVTAVLLEG DQVAPRGSGL GQIVGTLSHL YPDSAEIMGW NVVNSGFEVV
FSSRIPSIVR EEFRPLLEQF LAQHGLSQAD LGRYLLHPGG AKVVQAYQES LHLADADLAV
SRAVLREYGN MSSATIFFVM QQALAAQPLA KDEYGLLAVF GPGFSAELGL IRGE