Gene Haur_3870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3870 
Symbol 
ID5735719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4864201 
End bp4865355 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content56% 
IMG OID641281021 
Productacetyl-CoA acetyltransferase-like protein 
Protein accessionYP_001546632 
Protein GI159900385 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCAGG TTTTTATTGC AGGCACAGCT TGCACCGCCG TTCGCGAACA TTATGATCGT 
TCGTTGCTCG ATTTGGCCTT AGAGGCGCTG CATGGCGCGG TTGGTTCACT CGATCCAAGC
TTAATTCAAG CGTTGTATGT TGGCAATGCG CTAGGCGATA CGCTGAGTGA GCAAAGCCAA
TTGGGCGCGT ATATCGCTGG CGCGGCTGGC TTGAACTGCG AAGCGGTACG GGTTGAAGCG
GCTGGTGCTA GCGGCGCATT GGCTTTGCGT CAAGGCTATT TGGCAATTGC CAGCGGCCAG
GCTGATGTGG TGGTCGTGCT AGGCGTTGAA AAGGCCACTG ACAAACTCGA TGCTGCTTTG
CAGGCTGCCT TGGCCTTGGG CTTGGATGGC GAACTTGAAC GGGCACTTGG GCTGACATTA
ACTGGGGCTT GGGCACTGTT GATGCAACGT TATTTGCATG AATATCAATT GCCAGCCACC
GCCTTCGCGC CATTTGCGGT TAATGCACAT GCCAATGGGG CTGGCAATCG CCATGCGCTG
TATCGCTTTG CAATCAACGC TCAAAAATGG GCCAATGCCG GCCAAATTGC CGAGCCATTG
AATATGCTCG ATTGCTCGAC GGTGGCCGAT GGTGCAGCAG CAGTGGTGTT GGTCAGCCAA
CGCTATGCCC GCGAAATAGC GCAGCCAATC GCAATTGTGG GCAGCGCAAC CAGCAGCACC
AATGTTGCCT TGGCGCAACG CCCCGATCTG TTGTGGCTTG AAGCGGCAGC AGCTAGCGGT
AACAACGCGT TGCAACAAGC TAAACTCAAG CGCGATGCAA TTAACATCAT CGAATTAAGC
GACCCGCACG GGATTGCCGC AGCCTTGAGT TTAGAGGCAC TTGGGTATGC CGAACGTGGT
CATGCCACAC AACTGGCCGC CGAAGGTGTG ATTGCCAAGG ATGGCGCGTT GCCTTTGGCG
ACTGCTGGGG GCTACAAAGC TCGTGGCGAT GTTGGCGGCG CAACCGGAGT CTATCAAGTG
GTTGAGTTAG TGGCTCAACT GCGCGGCCAA GCCGGAGCCA ACCAAATTGC CAATGCCAAA
ACAGCCCTAG CCCAGTGCTT GGGTGGCGTT GGCGCAACTG CCGTGACTCA TATTTTGCAA
GTAGCGGAGG TCTAG
 
Protein sequence
MNQVFIAGTA CTAVREHYDR SLLDLALEAL HGAVGSLDPS LIQALYVGNA LGDTLSEQSQ 
LGAYIAGAAG LNCEAVRVEA AGASGALALR QGYLAIASGQ ADVVVVLGVE KATDKLDAAL
QAALALGLDG ELERALGLTL TGAWALLMQR YLHEYQLPAT AFAPFAVNAH ANGAGNRHAL
YRFAINAQKW ANAGQIAEPL NMLDCSTVAD GAAAVVLVSQ RYAREIAQPI AIVGSATSST
NVALAQRPDL LWLEAAAASG NNALQQAKLK RDAINIIELS DPHGIAAALS LEALGYAERG
HATQLAAEGV IAKDGALPLA TAGGYKARGD VGGATGVYQV VELVAQLRGQ AGANQIANAK
TALAQCLGGV GATAVTHILQ VAEV