Gene Haur_4388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4388 
Symbol 
ID5736238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5604638 
End bp5605792 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content53% 
IMG OID641281550 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001547148 
Protein GI159900901 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000185957 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAAG CAGTGTTAAT TGATGCCGTT CGCACGCCGA TTGGTCGCCA ACAAGGCAGC 
CTACGCGATG TGCGCCCCGA TGTTTTATAT GCCCATGTGC TCAACACTTT AATCGAACGC
ACTGGGATTG ATCCAAACCT GATTGAAGAT GTGGTTACTG GTTGCGTAAC CAATACTGGT
GAGCAAGGCG CAAATATTGG TCGTTTGGGC GTGATGCTCT CCAATTTGCC AATTACGGTT
CCGGCGGTAA CCCTCAACCG CATGTGTGGC TCGGCTCAGC AGGCGATTCA TTTCGCGGCG
CAGGCAATTG CCGCAGGCGA CGTGAGCTAT GCAATCGCTG GCGGGGTTGA ATCGATGAGC
CGCGTGCCGA TGTTTAGCGA TGTGACAGGC AATTTTGCCA CCTTCAATCC TGCGATCAAC
GAAAAATATC AACTGGTGCA CCAAGGCGAA TCAGCCGAAC TCATTGCCGA GAAATATCAA
TTATCGCGCA CCGAGCTTGA TGATTGGAGC TTTGAGAGCC ATCAACGCGC CGCTGCCGCG
ACCAAGGCTG GTTGGTTTAG CAGCCAACTC GCGCCAATCG TTGGCAGCGA TAAAACTGGT
AATCCCCACG AATTAATCTA CGATGAAGGC ATTCGCTTCG AGGCTGATCG CGCCAAGATG
GGCACGCTCA AAACGGTGTT CCGTGCCGAT GGCGTGGTGA CTGCCGCCAA CGCCAGCCAA
ATTTCCGATG GTGCAGCGGT TGTATTAATT GGTGAGCGCG AGCAAGCTCT CGCCGATGGT
TTCAAGCCCC GTGCTAAATT CCGTGCTCGC GTGGTTGCCG CTGGTGATCC ACGCATGCAG
TTGCTCGAAG TAATTCCTGC GACGCATAAA GCCTTAGCCA AGGCTGGCTT GAGCATCAAC
GATATTGATC TGGTCGAAAT CAACGAGGCT TTTGCTTCAG TGGTGTTGGC ATGGTTGCGT
GAATTCAAGC TTGATCCTAG CCGCGTAAAT CCCAACGGCG GCGCGATTGC TCATGGTCAC
CCATTGGGCG CAACTGGCGC AGTCTTGATG AGCAAAATGA TCAACGAACT GGAACGCCGC
GATGCTCAAT TTGGCTTGCA AGTGATGTGC ATCGGTCACG GTCAAGCGAC CGCCACCATT
ATTGAGCGGG TATAA
 
Protein sequence
MAEAVLIDAV RTPIGRQQGS LRDVRPDVLY AHVLNTLIER TGIDPNLIED VVTGCVTNTG 
EQGANIGRLG VMLSNLPITV PAVTLNRMCG SAQQAIHFAA QAIAAGDVSY AIAGGVESMS
RVPMFSDVTG NFATFNPAIN EKYQLVHQGE SAELIAEKYQ LSRTELDDWS FESHQRAAAA
TKAGWFSSQL APIVGSDKTG NPHELIYDEG IRFEADRAKM GTLKTVFRAD GVVTAANASQ
ISDGAAVVLI GEREQALADG FKPRAKFRAR VVAAGDPRMQ LLEVIPATHK ALAKAGLSIN
DIDLVEINEA FASVVLAWLR EFKLDPSRVN PNGGAIAHGH PLGATGAVLM SKMINELERR
DAQFGLQVMC IGHGQATATI IERV