Gene GWCH70_2013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2013 
Symbol 
ID7978967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2072854 
End bp2074059 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content50% 
IMG OID644798836 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_002950006 
Protein GI239827382 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.536911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGAAG TCGTTATTGT TGATGCGGTG CGGACGCCGA TTGGAAAATA TAAAGGGGCG 
TTAAAAGACG TCCGACCGGA TGATCTAGGG GCAACCGTCA TCCGTGCGCT TGTGGAACGC
AATCCAAATC TTCCAGTCCA TGAAATTGAA GAAGTTGTTC TCGGCAATGC CAATCAAGCT
GGGGAAGACA ACCGAAATGT CGCGCGGATG TCCGCCTTGC TAGCTGGATT GCCAGTTGAA
GTGGCAGGAA CGACCGTCAA TCGTCTTTGC GGCTCAGGGC TTGATGCCGT CAACTACGCT
GCTCGCACGA TTATGACAGG AGAAGCAGAC ATTGTAATCG CAGGTGGAAC GGAAAGCATG
ACGCGCGCGC CGTTCGTCAT GGCAAAACCG AGCACAGACT TTCCACGCGG CAATATGGAA
ATGTTCGATA CGACCATCGG ATGGCGTTTT ATTAACCCGA AGATGGAAGA AATGTACGGG
ACCGACAGCA TGCCGCAGAC AGCGGAAAAT GTCGCAAAAC GGTTTGGCAT TTCTCGTGAA
GCGCAAGATG AATTTGCTTA TGAAAGCCAA ATGAAAGCGA AAAAAGCGAT CGAATCGAAT
CGCTTCGCAG ACGAATTAGT TCCTGTCGTA TATGTTGACC GGAAAGGAAA CGAAGTCATC
GTAGATAAAG ACGAACATCC TCGTCCAGAC ACAACGTTGG AGAAGCTTGC CAAACTGCCT
CCGCTATTTG AAAATGGAAC CGTCACAGCA GGGAACGCTT CAGGAGTCAA CGATGGAGCG
TCCGCGCTGT TATTAATGAG CGCGGAAAAA GCAAAAGAGC TTGGAATGAA ACCGCTCGCA
AAGTATGTAA CGTCAGCAGT TGCAGGAGTG GAGCCAGCGG TGATGGGAAT CGGACCGATT
TACGCGACGA GGAAAGCACT TTCGCGCGCG AAATTAACAA TCGATGATAT CGGCTTAGTG
GAATTGAATG AAGCGTTTGC CTCACAGTCG CTGGAATGCA TCAAACAGCT GGAACTTGAC
CGCGCGAAAG TAAACGTCAA TGGTGGAGCG ATCGCCCTTG GGCATCCGCT TGGGGCAAGC
GGCGCCCGCA TTTTAACGAC ACTCGTTTAC GAAATGAAAA AACGCAGGGT GAAATACGGC
CTTGCGACAA TGTGTGTCGG TGTAGGACAA GGAATCGCGA CGATTGTTGA AAATCCGGAA
GTCTAA
 
Protein sequence
MREVVIVDAV RTPIGKYKGA LKDVRPDDLG ATVIRALVER NPNLPVHEIE EVVLGNANQA 
GEDNRNVARM SALLAGLPVE VAGTTVNRLC GSGLDAVNYA ARTIMTGEAD IVIAGGTESM
TRAPFVMAKP STDFPRGNME MFDTTIGWRF INPKMEEMYG TDSMPQTAEN VAKRFGISRE
AQDEFAYESQ MKAKKAIESN RFADELVPVV YVDRKGNEVI VDKDEHPRPD TTLEKLAKLP
PLFENGTVTA GNASGVNDGA SALLLMSAEK AKELGMKPLA KYVTSAVAGV EPAVMGIGPI
YATRKALSRA KLTIDDIGLV ELNEAFASQS LECIKQLELD RAKVNVNGGA IALGHPLGAS
GARILTTLVY EMKKRRVKYG LATMCVGVGQ GIATIVENPE V