Gene PA14_26010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPA14_26010 
Symbol 
ID4380986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas aeruginosa UCBPP-PA14 
KingdomBacteria 
Replicon accessionNC_008463 
Strand
Start bp2272242 
End bp2273381 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content69% 
IMG OID639324655 
Productacyl-CoA thiolase 
Protein accessionYP_790233 
Protein GI116050942 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.653949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.286691 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCGA TTTCAGTCGT TATCGCCGGT TACGCCCGTT CGCCGTTCCA CTTCGCCAGG 
AAAGGCGCGC TGGTCGATAT CCGCCCGGAT GACCTGGCCG CCGCGGTGCT CAAGGGCCTG
GTGGAGAAAC TCGATCTCGA CCCTGTTCAG CTGGAGGACG TGGTCATGGG CTGCGCCTAT
CCGGAGGCGG AACAGGGCAT GAACATCGCG CGCATCGCCA GTTTCCGCGC CGGCTTCCCG
CAGAGCCTCG GCGGCGCGAC CCTCAACCGT TTCTGCGGTT CGTCCATGAG CGCTGTCCAC
TACGCCGCCG GGCAAGTCCT GCTGGGTGCC GGAGAAGCCT TCATCGCCGC CGGGGTGGAG
TCGATGACCC GGGTGCCGAT GGGCGGCTTC AACCTGTCGC CCAATCCCGC GTTGCTGCAG
GACTATCCGG CGGTATACAT GAGCATGGGG CAGACCGCCG AGAATGTCGC CGAGCGCTAC
GCCGTCAGCC GCGTCGAGCA GGAAGAGATG GCCGTACGCT CCCACGCCAA GGCAGTCGCC
GCGCGCGAGG CCGGCTTGCT GCGCGAGGAG ATCGTCGCCA TCGACACCCC TGCCGGCCGG
GTCGCCGAGG ATGGCTGCAT CCGCCCGGGT ACCAACCTGG AGAGCCTAGC CCAACTGAAG
CCGGCGTTCG GCGGCAGCGT CACCGCCGCC ACCTCGTCGC CGCTCACCGA TGGCAGCGCC
GCGCTGCTGG TCTGTAGCGA GGAATTCGCG CGCCGCCACG GGCTGGCGAT CCTGGCGCGG
ATCAAGGCGG TGGCGGTGGC CGGTTGTGCG CCGGAGATCA TGGGCATGGG TCCGGTGCAG
GCGACGCGCA AGGTTCTGCA ACGGGCCGGG CTGGGCATCG CCGACATCGA CCTGGTGGAG
ATCAACGAGG CCTTCGCCAG CCAATCCATC GCCTGCATCC GTGAACTGGG GCTGGACATG
GACAGGATCA ACCTGGATGG CGGCGCCCTG GCCATCGGCC ATCCGCTGGG CGCCACCGGC
GCGCGGATCA CCGGCAAGGC CGCCGCGTTG CTGCGGCGCA CCGGCGGGCG CTATGCCATC
GCCACCCAGT GCATCGCCGG CGGCCAGGGC GTGGCGACCC TGCTGGAAGC GGTGGAGTGA
 
Protein sequence
MSPISVVIAG YARSPFHFAR KGALVDIRPD DLAAAVLKGL VEKLDLDPVQ LEDVVMGCAY 
PEAEQGMNIA RIASFRAGFP QSLGGATLNR FCGSSMSAVH YAAGQVLLGA GEAFIAAGVE
SMTRVPMGGF NLSPNPALLQ DYPAVYMSMG QTAENVAERY AVSRVEQEEM AVRSHAKAVA
AREAGLLREE IVAIDTPAGR VAEDGCIRPG TNLESLAQLK PAFGGSVTAA TSSPLTDGSA
ALLVCSEEFA RRHGLAILAR IKAVAVAGCA PEIMGMGPVQ ATRKVLQRAG LGIADIDLVE
INEAFASQSI ACIRELGLDM DRINLDGGAL AIGHPLGATG ARITGKAAAL LRRTGGRYAI
ATQCIAGGQG VATLLEAVE