Gene Sala_1005 details


Gene Information

Locus tag: Sala_1005
Symbol:
ID: 4081693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1029325
End bp: 1030485
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 72%
IMG OID: 638009365
Product: Acetyl-CoA C-acyltransferase
Protein accession: YP_616055
Protein GI: 103486494
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
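
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions are consistent with the values in this record, but the record itself does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1029325
end_bp = 1030485
gene_length_bp = 1161
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.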


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0836573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCGGACG TCTTTCTCTA TGACGCCGTC CGCACGCCAC GCGGCAAGGC ACGGCCCGAC 
GGCGGGCTCG CGAACCTCAG CCCGCAGGAA CTGGTGCGCC AGCAGGCCGC GGCACTGGCG
GCGCGCTGCG GCGATGTGGC CGCGCACCCC GACGCGCTGA TCCTCGGCTG CGTGACGCAG
ACCGGCGCGC AGGGTGGGCA TATCGCGCTC GTCGCCAAGC TCCACGCCGA CCTGCCCGAC
ACCATGGCGG CGCACAGCCT CAACAATTAT TGCGCCTCCG GACTCACGGC GATCGGACTG
GCGGTCGCAA AGGTCGCGAG CGGCGAGATC GACGTGGCGC TCGCCGGCGG CGTCGAATCG
ATGAGCGCGG CGCCCTTTCT GAGCGATCAT GCGGGCTTCT ACGCCAATGA CGAGCTGCCG
CCGCGCGCGC GCTTCGTGCC GCCGGTGCTC GGCGCCGACC GACTCGCGCA CGCCGAAGGC
ATCACGCGCG CCGAACTCGA CGCGGTTGCC CTCGCCTCGC AGCGCAAGGC GGCAATCGCC
GAAGGCGATG CCGCGCTCCA GAAATCGCGG ATAGCGACGG GCGCGCTCGC GGGCGAGGAA
TGTATCCGGC CGCAGACGAG CGCCGAATCG CTCGCGGCGA TGCCCGCTGC CTTCGGCGCA
TTGCAGGCGG AATATCCAGA CGCACTCGAA GGCACGCGCT TCGCGCCGCT GCACAGCGTC
GCGCACGCCC CGCCCATCTG CGACGGCGCG GGGCTGGCGC TCGTCGCGCG TGCGGCGCCT
GGCCCCCCTG CCCGCGCGCG CGTCGTCGCC TTTGCCGAAA GCGGCGGCGA TCCGGTCGCG
TCGCTGACCG CGGGATTTGC GGCGATGGAC AAGGTGCTCG TGCGCGCCGG ACTGTCGCTC
GCCGACATCG ACCGGATCGA GTTCATGGAG GCGTTCGCGG TCACCATCGC GAAGTTCCTG
CGCGACCGCG ACGTCGACCC GGAGCGGGTC AATGTCAGCG GTGGGCATCT GGCCAAGGGT
CATCCGATGG GCGCGAGCGG CGCGATCCTG ACCTCGACCT TGCTCGACGC GCTCGATGCG
TGTCGGGGCC GATACGGGCT GGTCGTGCTG ACCGGCGCGA TGGGGGTCGG CGCGGCGATG
GTGGTCGAAC GGGCGGCGTA A
 
Protein sequence
MADVFLYDAV RTPRGKARPD GGLANLSPQE LVRQQAAALA ARCGDVAAHP DALILGCVTQ 
TGAQGGHIAL VAKLHADLPD TMAAHSLNNY CASGLTAIGL AVAKVASGEI DVALAGGVES
MSAAPFLSDH AGFYANDELP PRARFVPPVL GADRLAHAEG ITRAELDAVA LASQRKAAIA
EGDAALQKSR IATGALAGEE CIRPQTSAES LAAMPAAFGA LQAEYPDALE GTRFAPLHSV
AHAPPICDGA GLALVARAAP GPPARARVVA FAESGGDPVA SLTAGFAAMD KVLVRAGLSL
ADIDRIEFME AFAVTIAKFL RDRDVDPERV NVSGGHLAKG HPMGASGAIL TSTLLDALDA
CRGRYGLVVL TGAMGVGAAM VVERAA
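
The correspondence between the gene and protein sequences above can be illustrated by translating the first and last codons of the gene sequence. This is a minimal sketch, not the pipeline used to produce this record. The `translate` helper and the `COMMON_BACTERIAL_STARTS` set are hypothetical names introduced for illustration, the codon table is the standard one written in TCAG order, and reading the initial TTG as methionine reflects the usual treatment of alternative bacterial start codons.

```python
from itertools import product

# Standard codon table in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small set of common bacterial start codons, for illustration.
COMMON_BACTERIAL_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = False) -> str:
    """Translate a DNA fragment codon by codon ('*' marks a stop codon).

    Spaces are ignored so the 10-base blocks of the record can be pasted
    as-is. If is_cds_start is True and the first codon is one of the common
    start codons, it is reported as 'M'.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if is_cds_start and codons and codons[0] in COMMON_BACTERIAL_STARTS:
        residues[0] = "M"
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in this record.
first_block = translate("TTGGCGGACG TCTTTCTCTA TGACGCCGTC", is_cds_start=True)
last_block = translate("GTGGTCGAAC GGGCGGCGTA A")

print(first_block)  # MADVFLYDAV
print(last_block)   # VVERAA*
```

The two fragments match the first ten and last six residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.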