Gene Spro_3081 details


Gene Information

Locus tag: Spro_3081
Symbol:
ID: 5604353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 3390149
End bp: 3391354
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 65%
IMG OID: 640938622
Product: beta-ketoadipyl CoA thiolase
Protein accession: YP_001479310
Protein GI: 157371321
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02430] beta-ketoadipyl CoA thiolase
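
The reported gene and protein lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; neither is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3390149
end_bp = 3391354

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1206 401
```

Both numbers agree with the Gene Length and Protein Length fields.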


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.426252
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAGG CGTTTATCTG CGATGGCGTT CGCACGCCGA TTGGCCGCTA CGGCGGCGCA 
TTGGCCAACG TGCGTGCCGA TGATTTGGCC GCTCTGCCGC TGCGTGCCCT GTTAGCTCGC
CACCCACAGG TGGACTGGTC ATTGGTCGAT GATGTGATCC TCGGCTGCGC CAATCAGGCC
GGGGAAGACA ACCGCAATCT GGCCCGGATG GCAGTATTGC TGGCCGGCCT GCCGGTGAAC
GTTTCCGGCA CTACCGTCAA TCGCCTGTGC GGTTCGGGGC TGGACGCGCT GGCCATGGCG
GCTCGCAGCA TCAAGGCCGG TGAAGCCGGG CTGGTGCTGG CCGGCGGCGC AGAATCAATG
ACCCGCGCCC CGCTGGTGAT GGGCAAAGCC GACAGCGCTT TCAGCCGTCA GGCGCAACTG
TATGACACCA CTCTGGGCTG GCGCTTTATC AATCCGCTGA TGCAGGCGCA GTTCGGCACC
GACTCGATGC CGGAAACCGC CGAAAACGTG GCGGCGCAGT TCAACATCAG CCGCGCCGAT
CAGGACGCCT TCGCGCTGCG CAGCCAGCAA CGCGCCGCCC GGGCGCAAGA GTCAGGTTTA
CTGGCGCAGG AGATAGTGCC GGTCAGCCTC AGCGGTAAAA AAGGCGCGGT GACGTTGTTC
AGCCAGGACG AACACCCGCG CGCAGACACC CGGCTGGAAC AATTGCAGGC GCTGAAAACG
CCGTTCCGCC AACCCGGTAC CGTGACCGCC GGTAATGCCT CCGGCTTAAA CGACGGCGCA
GCGGCGCTGA TTGTTGCCTC CGAGGCAATC GCCGTCAGTC AGGGCCTCAC CCCGCGGGCG
CGTATCGTCG CCACCGCCAC CTGCGGCGTC GAACCCGGTT TGATGGGGAT CGGCCCACTG
CCGGCCACCC GCAAGGTACT GGAGTTAGCC GGGCTAAGCC TGGCGCAAAT GGACGTGATC
GAACTGAATG AGGCCTTTGC CGCCCAGGCG TTGGCGGTAC TGCGCCAGCT TGGCCTGCCG
GACGACGCGC CGCAGGTGAA TCCCAACGGT GGCGCTATTG CCCTTGGCCA CCCGCTGGGG
ATGAGCGGTG CACGCCTGGC GCTGGCCGCC TTGTTTGAAC TGGAACGGCG TTCCGGCCGC
TACGCGCTTT GCACCATGTG CATCGGCGTC GGCCAAGGCA TCGCCATGAT CATTGAGCGA
GTTTGA
 
Protein sequence
MSQAFICDGV RTPIGRYGGA LANVRADDLA ALPLRALLAR HPQVDWSLVD DVILGCANQA 
GEDNRNLARM AVLLAGLPVN VSGTTVNRLC GSGLDALAMA ARSIKAGEAG LVLAGGAESM
TRAPLVMGKA DSAFSRQAQL YDTTLGWRFI NPLMQAQFGT DSMPETAENV AAQFNISRAD
QDAFALRSQQ RAARAQESGL LAQEIVPVSL SGKKGAVTLF SQDEHPRADT RLEQLQALKT
PFRQPGTVTA GNASGLNDGA AALIVASEAI AVSQGLTPRA RIVATATCGV EPGLMGIGPL
PATRKVLELA GLSLAQMDVI ELNEAFAAQA LAVLRQLGLP DDAPQVNPNG GAIALGHPLG
MSGARLALAA LFELERRSGR YALCTMCIGV GQGIAMIIER V
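
The relationship between the two sequences can be illustrated by translating the gene sequence codon by codon. The sketch below is a minimal, dependency-free illustration and not the database's own pipeline. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard code (table 11 differs in which start codons it permits, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position
# varying fastest. Assumed valid for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence from the record.
first_blocks = "ATGAGTCAGG CGTTTATCTG CGATGGCGTT"
print(translate(first_blocks))  # MSQAFICDGV
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` is one way to compare against the listed protein sequence and GC content.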