Gene EcSMS35_1502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1502 
Symbolpct 
ID6143325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1485678 
End bp1487273 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content47% 
IMG OID641616380 
Productpropionate CoA-transferase 
Protein accessionYP_001743560 
Protein GI170680395 
COG category[I] Lipid transport and metabolism 
COG ID[COG4670] Acyl CoA:acetate/3-ketoacid CoA transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00752619 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACCTG TAAAACCACC TCGTATTAAT GGACGAGTGC CGGTCCTGTC GGCACAGGAA 
GCGGTGAATT ATATTCCCGA CGAAGCAACA CTTTGTGTGT TAGGCGCTGG CGGCGGTATT
CTGGAAGCCA CCACGTTAAT TACTGCTCTT GCTGATAAAT ATAAACAGAC TCAAACACCA
CGTAATTTAT CGATTATTAG TCCAACAGGG CTTGGCGATC GCGCCGACCG TGGCATTAGT
CCGCTGGCAC AAGAAGGTCT GGTGAAATGG GCATTATGCG GTCACTGGGG ACAATCGCCG
CGTATTTCTG ATCTCGCAGA ACAAAATAAA ATTATTGCTT ATAACTATCC ACAAGGTGTA
CTTACACAAA CCTTACGCGC CGCCGCAGCC CACCAGCCTG GTATTATTAG TGATATTGGC
ATCGGAACAT TTGTCGATCC ACGCCAGCAA GGCGGCAAAC TGAATGAAGT CACTAAAGAA
GACCTGATTA AACTGGTCGA GTTTGATAAC AAAGAATATC TCTATTACAA ATCGATTGCG
CCAGATATCG CCTTCATTCG CGCTACCACC TGCGACAGCG AAGGCTACGC CACTTTTGAA
GATGAGGTGA TGTATCTCGA CGCATTGGTT ATTGCTCAGG CGGTGCACAA TAACGGCGGT
ATTGTGATGA TGCAGGTGCA GAAAATGGTT AAGAAAGCCA CGCTGCATCC TAAATCTGTC
CGTATTCCGG GTTATCTGGT GGATATTGTG GTGGTCGATC CGGATCAAAC CCAACTGTAT
GGCGGTGCAC CGGTTAACCG CTTTATTTCT GGTGACTTCA CCCTTGATGA CAGTACCAAA
CTTAGCCTGC CCCTAAACCA ACGTAAATTA GTTGCGCGGC GCGCATTATT CGAAATGCGC
AAAGGCGCAG TGGGGAATGT CGGCGTCGGT ATTGCTGACG GCATTGGCCT GGTCGCCCGG
GAAGAAGGTT GTGCTGATGA CTTTATTCTG ACGGTAGAAA CAGGTCCGAT TGGCGGTATT
ACTTCACAGG GGATCGCCTT TGGCGCGAAC GTGAATACCC GCGCCATTCT GGATATGACG
TCCCAGTTTG ATTTTTATCA CGGTGGTGGT CTGGATGTTT GTTATTTGAG TTTTGCTGAA
GTCGACCAGC ACGGTAACGT CGGCGTGCAT AAATTCAATG GTAAAATCAT GGGCACCGGT
GGATTTATTG ATATCAGTGC CACTTCGAAG AAAATCGTGT TCTGCGGCAC ATTAACTGCG
GGCAGTTTAA AAACAGAAAT TACCGACGGC AAATTAAATA TCGTCCAGGA AGGACGGGTG
AAGAAATTTA TTCGGGAACT ACCGGAAATA ACTTTCAGCG GAAAAATCGC TCTCGAGCGA
GGGCTGGATG TTCGTTATAT CACTGAGCGC GCCGTATTCA CGTTAAAAGA AGACGGCCTG
CATTTAATCG AAATCGCTCC TGGCGTCGAT TTACAAAAAG ATATTTTCGA CAAAATGGAT
TTCACCCCAG TGATTTCGCC AGAACTCAAA CTGATGGACG AAAGATTATT TATCGATGCG
GCGATGGGTT TTGTCCTGCC TGAAGCGGCT CATTAA
 
Protein sequence
MKPVKPPRIN GRVPVLSAQE AVNYIPDEAT LCVLGAGGGI LEATTLITAL ADKYKQTQTP 
RNLSIISPTG LGDRADRGIS PLAQEGLVKW ALCGHWGQSP RISDLAEQNK IIAYNYPQGV
LTQTLRAAAA HQPGIISDIG IGTFVDPRQQ GGKLNEVTKE DLIKLVEFDN KEYLYYKSIA
PDIAFIRATT CDSEGYATFE DEVMYLDALV IAQAVHNNGG IVMMQVQKMV KKATLHPKSV
RIPGYLVDIV VVDPDQTQLY GGAPVNRFIS GDFTLDDSTK LSLPLNQRKL VARRALFEMR
KGAVGNVGVG IADGIGLVAR EEGCADDFIL TVETGPIGGI TSQGIAFGAN VNTRAILDMT
SQFDFYHGGG LDVCYLSFAE VDQHGNVGVH KFNGKIMGTG GFIDISATSK KIVFCGTLTA
GSLKTEITDG KLNIVQEGRV KKFIRELPEI TFSGKIALER GLDVRYITER AVFTLKEDGL
HLIEIAPGVD LQKDIFDKMD FTPVISPELK LMDERLFIDA AMGFVLPEAA H