Gene EcSMS35_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0468 
SymbolcyoE 
ID6144229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp475117 
End bp476007 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content54% 
IMG OID641615362 
Productprotoheme IX farnesyltransferase 
Protein accessionYP_001742569 
Protein GI170682454 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0109] Polyprenyltransferase (cytochrome oxidase assembly factor) 
TIGRFAM ID[TIGR01473] protoheme IX farnesyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.519171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.259117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGTTTA AGCAATACCT GCAAGTAACG AAACCAGGCA TCATCTTTGG CAACCTGATC 
TCGGTGATTG GGGGATTCCT GCTGGCCTCA AAGGGCAGCA TTGATTATCC CCTGTTTATC
TACACGCTGG TTGGGGTGTC ACTGGTTGTG GCGTCGGGTT GTGTGTTTAA CAACTACATC
GACAGGGATA TCGACAGAAA GATGGAAAGG ACGAAGAATC GGGTGCTGGT TAAAGGCCTG
ATCTCTCCTG CTGTCTCGCT GGTGTACGCC ACCTTGCTGG GTATTGCTGG CTTTATGCTG
CTGTGGTTTG GCGCGAATCC GCTGGCCTGC TGGCTGGGGG TGATGGGCTT TGTGGTTTAT
GTCGGCGTCT ACAGCCTGTA CATGAAACGC CACTCGGTCT ACGGCACATT GATTGGTTCG
CTCTCCGGCG CAGCACCGCC AGTGATCGGC TACTGTGCGG TAACCGGCGA GTTCGATAGC
GGCGCAGCAA TCCTGCTGGC TATCTTCAGC CTGTGGCAGA TGCCTCATTC TTATGCCATC
GCCATTTTCC GCTTTAAGGA TTATCAGGCC GCAAACATTC CGGTACTGCC GGTGGTGAAA
GGCATTTCGG TGGCGAAGAA TCACATCACG CTGTATATCA TCGCCTTTGC CGTTGCCACG
CTGATGCTCT CTCTTGGCGG TTACGCTGGG TATAAATATC TGGTGGTCGC CGCGGCGGTG
AGCGTCTGGT GGTTAGGTAT GGCCCTGCGC GGTTATAAAG TTGCTGATGA TCGAATCTGG
GCGCGCAAGC TGTTCGGCTT CTCTATCATC GCCATCACTG CCCTCTCGGT GATGATGTCC
GTTGATTTTA TGGTGCCGGA CTCGCATACG CTGCTGGCTG CTGTGTGGTA A
 
Protein sequence
MMFKQYLQVT KPGIIFGNLI SVIGGFLLAS KGSIDYPLFI YTLVGVSLVV ASGCVFNNYI 
DRDIDRKMER TKNRVLVKGL ISPAVSLVYA TLLGIAGFML LWFGANPLAC WLGVMGFVVY
VGVYSLYMKR HSVYGTLIGS LSGAAPPVIG YCAVTGEFDS GAAILLAIFS LWQMPHSYAI
AIFRFKDYQA ANIPVLPVVK GISVAKNHIT LYIIAFAVAT LMLSLGGYAG YKYLVVAAAV
SVWWLGMALR GYKVADDRIW ARKLFGFSII AITALSVMMS VDFMVPDSHT LLAAVW