Gene EcSMS35_3847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3847 
Symbol 
ID6146221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3922751 
End bp3924430 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content52% 
IMG OID641618673 
Producthypothetical protein 
Protein accessionYP_001745813 
Protein GI170681114 
COG category 
COG ID 
TIGRFAM ID[TIGR03368] cellulose synthase operon protein YhjU 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAT TTACGCAAAA TACCGCCATG CCTTCTTCCC TCTGGCAATA CTGGCGCGGC 
CTTTCCGGCT GGAACTTCTA TTTTCTGGTT AAGTTCGGCC TGTTGTGGGC AGGATATCTT
AACTTCCATC CGCTCCTCAA TCTGGTGTTT GCCGCGTTTC TGCTGATGCC CATTCCGCGC
TACAGCCTGC ATCGCTTACG CCACTGGATT GCCCTGCCGA TCGGCTTTGC TTTGTTCTGG
CATGACACCT GGTTGCCTGG CCCGGAAAGC ATAATGAGCC AGGGTTCGCA GGTGGCGGGG
TTCAGTACCG ATTATTTAAT CGACCTTGTC ACACGCTTTA TTAACTGGCA GATGATTGGG
GCCATTTTTG TTTTATTAGT GGCCTGGTTA TTCCTGTCAC AATGGATTCG CATTACCGTT
TTTGTGGTTG CCATTCTGCT ATGGCTGAAC GTACTTACCC TGGCGGGACC AAGTTTCTCC
TTGTGGCCTG CCGGTCAACC GACGACCACT GTAACAACGA CGGGTGGTAA CGCAGCGGCA
ACCGTTGCGG CGACGGGTGG CGCACCGGTA GTGGGTGATA TGCCCGCACA AACTGCACCG
CCAACAACGG CGAACCTTAA CGCCTGGCTG AATAATTTCT ATAACGCGGA GGCGAAACGT
AAATCGACCT TCCCGTCTTC GCTGCCCGCT GATGCTCAGC CATTTGAACT ACTGGTGATT
AATATCTGTT CGCTTTCCTG GTCGGATATT GAAGCCGCCG GGTTGATGTC GCATCCGCTG
TGGTCGCATT TCGATATTGA GTTCAAGAAC TTTAACTCCG CCACCTCCCA CAGTGGCCCG
GCGGCGATCC GTTTACTGCG CGCCAGCTGC GGGCAGACTT CGCACACTAA TCTGTATCAA
CCGGCAAATA ATGACTGCTA TCTGTTTGAT AACCTGTCGA AACTGGGCTT TACCCAGCAC
CTGATGATGG GACATAACGG CCAGTTCGGC GGTTTTTTGA AAGAAGTTCG CGAAAATGGT
GGCATGCAGA CCGAATTGAT GGATCAAACA AATCTGCCGG TTATTTTGCT GGGCTTTGAT
GGTTCGCCGG TTTATGACGA TACCGCCGTG CTTAACCGCT GGCTGGACGT TACCGAAAAA
GATAAAAACA GCCGTAGTGC CACGTTCTAC AACACGCTTC CACTGCATGA CGGCAACCAT
TATCCGGGCG TCAGCAAAAC AGCGGATTAC AAAGCGCGGG CGCAGAAATT CTTTGATGAA
CTGGACGCCT TCTTTACTGA ACTGGAGAAA TCGGGTCGTA AAGTGATGGT GGTCGTGGTG
CCAGAACACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGG TATCTGGCCT ACGTGATATC
CCTAGCCCGT CTATCACAGA CGTCCCCGTT GGGGTGAAAT TCTTCGGCAT GAAGGCACCA
CATCAGGGGG CACCGATTGT CATCGACCAA CCGAGCAGCT TCCTGGCTAT CTCCGATCTG
GTGGTTCGGG TTCTTGATGG CAAGATTTTC ACCGAAGACA ATGTTGACTG GAAAAAACTC
ACCAGTGGGT TGCCACAAAC AGCACCGGTC TCCGAGAACT CAAATGCAGT AGTTATTCAA
TACCAGGATA AACCGTACGT TCGCCTGAAC GGCGGCGACT GGGTGCCTTA CCCGCAATAA
 
Protein sequence
MTQFTQNTAM PSSLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF AAFLLMPIPR 
YSLHRLRHWI ALPIGFALFW HDTWLPGPES IMSQGSQVAG FSTDYLIDLV TRFINWQMIG
AIFVLLVAWL FLSQWIRITV FVVAILLWLN VLTLAGPSFS LWPAGQPTTT VTTTGGNAAA
TVAATGGAPV VGDMPAQTAP PTTANLNAWL NNFYNAEAKR KSTFPSSLPA DAQPFELLVI
NICSLSWSDI EAAGLMSHPL WSHFDIEFKN FNSATSHSGP AAIRLLRASC GQTSHTNLYQ
PANNDCYLFD NLSKLGFTQH LMMGHNGQFG GFLKEVRENG GMQTELMDQT NLPVILLGFD
GSPVYDDTAV LNRWLDVTEK DKNSRSATFY NTLPLHDGNH YPGVSKTADY KARAQKFFDE
LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQVSGLRDI PSPSITDVPV GVKFFGMKAP
HQGAPIVIDQ PSSFLAISDL VVRVLDGKIF TEDNVDWKKL TSGLPQTAPV SENSNAVVIQ
YQDKPYVRLN GGDWVPYPQ