Gene EcSMS35_4824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4824 
Symbol 
ID6144928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4911105 
End bp4912757 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content38% 
IMG OID641619628 
Producthypothetical protein 
Protein accessionYP_001746735 
Protein GI170681444 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGCGC AGCTTTTTGA GCAGTTGTTT CAATCGATAG ACTCTACACT GATCACCAAT 
ATTTTCATCT GGGCTGTTAT ATTCGTATTT TTATCAGCGT GGTGGTGTGA CAAAAAAAAT
ATACATAGTA AGTTTAGAGA ATATGCTCCA ACCTTAATGG GGGCATTAGG TATTCTGGGT
ACTTTCATTG GTATTATTAT TGGGTTACTC AATTTTAATA CTGAAAGTAT TGATACCAGC
ATCCCCGTAT TATTAGGCGG CCTAAAAACA GCATTCATTA CAAGCATTGT AGGTATGTTT
TTTGCCATTT TATTTAATGG AATGGATGCT TTCTTTTTTG CCAATAAACG AAGTGCGTTA
GCAGAAAATA ACCCTGAATC TGTTACACCT GAACATATCT ATCATGAATT AAAAGAGCAG
AACCAGACTC TGACTAAATT AGTCTCGGGT ATTAACGGTG ATAGTGAAGG TTCTCTTATT
GCTCAAATAA AATTACTACG TACTGAGATT AGCGATTCCT CGCAGGCACA ATTAGCTAAT
CACACTCATT TCAGTAATAA GCTTTGGGAA CAACTTGAAC AATTTGCAGA TCTAATGGCA
AAAGGTGCTA CAGAACAAAT TATTGATGCT TTACGACAAG TCATTATTGA TTTTAATGAA
AATTTAACTG AACAGTTTGG TGAAAACTTT AAAGCACTTG ATGCCTCTGT AAAAAAACTT
GTTGAGTGGC AGGAAAATTA TAAAACGCAA GTTGAGCTGA TGTCAGAACA ATATCAACAA
AGTGTCGAGT CTCTGGTTGA AACAAAAACT GCGGTTGCCG GGATTTGGGA AGAATGTAAA
GAAATTCCTC TGGCTATGTC TGAACTGCGT GAAGTGCTTC AGGTGAACCA ACATCAAATC
AGCGAACTCT CCCGCCATTT AGAAACCTTT GTCGCCATCC GCGATAAAGC TACAACCGTA
TTACCTGAAA TACAGAACAA AATGGCTGAA GTGGGTGAAC TGCTGAAATC CGGAGCTGCA
AATGTTAGTG CATCTCTTGA GCAAACCAGC CAGCAAATAC TTCTTAATGC AGATTCAATG
CGCGTGGCCC TGGATGAAGG TACCGAAGGA TTCAGACAAT CGGTTACCCA AACACAACAA
GCATTTGCCT CGATGGCACA TGATGTCAGC AATTCCTCCG AAACTCTAAC CAGCACGTTA
GGTGAAACAA TTACTGAAAT GAAACAAAGT GGTGAAGAAT TCCTGAAGTC ACTAGAGTCG
CACTCGAAAG AATTGCATAG AAATATGGAA CAAAATACGA CTAATGTAAT TGATATGTTC
AGTAAGACTG GTGAAAAGAT TAACCATCAA CTATCCAGTA ATGCCGATAA TATGTTTGAT
TCAATCCAGA CATCATTTGA TAAGGCAAGT GCAGGGCTGA CTTCTCAAGT CAGAGAATCA
ATTGAAAAAT TTGCTCTATC CATCAACGAG CAGTTACATG CTTTTGAGCA AGCAACTGAA
CGTGAGATGA ACCGTGAAAT GCAATCATTA GGTAATGCTC TGCTTTCAAT CAGCAAAGGT
TTTGTCGGTA ACTATGAAAA ACTTATTAAA GATTACCAAA TAATTATGGG GCAGTTACAA
GCATTAATTT CTGCTAATAA ACATCGAGGG TAA
 
Protein sequence
MLAQLFEQLF QSIDSTLITN IFIWAVIFVF LSAWWCDKKN IHSKFREYAP TLMGALGILG 
TFIGIIIGLL NFNTESIDTS IPVLLGGLKT AFITSIVGMF FAILFNGMDA FFFANKRSAL
AENNPESVTP EHIYHELKEQ NQTLTKLVSG INGDSEGSLI AQIKLLRTEI SDSSQAQLAN
HTHFSNKLWE QLEQFADLMA KGATEQIIDA LRQVIIDFNE NLTEQFGENF KALDASVKKL
VEWQENYKTQ VELMSEQYQQ SVESLVETKT AVAGIWEECK EIPLAMSELR EVLQVNQHQI
SELSRHLETF VAIRDKATTV LPEIQNKMAE VGELLKSGAA NVSASLEQTS QQILLNADSM
RVALDEGTEG FRQSVTQTQQ AFASMAHDVS NSSETLTSTL GETITEMKQS GEEFLKSLES
HSKELHRNME QNTTNVIDMF SKTGEKINHQ LSSNADNMFD SIQTSFDKAS AGLTSQVRES
IEKFALSINE QLHAFEQATE REMNREMQSL GNALLSISKG FVGNYEKLIK DYQIIMGQLQ
ALISANKHRG