Gene EcSMS35_4888 details


Gene Information

Locus tag: EcSMS35_4888
Symbol:
ID: 6143246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 5005139
End bp: 5007232
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 44%
IMG OID: 641619692
Product: hypothetical protein
Protein accession: YP_001746799
Protein GI: 170679657
COG category: [S] Function unknown
COG ID: [COG1479] Uncharacterized conserved protein
TIGRFAM ID:
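The gene and protein lengths listed above follow from the start and end coordinates. The short check below is only a sketch: it assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon. The variable names are illustrative and do not come from the record.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 5005139
end_bp = 5007232

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the final codon is assumed to be a stop codon,
# so it does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.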


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.886735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.908596
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCTA CAGAAGCCCG CCTACTCGAT TTTTTAAAAC GTTCGCAGCA GTTTGTGATT 
CCCATTTATC AACGGACTTA TTCATGGACA GAACAACAAT GTCGGCAACT TTGGGACGAC
ATCATTCGTG CCGGAAAGCG TGACGATATA TCAGCGCATT TTATCGGTTC GGTTGTTTAT
ATTGAGCAGG GGTTGTATCA GGTTTCTGGT ATTTCTCCGT TACTGGTCAT TGATGGTCAA
CAAAGGCTGA CGACCGCAAT GTTGCTGATT GAGGCTTTAT CGCGTCATCT TGGCGAAGAC
GAAGTTTTTG ATGGCTTTTC AGCAATGAAA TTGCGTAATT ATTATTTACT CAATCCTTAT
GAGTCCGGCG AGAAAGGATT TAAATTACTG CTGACCGAGA CTGATAAAGA CAGTTTGCTG
GCGTTAATAA AACAAAGACC AATGCCAGAA AACTATTCCC ATCGAATAAT GGAAAACTTT
ACTTTCTTTG ATGAGCAAAT TGCCAAACTC GGTGATGACT TGATCCCCTT ATGTCGTGGG
TTAGCAAAAT TATTAATTGT CGATGTGGCG CTTAATCGTG GTCAGGATAA TCCGCAACTG
ATTTTTGAAA GTATGAACTC TACCGGTAAG GCGTTAAGTC AGGCCGATCT GGTGCGCAAT
TTTATTCTGA TGGGTCTCGA ACCAGAGCAT CAAACCCGGT TGTATGAAGA TCACTGGCGT
CCAATGGAAG TCGCTTTTGG TCAGCAAGGT TACAGCGAAT ATTTTGACAG CTTTATGCGT
CATTATCTGA CGGTAAAAAC GGGGGAGATC CCTCGTACAG ATGAAGTCTA TGAGGCATTT
AAACTCCATG CCCGCAGCCA GAGTGTTGCT GAAAAAGGCG TGGATCACCT GGTTGAAGAT
ATTCACATCT ACGCGGAGTA TTACTGTGCA ATGGCATTGG GTAAAGAAAG TGACAAATCG
CTTGCTACGG CTTTTCAGGA TTTGCGCGAG TTAAAGGTCG ATGTGGCGTA TCCTTTCTTA
CTGGCGCTTT ATCATGACTA TAAAAATGGC GATTTGCCTC ACGAAGATTT CCTGAGCATA
ATTCGTTTAA TTGAATCTTA TGTTTTCCGC CGTGCAGTAT GTGCGATTCC GACAAATTCT
CTTAACAAGA CGTTTGCTAC CTTTTATAAG GTCATTATTA AAGAAAAATA TCTGGAAAGT
ATTCAGGTAC ATTTTCTGAA TCTGCCTTCA TATCGTCGTT TCCCCAACGA TGATGAATTT
AAACGGGAAT TAAAAGTTCG CGATCTCTAT AACTTCCGCA GTCGCAGCTA CTGGTTACGA
CGACTGGAAA ACGATAAACG CAGAGAGCGC GTGGAAGAGT TTACGATTGA GCATATTATG
CCGCAGAACG AAAATTTGTC GGCTAAATGG CGTGAAGAGC TGGGAAGCGA CTGGCAGCGT
GTGCATAAAG AACTGTTACA TACGTTGGGG AATCTCACAT TAACGCGCTA TAACTCCCGC
TACAGTGACA GACCTTTCGC GGAAAAACGC GATATTGAAG ACGGCTTTAA GCATAGCCCG
CTTTATTTGA ATATCGGTCT TGGACAGTGC GAAAAATGGG ATGAAGCCGC CATTCGCGCC
CGTGCCGATC GTCTGGCCGA TCTCGCGGTT CAGGTCTGGC AAGCGCCTGC TCTTCCTGAA
GAGGTTTTAG CTGTTTATCG GGCGCAGCCT GAAAACAAAA CCAGTTATAG CCTGAGTGAT
TATCCTTTTC TTGCTGATGG TTCGCATAGC CGGGTGTTAT TTGATCATCT TCGCGATGAA
GTTATGCGCC TGGACGCAGG GATCACGCAG GAAGTATTGA AGCTCTATAT TGCGTTTAAA
GCGGAAACGA ATTTTGTTGA TGTTGTGCCG CAAAAAAGCC GACTGCGATT GTCGCTTAAT
ATGCAGTTTC ATGAACTGGT CGATCCGAAA GGTATTGCCA AAGATGTGAC AAATGTTGGG
CGCTGGGGCA ATGGTGATGT GGAAATTGGT TTCAGCGACC TCGCACAACT TCCTTACGTT
ATGGGATTAA TTCGTCAGGC ATTTGAAAAA CAGATGGAGA GCGCGTTGGT CTAA
 
Protein sequence
MKATEARLLD FLKRSQQFVI PIYQRTYSWT EQQCRQLWDD IIRAGKRDDI SAHFIGSVVY 
IEQGLYQVSG ISPLLVIDGQ QRLTTAMLLI EALSRHLGED EVFDGFSAMK LRNYYLLNPY
ESGEKGFKLL LTETDKDSLL ALIKQRPMPE NYSHRIMENF TFFDEQIAKL GDDLIPLCRG
LAKLLIVDVA LNRGQDNPQL IFESMNSTGK ALSQADLVRN FILMGLEPEH QTRLYEDHWR
PMEVAFGQQG YSEYFDSFMR HYLTVKTGEI PRTDEVYEAF KLHARSQSVA EKGVDHLVED
IHIYAEYYCA MALGKESDKS LATAFQDLRE LKVDVAYPFL LALYHDYKNG DLPHEDFLSI
IRLIESYVFR RAVCAIPTNS LNKTFATFYK VIIKEKYLES IQVHFLNLPS YRRFPNDDEF
KRELKVRDLY NFRSRSYWLR RLENDKRRER VEEFTIEHIM PQNENLSAKW REELGSDWQR
VHKELLHTLG NLTLTRYNSR YSDRPFAEKR DIEDGFKHSP LYLNIGLGQC EKWDEAAIRA
RADRLADLAV QVWQAPALPE EVLAVYRAQP ENKTSYSLSD YPFLADGSHS RVLFDHLRDE
VMRLDAGITQ EVLKLYIAFK AETNFVDVVP QKSRLRLSLN MQFHELVDPK GIAKDVTNVG
RWGNGDVEIG FSDLAQLPYV MGLIRQAFEK QMESALV
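
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own tooling. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs mainly in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base slowest,
# third base fastest). "*" marks a stop codon. These amino acid assignments
# are assumed to apply to translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# Each full line holds 60 bases, so every line starts on a codon boundary.
first_line = "ATGAAAGCTA CAGAAGCCCG CCTACTCGAT TTTTTAAAAC GTTCGCAGCA GTTTGTGATT"
last_line = "ATGGGATTAA TTCGTCAGGC ATTTGAAAAA CAGATGGAGA GCGCGTTGGT CTAA"

print(translate(first_line))  # MKATEARLLDFLKRSQQFVI
print(translate(last_line))   # MGLIRQAFEKQMESALV
```

The two outputs correspond to the first 20 and the last 17 residues of the protein sequence listed above.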