Gene EcSMS35_4579 details

Gene Information

Locus tag: EcSMS35_4579
Symbol:
ID: 6143994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4678899
End bp: 4680542
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 50%
IMG OID: 641619395
Product: putative cell division protein
Protein accession: YP_001746507
Protein GI: 170683071
COG category: [R] General function prediction only
COG ID: [COG2194] Predicted membrane-associated, metal-dependent hydrolase
TIGRFAM ID:
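
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4678899
END_BP = 4680542
GENE_LENGTH_BP = 1644
PROTEIN_LENGTH_AA = 547


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Both checks agree with the record: 4680542 - 4678899 + 1 = 1644 bp, and 1644 / 3 - 1 = 547 aa.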


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.507155
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAGC GCCTACTAAA AAGACCCTCT TTGAATTTAC TCGCCTGGCT ATTGTTAGCC 
GCTTTTTATA TCTCTATCTG CCTGAATATT GCCTTTTTTA AACAGGTGTT GCAGGCGCTG
CCGCTGGACT CGCTGCATAA CGTACTGGTT TTCTTGTCGA TGCCGGTCGT CGCCTTCAGC
GTGATTAATA TTGTCCTGAC ACTAAGCTCT TTCTTGTGGC TTAATCGACC GCTGGCCTGC
CTGTTTATTC TGGTCGGCGC AGCTGCACAA TATTTCATAA TGACTTACGG CATCGTCATC
GACCGCTCGA TGATTGCCAA TATTATTGAT ACCACTCCGG CAGAAAGTTA TGCGTTGATG
ACACCGCAAA TGTTATTAAC GCTGGGATTC AGCGGCGTGC TTGCTGCGCT GATTGCCTGC
TGGATTAAAA TCAAACCAGC CACCTCGCGC CTGCGCAGTG TTCTTTTCCG TGGAGCCAAT
ATTCTGGTTT CTGTACTGCT GATTTTGCTG GTCGCCGCAC TGTTTTATAA AGACTACGCC
TCGTTGTTTC GCAATAACAA AGAGCTGGTG AAATCCTTAA GCCCCTCTAA CAGCATTGTT
GCCAGCTGGT CATGGTACTC CCATCAGCGA CTGGCAAATC TGCCGCTGGT GCGAATTGGT
GAAGACGCGC ACCGCAACCC GTTAATGCAG AACGAGACAC GTAAAAATTT GACCATCCTG
ATTGTCGGCG AAACCTCGCG GGCGGAGAAC TTCTCCCTCA ACGGCTACCC GCGTGAAACT
AACCCGCGGC TGGCAAAAGA TAACGTGGTC TATTTTCCTA ATACCGCATC TTGCGGCACG
GCAACGGCAG TTTCAGTACC GTGCATGTTC TCGGATATGC CGCGTGAGCA CTACAAAGAA
GAGCTGGCAC AGCACCAGGA AGGCGTGCTG GATATCATTC AGCGAGCGGG CATCAACGTG
CTGTGGAATG ACAACGATGG CGGCTGTAAA GGCGTTTGCG ATCGCGTACC TCACCAGAAC
GTCACCGCGC TGAACCTGCC TGGTCAGTGC ATCAACGGCG AGTGCTATGA TGAAGTACTG
TTCCACGGGC TGGAAGATTA CATCAATAAC CTGCAGGGTG ATGGCGTGAT TGTCTTACAC
ACCATCGGCA GCCACGGCCC GACCTATTAC AACCGCTATC CGCCGCAGTT CAGGAAATTT
ACCCCAACCT GCGACACTAA CGAGATCCAG ACCTGTACCC AAGAGCAACT GGTGAACACT
TACGACAACA CGCTGGTTTA CGTCGACTAT ATTGTTGATA AAGCGATTAA TCTGCTGAAA
GAACATCAGG ATAAATTTAC CACCAGCCTG GTTTATCTTT CTGACCACGG TGAATCGTTA
GGTGAAAATG GCATCTATCT GCACGGTCTG CCTTATGCCA TCGCCCCGGA TAGCCAAAAA
CAGGTGCCGA TGCTGCTGTG GCTGTCGGAG GATTATCAAA AACGGTATCA GGTTGACCAG
AACTGCCTGC AAAAACAGGC GCAAACGCAA CACTATTCAC AAGACAATTT ATTCTCAACC
TTATTGGGCC TGACTGGCGT TGAGACGAAG TATTACCAGG CTGCGGATGA TATTCTGCAA
ACTTGCAGGA GAGTGAGTGA ATGA
 
Protein sequence
MLKRLLKRPS LNLLAWLLLA AFYISICLNI AFFKQVLQAL PLDSLHNVLV FLSMPVVAFS 
VINIVLTLSS FLWLNRPLAC LFILVGAAAQ YFIMTYGIVI DRSMIANIID TTPAESYALM
TPQMLLTLGF SGVLAALIAC WIKIKPATSR LRSVLFRGAN ILVSVLLILL VAALFYKDYA
SLFRNNKELV KSLSPSNSIV ASWSWYSHQR LANLPLVRIG EDAHRNPLMQ NETRKNLTIL
IVGETSRAEN FSLNGYPRET NPRLAKDNVV YFPNTASCGT ATAVSVPCMF SDMPREHYKE
ELAQHQEGVL DIIQRAGINV LWNDNDGGCK GVCDRVPHQN VTALNLPGQC INGECYDEVL
FHGLEDYINN LQGDGVIVLH TIGSHGPTYY NRYPPQFRKF TPTCDTNEIQ TCTQEQLVNT
YDNTLVYVDY IVDKAINLLK EHQDKFTTSL VYLSDHGESL GENGIYLHGL PYAIAPDSQK
QVPMLLWLSE DYQKRYQVDQ NCLQKQAQTQ HYSQDNLFST LLGLTGVETK YYQAADDILQ
TCRRVSE
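
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11, strips the whitespace used to group the sequence into blocks of ten, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTTGAAGC GCCTACTAAA AAGACCCTCT"))  # MLKRLLKRPS
# Last 24 bases (in frame, since the preceding 1620 bases are a multiple of 3).
print(translate("ACTTGCAGGA GAGTGAGTGA ATGA"))        # TCRRVSE
```

The two fragments match the first ten and last seven residues of the protein sequence listed above; passing the full 1644 bp gene sequence to `translate` should yield the complete protein in the same way.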