Gene EcSMS35_4402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4402 
Symbol 
ID6146350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4494650 
End bp4496383 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content51% 
IMG OID641619223 
Producthypothetical protein 
Protein accessionYP_001746347 
Protein GI170680859 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.256321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.491316 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTCCA CAGAAGTCCA GGCTAAACCC CTTTTTAGCT GGAAAGCCCT GGGTTGGGCA 
CTGCTCTACT TTTGGTTTTT CTCTACTCTG CTACAGGCCA TTATTTACAT CAGTGGTTAT
AGTGGCACTA ACGGCATTCG CGACTCGCTG TTATTCAGTT CGCTGTGGTT GATCCCGGTA
TTCCTCTTTC CGAAGCGGAT CAAAATTATT GCCGCAGTGA TCGGCGTGGT GCTATGGGCG
GCCTCTCTGG CGGCGCTGTG CTACTACGTC ATCTACGGTC AGGAGTTCTC GCAGAGCGTT
CTGTTTGTGA TGTTCGAAAC CAACACCAAC GAAGCTAGCG AGTATTTAAG CCAGTATTTC
AGCCTGAAAA TTGTGCTTAT CGCGCTGGCC TATACGGCGG TGGCAGTTCT GCTGTGGACA
CGCCTGCGCC CGGTCTATAT TCCAAAGCCG TGGCGTTATG TTGTCTCTTT TGCCCTGCTT
TATGGCTTGA TTCTGCATCC GATCGCCATG AACACCATCA TCAAAGGCAA ACCGATTGAA
AAAACGCTGG ATAGTCTGGC ATCGCGAATG GAACCCGCAG CACCATGGCA ATTTATTTCC
GGCTACTACC AGTACCGCCA GCAACTTAAC TCGCTGACCA AATTACTCAA CGAAAACAAT
GCGCTGCCGC CGCTGGCTAA TTTCAAAGAT GAATCGGGTA ACGAACCGCG CACCTTAGTG
CTGGTGATTG GCGAGTCGAC CCAGCGTGGA CGCATGAGTC TGTACGGTTA TCCGCGTGAA
ACCACGCCGG AGCTGGATGC GCTGCATAAA ACCGATCCGA ATCTGACCGT GTTTAATAAC
GTGGTGACGT CTCGTCCGTA CACCATTGAA ATCCTGCAAC AGGCGCTGAC CTTTGCCAAT
GAAAAGAACC CGGATCTGTA TCTGACGCAG CCGTCGCTGA TGAACATGAT GAAACAGGCG
GGTTATAAAA CCTTCTGGAT CACCAACCAG CAGACGATGA CCGCCCGCAA TACCATGCTG
ACGGTATTTT CGCGCCAGAC CGACAAGCAG TACTACATGA ACCAGCAACG TACACAGAGT
GCGCGTGAAT ACGACACTAA CGTGCTGAAG CCGTTCCAGG AAGTGCTGAA GGACCCTGCG
CCGAAGAAAC TGATCATCGT TCATCTGCTG GGTACGCATA TCAAATACAA ATACCGCTAC
CCGGAAAATC AGGGCAAGTT TGATGGCAAT ACCGATCATG TCCCGCCGGG GTTAAACGCG
GAAGAGCTGG AGTCATATAA CGATTATGAC AACGCTAACC TGTATAACGA TCATGTGGTT
GCCAGCCTGA TTAAAGACTT TAAAGCGGCA GACCCGAACG GATTCCTGGT TTACTTCTCT
GACCACGGTG AAGAGGTTTA CGACACGCCG CCGCATAAAA CCCAGGGGCG TAATGAGGAC
AACCCGACGC GCCACATGTA CACCATTCCG TTCCTGCTGT GGACGTCGGA AAAATGGCAA
GCGACTCATC CGCGTGATTT CTCACAGGAT GTTGATCGTA AATACAGCCT GGCGGAATTG
ATCCACACCT GGTCAGATTT AGCGGGCTTA TCTTACGACG GTTACGACCC AACCCGTTCA
GTGGTGAATC CGCAGTTCAA AGAAACTACC CGCTGGATTG GTAACCCGTA CAAGAAAAAC
GCGCTGATCG ATTACGACAC GCTGCCGTAT GGCGACCAGG TAGGTAATCA GTAA
 
Protein sequence
MHSTEVQAKP LFSWKALGWA LLYFWFFSTL LQAIIYISGY SGTNGIRDSL LFSSLWLIPV 
FLFPKRIKII AAVIGVVLWA ASLAALCYYV IYGQEFSQSV LFVMFETNTN EASEYLSQYF
SLKIVLIALA YTAVAVLLWT RLRPVYIPKP WRYVVSFALL YGLILHPIAM NTIIKGKPIE
KTLDSLASRM EPAAPWQFIS GYYQYRQQLN SLTKLLNENN ALPPLANFKD ESGNEPRTLV
LVIGESTQRG RMSLYGYPRE TTPELDALHK TDPNLTVFNN VVTSRPYTIE ILQQALTFAN
EKNPDLYLTQ PSLMNMMKQA GYKTFWITNQ QTMTARNTML TVFSRQTDKQ YYMNQQRTQS
AREYDTNVLK PFQEVLKDPA PKKLIIVHLL GTHIKYKYRY PENQGKFDGN TDHVPPGLNA
EELESYNDYD NANLYNDHVV ASLIKDFKAA DPNGFLVYFS DHGEEVYDTP PHKTQGRNED
NPTRHMYTIP FLLWTSEKWQ ATHPRDFSQD VDRKYSLAEL IHTWSDLAGL SYDGYDPTRS
VVNPQFKETT RWIGNPYKKN ALIDYDTLPY GDQVGNQ