Gene EcSMS35_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1081 
Symbol 
ID6144773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1096148 
End bp1097182 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content48% 
IMG OID641615967 
Productphosphotriesterase family protein 
Protein accessionYP_001743159 
Protein GI170681678 
COG category[R] General function prediction only 
COG ID[COG1735] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.651089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGATT ATCTACAAAC TGTGACGGGC CCCGTCGCCC GTGAAGATAT GGGGCTAACA 
CTTCCTCATG AACATCTGTT CAACGATCTT TCCTCTGTTG TGGATGCTCC TTGCTATCCT
TTTTCACAAC GGCTTGTTGA TAAAAAAGTC ACGGCAGAAA TCCAGTGGGC ATTAAAACAC
GATCCTTATT GCTGCGCTGA TAATATGGAC CGCAAACCTA TTGAAGATGT GATCTTCGAA
ATTAATAACT TTATCTCGCT CGGTGGTCGC ACCATTGTTG ATGCCACAGG GTCTGAGTCT
ATCGGACGAG ATGCGCAGGC ATTACGAGAA GTCGCATTAA AAACAGGTCT GAATATTGTT
GCTTCCTCCG GTCCCTACCT GGAAAAATTT GAAAGCCAGA GAATTCATAA AACGGTTGAT
GAACTAGCGA CAACGATCGA TAAAGAATTG AATCAAGGGA TTGGCGATAC GGATATTCGT
GCCGGAATGA TCGGTGAAAT AGGTGTCTCA CCGACATTTA CCGAAGCCGA GCATAACAGC
TTGCGGGCTG CCTCGCTGGC ACAGATTAAC AATCCTCATG TGGCGATGAA TATTCACATG
CCGGGCTGGC TTCGTCGCGG TGATGAAGTA CTCGACATTG TGTTAGGTGA AATGGGCGTC
TCGCCAAATA AAGTCTCTCT CGCACACTCG GATCCGTCAG GAAAAGACGT GGCGTATCAG
CGGAAAATGC TTGATAAAGG TGTCTGGCTG GAATTCGACA TGATTGGCCT AGACATTACC
TTCCCGAAAG AGGGAATAGC GCCAGGGGTG CAGGAGACTG CCGATGCCGT CGCTCATCTC
ATTGAGTTGG GATACGCCGA TCAGCTTGTT CTCAGCCACG ATGTCTTCCT TAAACAAATG
TGGGCTAAAA ATGGCGGTAA TGGCTGGGGA TTTGTTCCAG ATGTTTTTCT GGCCTATCTG
GCGGAGCGCG GCGTCGATAA AACGATCCTC AAAAAACTCT GTATCGATAA TCCCGGACGA
TTATTGACCG CATAA
 
Protein sequence
MKDYLQTVTG PVAREDMGLT LPHEHLFNDL SSVVDAPCYP FSQRLVDKKV TAEIQWALKH 
DPYCCADNMD RKPIEDVIFE INNFISLGGR TIVDATGSES IGRDAQALRE VALKTGLNIV
ASSGPYLEKF ESQRIHKTVD ELATTIDKEL NQGIGDTDIR AGMIGEIGVS PTFTEAEHNS
LRAASLAQIN NPHVAMNIHM PGWLRRGDEV LDIVLGEMGV SPNKVSLAHS DPSGKDVAYQ
RKMLDKGVWL EFDMIGLDIT FPKEGIAPGV QETADAVAHL IELGYADQLV LSHDVFLKQM
WAKNGGNGWG FVPDVFLAYL AERGVDKTIL KKLCIDNPGR LLTA