Gene EcSMS35_1907 details


Gene Information

Locus tag: EcSMS35_1907
Symbol:
ID: 6146046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1928131
End bp: 1929075
Gene Length: 945 bp
Protein Length: 314 aa
Translation table: 11
GC content: 51%
IMG OID: 641616783
Product: hypothetical protein
Protein accession: YP_001743961
Protein GI: 170679923
COG category: [R] General function prediction only
COG ID: [COG1752] Predicted esterase of the alpha-beta hydrolase superfamily
TIGRFAM ID:
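
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive at both ends and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1928131
end_bp = 1929075
protein_length_aa = 314

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 945
print(expected_protein_length)  # 314
```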


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.163252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.264575
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTACGA TTGCATTCCA GGGGAATCTT GCGGGAATAA TGAGAAAGAT AAAAATAGGG 
CTGGCGCTGG GATCTGGCGC GGCGAGAGGT TGGTCGCATA TTGGCGTTAT TAATGCGCTA
AAAAAAGTGG GTATTGAAAT TGATATCGTT GCAGGATGCT CAATTGGTTC GCTGGTGGGC
GCGGCCTATG CATGCGATCG ATTATCTGCG CTGGAAGATT GGGTAACCTC TTTCAGTTAT
TGGGATGTTT TACGCCTGAT GGATCTCTCC TGGCAGCGCG GTGGGTTACT GCGCGGCGAG
CGTGTCTTCA ATCAATATCG CGAAATAATG CCGGAAACAG AGATCGAAAA TTGTTCCCGT
CGCTTCGCGG CTGTTGCCAC CAATTTAAGT ACTGGGCGCG AATTATGGTT TACTGAAGGC
GATCTCCATC TTGCTATTCG TGCATCATGC AGTATTCCAG GACTAATGGC ACCCGTTGCG
CATAACGGCT ACTGGCTGGT TGATGGCGCT GTAGTTAACC CAATTCCTAT TTCGCTCACG
CGTGCATTGG GTGCTGATAT TGTGATAGCG GTCGACCTGC AGCACGATGC TCATTTGATG
CAACAAGATC TGCTCTCCTT TAATGTCAGT GAAGAAAATA GCGAGAATGG TGATTCTCTG
CCGTGGCATG CGCGTCTGAA AGAAAGGTTG GGCAGCATAA CGACACGTCG GGCGGTGACA
GCGCCAACGG CAACAGAGAT TATGACCACT TCTATCCAGG TGCTGGAGAA CCGCCTTAAA
AGGAACCGCA TGGCAGGTGA TCCGCCCGAT ATTCTGATTC AACCTGTTTG CCCGCAAATA
TCTACGCTTG ATTTCCATCG CGCGCACGCT GCCATTGCGG CTGGGCAGCT GGCAGTGGAA
AAGAAAATGG ACGAACTTTT GCCGTTGGTA CGCACCAACA TTTGA
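
The GC content reported in the Gene Information section is the share of G and C bases in the gene sequence. The following is a minimal sketch of that calculation; the function name `gc_percent` is hypothetical, and the sketch assumes the sequence is pasted in the 10-base block layout used here, so whitespace is ignored.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Illustration on the first line of the gene sequence only,
# not on the whole gene.
first_line = "GTGGCTACGA TTGCATTCCA GGGGAATCTT GCGGGAATAA TGAGAAAGAT AAAAATAGGG"
print(round(gc_percent(first_line), 1))
```

Passing the full 945 bp sequence to the same function gives the whole-gene value to compare against the GC content field.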
 
Protein sequence
MATIAFQGNL AGIMRKIKIG LALGSGAARG WSHIGVINAL KKVGIEIDIV AGCSIGSLVG 
AAYACDRLSA LEDWVTSFSY WDVLRLMDLS WQRGGLLRGE RVFNQYREIM PETEIENCSR
RFAAVATNLS TGRELWFTEG DLHLAIRASC SIPGLMAPVA HNGYWLVDGA VVNPIPISLT
RALGADIVIA VDLQHDAHLM QQDLLSFNVS EENSENGDSL PWHARLKERL GSITTRRAVT
APTATEIMTT SIQVLENRLK RNRMAGDPPD ILIQPVCPQI STLDFHRAHA AIAAGQLAVE
KKMDELLPLV RTNI
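
The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. Under translation table 11, GTG is an accepted alternative initiation codon and is read as methionine in the first position. The following is a minimal sketch of such a translation, assuming the standard amino acid assignments, which table 11 shares with the standard code. The names `CODON_TABLE`, `START_CODONS` and `translate` are illustrative, and `START_CODONS` lists only a subset of the initiation codons that table 11 allows.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in the first position encodes methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGGCTACGA TTGCATTCCA GGGGAATCTT"))  # MATIAFQGNL
```

The output matches the first ten residues of the protein sequence. Without the start-codon rule the first residue would be V, since GTG codes for valine at internal positions.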