Gene EcSMS35_1762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1762 
Symbol 
ID6144629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1770033 
End bp1771790 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content50% 
IMG OID641616638 
Producthypothetical protein 
Protein accessionYP_001743816 
Protein GI170683337 
COG category[I] Lipid transport and metabolism 
COG ID[COG2267] Lysophospholipase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000563252 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAATT CACGCATCCC TGGGGAACAT TTTTTTACCA CCAGTGATAA TACAGCGTTG 
TTTTATCGGC ACTGGCCCAC TTTACAGCCA GGGGCGAAAA AGGTCATCGT CTTATTTCAT
CGCGGTCATG AACATTCTGG TCGTCTACAA CATATCGTTG ATGAACTGGC GATGCCAGAT
ACTGCTTTTT ATGCATGGGA TGCCCGAGGG CATGGACAAA CTTCGGGGCC GCGTGGTTAT
AGTCCGTCTC TTGCGCGTTC AGTGCAGGAT GTCGATGAAT TTGTCCGTTT TGCTGCCAGC
GACAGCCAGG TCGGACTGGA AGAGGTGGTT GTGATCGCGC AAAGCGTCGG CGCAGTGATG
GTTGCTACAT GGGTTCATGA TTATGCGCCT GCTATTCGCG GGCTGGTGCT GGCTTCTCCG
GCCTTTAAGG TTAAATTGTA TGTGCCGCTG GCACGTCCTG CGCTGGCGTT ATGGCATCGT
CTGCGTGGTC TGTTTTTTAT TAATTCCTAT GTGAAAGGAC GCTATTTGAC CCACGATCGG
CAACGGGTGG CGAGTTTCAA TAATGATCCG CTGATCACAC GGGCGATTGC CGTTAATATC
TTGCTCGATC TTTATAAAAC GTCTGAACGT ATTGTTAGCG ATGCGGCGGC GATTACGCTC
CCCACGCAAC TTCTGATATC AGGCGATGAC TATGTGGTGC ATCGTCAACC GCAGATTGAT
TTTTATCAGA GATTACGTAG CCCTCTGAAA GAGCTGCATC TGCTGCCAGG CTTTTATCAC
GACACGTTGG GTGAAGAGAA CAGGGCGCAG GCATTTGAAA AAATGCAAAG CTTTATTAGT
CGTTTATATG CTAACAAATC GCAAAAATTT GATTATCAGC ATGAAGACCG CACTGGACCA
TCAGCGGATC GCTGGCGGCT CCTTTCAGGT GGACCCGTGC CATTATCGCC GGTTGATTTG
GCGTATCGCT TTATGCGTAA AGCGATGAAA TTGTTCGAGA CGCACTCTGC GGGCCTGCAT
CTCGGAATGA GCACCGGCTT TGATTCAGGC AGTTCGCTGG ATTATGTCTA TCAAAATCAA
CCGCAAGGTA GTAACGCATT CGGGCGTTTA GTCGACAAAA TCTACCTGAA CAGTGTTGGC
TGGCGCGGTA TTCGCCAGCG CAAAACCCAT TTACAAATGC TGATTAAACA AGCCGTTGCC
GATCTCCACG CCAAAGGTTT AGCCATCCGC GTGGTTGACA TTGCCGCAGG GCATGGGCGC
TATGTACTGG ATGCGCTGGA GTATGAACCT GCCGTATGCG ATATTTTGTT ACATGATTAC
AGTGAGTTAA ATGTTGCACA GGGGCAAGAG ATGATTGCCC AACGGGGAAT GTCTGGGCGG
GTGCGTTATG AACAGGGCGA TGCGTTTAAT CCGGCAGAAC TCAGCACGTT AACTCCGCGG
CCTACGCTGG CGATTGTCTC TGGCCTGTAT GAGCTTTTTC CCGAAAATGA GCAGGTAAAA
AACTCACTCG CAGGTCTTGC CAATGCCATC GATCCGGGTG GCATTCTCAT CTACACCGGG
CAGCCGTGGC ACCCACAACT GGAGCTGATT GCCGGGGTGT TAACCAGTCA TAAAGATGGT
AAACCGTGGG TAATGCGCGT GCGTTCTCAA GGGGAGATGG ATTCGCTCGT GCATGATGCC
GGATTTGATA AATGCACACA ACGGATTGAT GAGTGGGGCA TTTTTACGGT TTCGATGGCG
GTGCGTCGTG ATAACTGA
 
Protein sequence
MENSRIPGEH FFTTSDNTAL FYRHWPTLQP GAKKVIVLFH RGHEHSGRLQ HIVDELAMPD 
TAFYAWDARG HGQTSGPRGY SPSLARSVQD VDEFVRFAAS DSQVGLEEVV VIAQSVGAVM
VATWVHDYAP AIRGLVLASP AFKVKLYVPL ARPALALWHR LRGLFFINSY VKGRYLTHDR
QRVASFNNDP LITRAIAVNI LLDLYKTSER IVSDAAAITL PTQLLISGDD YVVHRQPQID
FYQRLRSPLK ELHLLPGFYH DTLGEENRAQ AFEKMQSFIS RLYANKSQKF DYQHEDRTGP
SADRWRLLSG GPVPLSPVDL AYRFMRKAMK LFETHSAGLH LGMSTGFDSG SSLDYVYQNQ
PQGSNAFGRL VDKIYLNSVG WRGIRQRKTH LQMLIKQAVA DLHAKGLAIR VVDIAAGHGR
YVLDALEYEP AVCDILLHDY SELNVAQGQE MIAQRGMSGR VRYEQGDAFN PAELSTLTPR
PTLAIVSGLY ELFPENEQVK NSLAGLANAI DPGGILIYTG QPWHPQLELI AGVLTSHKDG
KPWVMRVRSQ GEMDSLVHDA GFDKCTQRID EWGIFTVSMA VRRDN