Gene EcolC_2248 details

Gene Information

Locus tag: EcolC_2248
Symbol:
ID: 6066943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2470448
End bp: 2472205
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 51%
IMG OID: 641601653
Product: hypothetical protein
Protein accession: YP_001725212
Protein GI: 170020258
COG category: [I] Lipid transport and metabolism
COG ID: [COG2267] Lysophospholipase
TIGRFAM ID:
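
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention explicitly. The function names are hypothetical and chosen only for this illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 2470448
end_bp = 2472205
gene_length_bp = 1758
protein_length_aa = 585


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon
    (assumes the stop codon is counted in the gene length)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, the 1758 bp span corresponds to 586 codons, of which 585 encode amino acids and one is the stop codon.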


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.555359
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAAAATT CACGCATCCC TGGGGAACAT TTTTTTACCA CCAGTGATAA TACAGCGTTG 
TTTTATCGGC ACTGGCCCGC TTTACAGCCC GGGGCGAAAA AGGTCATCGT CTTATTTCAT
CGCGGGCATG AACATTCTGG TCGTCTACAA CATCTCGTTG ATGAACTGGC GATGCCAGAT
ACTGCTTTTT ATGCCTGGGA TGCCCGAGGG CATGGAAAAA GTTCGGGGCC GCGTGGTTAT
AGCCCATCTC TTGCGCGTTC AGTGCGGGAT GTCGATGAAT TTGTCCGTTT TGCTGCCAGC
GACAGCCAGG TTGGACTGGA AGAGGTGGTA GTGATCGCGC AAAGCGTCGG CGCAGTGCTG
GTTGCCACAT GGATTCATGA TTATGCACCT GCAATTCGCG GGCTGGTGCT GGCTTCTCCG
GCCTTTAAGG TTAAATTGTA TGTGCCGCTG GCACGTCCTG CGCTGGCGTT ATGGCATCGT
CTGCGTGGTC TGTTTTTTAT TAATTCCTAT GTGAAAGGAC GCTATTTGAC CCACGATCGG
CAACGGGGGG CGAGTTTCAA TAATGATCCG CTGATCACAC GGGCGATTGC CGTTAATATC
TTGCTCGATC TCTACAAAAC GTCTGAACGT ATTATTAGAG ATGCGGCGGC GATTACGCTC
CCCACGCAAC TTCTGATATC AGGCGATGAC TATGTGGTGC ATCGCCAACC GCAGATTGAT
TTTTATCAGA GATTACGTAG CCCTCTGAAA GAGCTGCATC TGCTGCCAGG CTTTTATCAC
GACACGTTGG GTGAAGAGAA CAGGGCGCTG GCATTTGAAA AAATGCAAAG CTTTATTAGT
CGTTTATATG CTAACAAATC GCAAAAATTT GATTATCAGC ATGAAGACTG CACAGGACCA
TCAGCGGATC GATGGCGGCT ACTTTCTGGT GGACCCGTGC CATTATCGCC GGTTGATTTA
GCGTATCGCT TTATGCGAAA GGCGATGAAA TTGTTCGGGA CGCACTCTTC GGGCCTGCAT
CTCGGAATGA GCACCGGCTT TGATTCAGGC AGTTCGCTGG ATTATGTCTA TCAAAATCAA
CCGCAAGGTA GTAACGCATT CGGGCGCTTA ATCGACAAAA TCTACCTGAA CAGTGTTGGC
TGGCGCGGTA TTCGCCAGCG CAAAACCCAT TTACAAATAC TGATTAAACA AGCCGTTGCC
GATCTCCACG CCAAAGGTTT AGCCGTCCGC GTGGTTGACA TTGCCGCAGG GCATGGGCGC
TATGTACTGG ATGCACTGGC AAACGAGCCT GCCGTAAGCG ATATTTTATT ACGTGATTAC
AGTGAGTTAA ATGTTGCACA GGGGCAAGAG ATGATTGCCC AACGGGGAAT GTCTGGGCGG
GTGCGTTTTG AACAGGGCGA TGCGTTTAAC CCGGAGGAAC TCAGCGCGTT AACTCCGCGG
CCTACGCTGG CGATTGTCTC TGGACTGTAT GAGCTGTTTC CCGAAAATGA GCAGGTAAAA
AACTCACTCG CAGGTCTTGC CAATGCCATC GAACCGGGCG GCATTCTCAT CTACACCGGG
CAGCCGTGGC ACCCTCAACT GGAGATGATT GCCGGGGTGT TAACCAGTCA TAAAGATGGT
AAACCGTGGG TAATGCGCGT GCGTTCGCAA GGGGAGATGG ATTCACTCGT GCGTGATGCC
GGATTTGATA AATGCACACA ACGGATTGAT GAGTGGGGTA TTTTTACGGT TTCGATGGCG
GTGCGTCGTG ATAACTGA
 
Protein sequence
MENSRIPGEH FFTTSDNTAL FYRHWPALQP GAKKVIVLFH RGHEHSGRLQ HLVDELAMPD 
TAFYAWDARG HGKSSGPRGY SPSLARSVRD VDEFVRFAAS DSQVGLEEVV VIAQSVGAVL
VATWIHDYAP AIRGLVLASP AFKVKLYVPL ARPALALWHR LRGLFFINSY VKGRYLTHDR
QRGASFNNDP LITRAIAVNI LLDLYKTSER IIRDAAAITL PTQLLISGDD YVVHRQPQID
FYQRLRSPLK ELHLLPGFYH DTLGEENRAL AFEKMQSFIS RLYANKSQKF DYQHEDCTGP
SADRWRLLSG GPVPLSPVDL AYRFMRKAMK LFGTHSSGLH LGMSTGFDSG SSLDYVYQNQ
PQGSNAFGRL IDKIYLNSVG WRGIRQRKTH LQILIKQAVA DLHAKGLAVR VVDIAAGHGR
YVLDALANEP AVSDILLRDY SELNVAQGQE MIAQRGMSGR VRFEQGDAFN PEELSALTPR
PTLAIVSGLY ELFPENEQVK NSLAGLANAI EPGGILIYTG QPWHPQLEMI AGVLTSHKDG
KPWVMRVRSQ GEMDSLVRDA GFDKCTQRID EWGIFTVSMA VRRDN
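
As a rough consistency check between the two sequences above, the first and last blocks of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the method used to produce the record: it builds the standard codon table by hand and assumes that translation table 11 assigns the same amino acids to elongation codons as the standard code. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
# Assumption: for elongation codons, translation table 11 matches this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.
    Whitespace (such as the 10-base block spacing used above) is ignored,
    and any trailing partial codon is dropped."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and final line of the gene sequence, copied from above.
first_block = "ATGGAAAATT CACGCATCCC TGGGGAACAT"
last_block = "GTGCGTCGTG ATAACTGA"

print(translate(first_block))  # MENSRIPGEH
print(translate(last_block))   # VRRDN*
```

The first result matches the opening residues of the protein sequence, and the second matches its final five residues followed by the stop codon.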