Gene Information |
Locus tag | EcSMS35_1068 |
Symbol | |
ID | 6145294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1079244 |
End bp | 1080314 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615955 |
Product | patatin family phospholipase |
Protein accession | YP_001743147 |
Protein GI | 170684069 |
COG category | [R] General function prediction only |
COG ID | [COG1752] Predicted esterase of the alpha-beta hydrolase superfamily |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0343802 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTACAG AAATGAAAAC GGGGCTGGTG CTGTCCGGTG GCGGGGCGGT GGGCGCTTAT CAGGCGGGAG TGGTTAAGGC ACTGGCAGAG TGTGGTACAC AGATCAGCAT GGTTTCAGGG GCCAGCATTG GCGCATTCAA TGGTGCCATT ATCGCGGCCT CTACCGATCT GTCAGAAGCT GCCGTACGCC TGGAGGCGCT CTGGGATCAT CTGGGGAATA ATCAGGTGCT GTCGGTAAAC AGATTGGTTT ACTTTTCATT GCTGAAAAAA TTGTTCCAGG CAATGAACCT CTGCCAGATC CCCGGACGTG CAGGAGCACT GCTTACGACG CTTCTTCGCC ATATATCGAC AATCAACGGG TTTGACAATC TGATGGCTCA GCCGTTGTTG TCAGATGAGC CCCTGACAGC GCTGATGGAT CATTATCTTG ATACTGATGC TCTGGCAGAC GGGCTACCGC TGTATGTGTC GCTGTACCCC ACAGAAGGGG GCATGCAGGA TATTATTGAC TGCATTCGTG CTGAACTGGG TGTCGGAACC ACGAAAAACG CCGTTTTTCA GCATATCCAG AGCCTGCCCC GCGGACAGCA GAAAGAGGCT CTGCTTGCGT CAGCCGCGCT GCCCCTGCTG TTCCGTCCCC GTGAGGTTCA GGGGACAATG TTCGGTGATG GTGGTATGGG AGGATGGCGA AATATGCAGG GAAATACCCC TGTGACGCCT CTGGTCGATG CCGGATGCAA TATGGTGATT GTGACGCATC TGAGTGACGG TTCTTTATGG GATCGCCAGG CTTTTCCGGA CACCACAATC CTTGAGATCC GTCCCCGGAA AAGGCTGAAA TATGCAGGTG ATGGTGGCAA CAGCGGCGGT CTGCTCAGTT TTACATCGGC ACATACCGAC GCCTGGCGTC AGCAGGGCTA TGAAGACACG ATGCTGGCGA TGGAGCATAT CCGGAAACCG CTGGCAGCAC GTCAGGCACT GACCCGGTCA GAGGCGGTAT TGCAGAAAAG CCTGGATATA ACGGAAGAGG CAGATTTGGC ACTGAGAAAC GCGATGGCCC GGATTAAATA A
Protein sequence | MSTEMKTGLV LSGGGAVGAY QAGVVKALAE CGTQISMVSG ASIGAFNGAI IAASTDLSEA AVRLEALWDH LGNNQVLSVN RLVYFSLLKK LFQAMNLCQI PGRAGALLTT LLRHISTING FDNLMAQPLL SDEPLTALMD HYLDTDALAD GLPLYVSLYP TEGGMQDIID CIRAELGVGT TKNAVFQHIQ SLPRGQQKEA LLASAALPLL FRPREVQGTM FGDGGMGGWR NMQGNTPVTP LVDAGCNMVI VTHLSDGSLW DRQAFPDTTI LEIRPRKRLK YAGDGGNSGG LLSFTSAHTD AWRQQGYEDT MLAMEHIRKP LAARQALTRS EAVLQKSLDI TEEADLALRN AMARIK