Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0685 |
Symbol | |
ID | 5589471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 710879 |
End bp | 712756 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640924401 |
Product | S54 family peptidase |
Protein accession | YP_001461827 |
Protein GI | 157159255 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | [TIGR02276] 40-residue YVTN family beta-propeller repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.085166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCAT CTTCGGTTAA GCCGTTAAAT GTTCAATTAC CCGCAATAAC CCTTATCCTT TTTGCGCTCT GTGTTGGGAT ATTTTGTTAC TTCGCACAAT GGATGAGTTA TGAAGAAGTC GATCAATCCG CACTCATCCA TCTCGGTGCT AACGTTGCTT CTCTCACGTT GTCGGGTGAA TCCTGGCGCT TATTGAGCAG TGTCTTTCTG CACAGCAGTG TTTCTCATTT GCTGATGAAT ATGTTTGCAC TCCTGGTGGT GGGGGGAGTG GTGGAACGTA TTCTGGGGAA ATGGCGACTC CTGATTGTCT GGCTATTCTC TGGCGTCTTT GGTGGGCTAA TCAGCGCCTG TTATGCGTTA CGCGAAAGTG AGCAGATAGT CATCAGCGTT GGGGCGTCCG GGGCAATTAT GGGAATAGCT GGCGCTGCGA TAGCAACACA GCTTGCTTCA GGTGCGGGCA CACACCATAA AAACCAGCGG CGAGTATTTC CTCTGTTGGG TATGGTGGCG CTGACACTGT TGTACGGTGC CCGGCAAACA GGAATAGATA ACGCTTGCCA CATTGGCGGC CTGATTGCGG GTGGCGCGTT GGGTTGGCTG AGCGCGTGTT TATCTGGGCA AAACCGACTC GTTACGGAAG GCGGGATTAT TGTTGCGGGC AGTCTTCTTC TGACCGGGGC TATCTGGCTT GCGCAGCAGC AGATGGATGA GTCAGTTTTA CAGGTCAGGC AAAGCCTGCG TGAAGCGTTT TATCCGCAGG AGATTGAACA AGAGCGACGA CAAAAAAAAC AACAGTTAGC GGAGGAACGC AACGCCCTCA GGGAAACATT ATCCGCTCCG GTAAGTCGTG AACAGGCCAG TGGTGATTTA CTCGCTGAGA TTGCCGATAT CCATGATATG GCGATCAGTC GGGATGGTAA TACGTTGTAT GCCGCAATTG AAAACACCAA CAGCATTATT GTTTTCGACC TCGGACAAAA GAAAATCCTG CATACCTTTA CAGCGCCCAT AGCGAAAGAA AAGTCAGTCA AACATTGTGG TGGCTGTAAA GATCAGGGCG TCAGATCGCT GGCATTAAGC CCGGATGAAA AGTTGATTTA TGCGACTTCA TTTGAAGCGA ATGCGTTATC GGTCATTAAC GTGGCGACAG GGGAGATTAT TCAGTCGATT ACCACCGGTG CACATCCTGA CAGCCTTATC CTCTCGCGTG ATGGCACAAA AGCCTGGGTG ATGAATCGCA CCAGTAATAG TGTGTCAGCG ATTGATCTGG TGACTTATCA GCATGTGGCG GATATCCCGC TGGAGAAATA CGACGGGACG GGGACGAGTA ATAAACCTGG TGCCTGGGTT ATGGCTCTTT CCCCGGATGA AAAAATATTG TTGATACCCG GTATGGTCAG AGGTGACATT GTACGCATCA ATACCATCAC GCATCAGAAA GAAAGCTATC CGGCTAGTGA TGCGCGAGGA ACGATATCGG CGATGCGTTT TCGACCTGAA AACGGGGATG TAATTTTTGC CGACAGCCAG GGGATTACAC GTATAAGCGT AGGGGATCAA CAAGCCAGCA TTATGACGCA ATGGTGTAGC AGGAGCGTTT ATTCCGTTGA GGGTATTAGC CCGGACGGTC AGTATTTAGC GTTGGTGTCA TATGGCCTGC AAGGTTATGT CATCCTGCTC AATATTAATG CCGGGCAGAT TATTGGCGTT TATCCTGCCA GCTACGTTAA TCACCTTCGT TTTTCAGCGG ATGGTAGAAA AATATTTGTT ATGGCGAAGA ACGGGTTGAT CCAACTGGAC AGGACGCGCT CGCTTGATCC GCAGGCAATT ATTCGTCATC CCCAATATGG CAATGTGGCT TGTATCCCTG AACCGTAA
|
Protein sequence | MSASSVKPLN VQLPAITLIL FALCVGIFCY FAQWMSYEEV DQSALIHLGA NVASLTLSGE SWRLLSSVFL HSSVSHLLMN MFALLVVGGV VERILGKWRL LIVWLFSGVF GGLISACYAL RESEQIVISV GASGAIMGIA GAAIATQLAS GAGTHHKNQR RVFPLLGMVA LTLLYGARQT GIDNACHIGG LIAGGALGWL SACLSGQNRL VTEGGIIVAG SLLLTGAIWL AQQQMDESVL QVRQSLREAF YPQEIEQERR QKKQQLAEER NALRETLSAP VSREQASGDL LAEIADIHDM AISRDGNTLY AAIENTNSII VFDLGQKKIL HTFTAPIAKE KSVKHCGGCK DQGVRSLALS PDEKLIYATS FEANALSVIN VATGEIIQSI TTGAHPDSLI LSRDGTKAWV MNRTSNSVSA IDLVTYQHVA DIPLEKYDGT GTSNKPGAWV MALSPDEKIL LIPGMVRGDI VRINTITHQK ESYPASDARG TISAMRFRPE NGDVIFADSQ GITRISVGDQ QASIMTQWCS RSVYSVEGIS PDGQYLALVS YGLQGYVILL NINAGQIIGV YPASYVNHLR FSADGRKIFV MAKNGLIQLD RTRSLDPQAI IRHPQYGNVA CIPEP
|
| |