Gene Information |
Locus tag | ECH74115_1168 |
Symbol | |
ID | 6972429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1183499 |
End bp | 1185349 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385167 |
Product | hypothetical protein |
Protein accession | YP_002269663 |
Protein GI | 209399945 |
COG category | |
COG ID | |
TIGRFAM ID | |
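The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 1183499
END_BP = 1185349
GENE_LENGTH_BP = 1851
PROTEIN_LENGTH_AA = 616


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in the table.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```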
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.44098 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTTA AACACTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGGAAAAG CTGACACACA AACTGAAAGA GGGCTGGCAG CCATACGGCG GACCGGTTGC CATTACGCCG TACACACTGA TGCAGGCGGT GGCTATTGAA GGAGATCCAC AGGTCGGCCC TTCATCTAAG CCGGACTGGT TCTACGTGGT TGTGCTTGCC GGACAGTCCA ACGGCATGGC CTACGGTGAA GGGCTTCCGT TACCGGATTC TTACGATGCT CCGGATCCGC GCATTAAACA GCTGGCGCGC CGCAGCACGG TAACTCCGGG TGGAGAGAGT TGTACGTATA ACGACATCAT TCCGGCCGAC CACTGCCTGC ATGATGTGCA GGATATGAGT ACGCTGAATC ATCCGAAGGC AGACCTGAGC AAAGGGCAGT ACGGCTGTGT CGGCCAGGGC TTACATATTG CCAAAAAACT GCTTCCGTAT ATCCCGAATA ACGCGGGGAT CCTGCTGGTA CCATGCTGTC GTGGTGGTTC TGCATTCACC CAGGGCGCTG AGGGGACATT CAGTGCGGAC ACGGGGGCCA GCCAGGATTC GGCACGCTGG GGTGTGGGTA AACCGTTATA TCAGGACCTG ATTGCGCGCA CTAAAGCTGC ATTACAGAAG AACCCGAAAA ATGTGTTGCT GGCGGTGTGC TGGATGCAGG GAGAGTTTGA CATGAGCGCC GCCACCCACG CACAGCAACC TGCGCTGTTT ACAGCCATGC TGACACAGTT TCGTGCTGAC CTCTCCGTGT TTAACGCGCA GTGCCATGGT GGCAGTGCTG CAGATGTGCC GTGGATTTGT GGTGATACGA CGTATTACTG GAAAAATACA TACGCTACCC AGTACGACAC CGTGTACGGC GGGTATAAAA ACAGGGAGAG TGAGGGCGTT TATTTTGTGC CCTTCATGAC AGACGGTAAC GGCGTCAATA CCGCCACTAA CGCGCCGGCA GAAGATCCGG ATATTCCGGC ATCAGGATAT TACGGTGCGG CATCGAGAAC GAATGGAAAC CAGGTATCAT CAAACCGCCC GACACATTTC AGTTCATGGG CGCGCAGGAG CATTATTCCG GATCGTCTGG CAACCGCTAT TCTGAACGCA GCCGGGCGCA CCTCCGCCTT CATCAGTGGT AAGGCACCGG AAATCAAACC CTCGCCCGGC GGCAACACGC CATCGGGTCC GTCTGCAGAT ACGTCCGTTC GCACAATCTC CCTGCTGCCG GCAGCCGGAG AGGCTGCTGC GCAGGGCTGG AGCATTAAGG ATGGCGGAAT TCAGTTGTCA GATGGTGTAT TTAAGATCAC CAGGCAGAGC AATAAAACCT GGTCCCTGAC GCATCCGGTG GATGACGCAA TTACCCTGCT GACACAGGGC GGCAGACTGA ACTGTAAGTT CCGCCTGTCA GGCGCACTGA CCAACAATCA GTTCGGGCTG GGGATTTATC TGTATACGGA TGCTCCCGTT CCTGATGGTG TGGCGATGAC GGGTACCGGT AATCCGTTCC TGATGTCGTA CTTCACTCAG ACCACTGACG GCAGAGTGAA TCTGATGCAT CACAGGAAAG CCGGAAACAC GAAGCTGGGG GAGTTCGGCG ATTACGGTAA CGACTGGCAG ACGCTGGAGC TGGTGTTCAC CGCCGGCAGT GCCACGGTTA CTCCGAAACT GAATGGAGTG GCTGGCCCGG CATTCCAGGT TATAAAAGAC AGTCTGACAC TGGGACTGAA TGCGCTGACG CTGACGGATG TTACAAAAAA TGCAGCGTAT 
GGCGTTGAGA TAGAAAGTCT GGTGCTGGAG ATAAATGCAC CGGCAGCATA A |
Protein sequence | MAFKHYDVVR AASPSDLAEK LTHKLKEGWQ PYGGPVAITP YTLMQAVAIE GDPQVGPSSK PDWFYVVVLA GQSNGMAYGE GLPLPDSYDA PDPRIKQLAR RSTVTPGGES CTYNDIIPAD HCLHDVQDMS TLNHPKADLS KGQYGCVGQG LHIAKKLLPY IPNNAGILLV PCCRGGSAFT QGAEGTFSAD TGASQDSARW GVGKPLYQDL IARTKAALQK NPKNVLLAVC WMQGEFDMSA ATHAQQPALF TAMLTQFRAD LSVFNAQCHG GSAADVPWIC GDTTYYWKNT YATQYDTVYG GYKNRESEGV YFVPFMTDGN GVNTATNAPA EDPDIPASGY YGAASRTNGN QVSSNRPTHF SSWARRSIIP DRLATAILNA AGRTSAFISG KAPEIKPSPG GNTPSGPSAD TSVRTISLLP AAGEAAAQGW SIKDGGIQLS DGVFKITRQS NKTWSLTHPV DDAITLLTQG GRLNCKFRLS GALTNNQFGL GIYLYTDAPV PDGVAMTGTG NPFLMSYFTQ TTDGRVNLMH HRKAGNTKLG EFGDYGNDWQ TLELVFTAGS ATVTPKLNGV AGPAFQVIKD SLTLGLNALT LTDVTKNAAY GVEIESLVLE INAPAA |
| |
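The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the GC content and Protein sequence fields could be recomputed from the Gene sequence field, the helpers below strip the whitespace, compute the percentage of G and C bases, and translate codon by codon. The function names are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns internal codons the same amino acids as the standard code.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order.
# Assumption: translation table 11 uses these same assignments for
# internal codons and differs only in which start codons it permits.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop."""
    bases = clean(seq)
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first two blocks of the gene sequence give the start of the protein.
print(translate("ATGGCATTTA AACACTATGA"))  # MAFKHY
```

Passing the full Gene sequence value to `gc_percent` and `translate` would allow a comparison with the GC content and Protein sequence fields; the result of that comparison is not asserted here.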