Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2790 |
Symbol | |
ID | 6968329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2602440 |
End bp | 2604290 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643386643 |
Product | hypothetical protein |
Protein accession | YP_002271122 |
Protein GI | 209397280 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.578932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCGGAAAAG CTGACACACA AACTGAAAGA GGGCTGGCAG CCATACGGCG GACCGGTTGC CATTACGCCG TACACACTGA TGCAGGCGGT GGCTATTGAA GGAGAGCCAC AGGTCGGCCC TTCATCTGAG CCGGATTGGT ACTACGTCAT CGTACTGGCC GGGCAGTCCA ATGCCATGGC TTACGGTGAA GGGCTTCCGC TGCCGGATTC ATACGATGCT CCGGATCCGC GCATTAAACA GCTGGCGCGC CGCAGTACAG TGACGCCGGG CGGGGCTGCC TGCAGATATA ACGATATTAT TCCGGCTGAC CACTGTCTGC ATGATGTGCA GGATATGAGT ACGCTGAATC ATCCGAGGGC TGACCTGAGC AAAGGGCAGT ACGGCTGTGT CGGCCAGGGT TTACATATTG CCAAAAAACT GCTCCCGTAT ATCCCGAATA ACGCGGGGAT CCTGCTGGTA CCATGCTGTC GTGGTGGTTC GGCATTTACC CAGGGCGCGG AGGGGACATT CAGCGAGTCC ACGGGGGCCA GTCAGGATTC GGCACGCTGG GGGGTGGGCA AGCCGTTATA TCAGGATCTG ATTTCCCGCA CAAAAGCGGC ATTGCAGAAA AATCCCAAAA ACGTTCTGCT GGCCGTCTGC TGGATGCAGG GTGAGTTTGA CATGAGCGCC GCCACCCACG CACAGCAACC TGCGCTGTTT ACAGCCATGC TGACACAGTT TCGTGCTGAC CTCTCCGTGT TTAACGCGCA GTGCCATGGT GGCAGCGCTG CAGATGTGCC GTGGATTTGT GGTGACACGA CGTATTACTG GAAAAATACA TACGCTACCC AGTACGACAC CGTGTACGGC GGGTATAAAA ACAGGGAGAG TGAGGGCGTT TATTTTGTGC CCTTCATGAC AGACGGTAAC GGCGTCAATA CCGCCACTAA CGCGCCGGCA GAAGATCCGG ATATTCCGGC ATCAGGATAT TACGGTGCGG CATCGAGAAC GAATGGAAAC CAGGTATCAT CAAACCGCCC GACACATTTC AGTTCATGGG CGCGCAGGAG CATTATTCCG GATCGTCTGG CAACCGCTAT TCTGAACGCA GCCGGGCGCA CCTCAGCCTT CATCAGTGGT AAGGCACCGG AAATCAAACC CTCGCCCGGC GGCAACACGC CATCGGGTCC GTCTGCAGAT ACGTCCGTTC GCACAATCTC CCTGCTGCCG GCAGCCGGAG AGGCTGCTGC GCAGGGCTGG AGCATTAAGG ATGGCGGAAT TCAGTTGTCA GATGGTGTAT TTAAGATCAC CAGGCAGAGC AATAAAACCT GGTCCCTGAC GCATCCGGTG GATGACGCAA TTACCCTGCT GACACAGGGC GGCAGACTGA CCTGTAAGTT CCGCCTGTCA GGCGCACTGA CCAACAATCA GTTCGGGCTG GGGATTTATC TGTATACGGA TGCTCCCGTT CCTGATGGTG TGGCGATGAC GGGTACCGGT AATCCGTTCC TGATGTCGTA CTTCACTCAG ACCACTGACG GCAGGGTGAA TCTGATGCAT CACAGGAAAG CCGGAAACAC GAAGCTGGGG GAGTTCGGCG ATTACGGTAA CGACTGGCAG ACGCTGGAGC TGGTGTTCAC CGCCGGCAGT GCCATGGTTA CTCCGAAACT GAATGGAGTG GCTGGCCCGG CATTCCAGGT TATAAAAGAC AGTCTGACAC TGGGACTGAA TGCGCTGACG CTGACGGATG TTACAAAAAA TGCAGCGTAT GGCGTTGAGA TAGAAAGTCT GATGCTGGAG ATAAATGCAC CGGCAGCATA A
|
Protein sequence | MSIKHYDVVR AASPSDLAEK LTHKLKEGWQ PYGGPVAITP YTLMQAVAIE GEPQVGPSSE PDWYYVIVLA GQSNAMAYGE GLPLPDSYDA PDPRIKQLAR RSTVTPGGAA CRYNDIIPAD HCLHDVQDMS TLNHPRADLS KGQYGCVGQG LHIAKKLLPY IPNNAGILLV PCCRGGSAFT QGAEGTFSES TGASQDSARW GVGKPLYQDL ISRTKAALQK NPKNVLLAVC WMQGEFDMSA ATHAQQPALF TAMLTQFRAD LSVFNAQCHG GSAADVPWIC GDTTYYWKNT YATQYDTVYG GYKNRESEGV YFVPFMTDGN GVNTATNAPA EDPDIPASGY YGAASRTNGN QVSSNRPTHF SSWARRSIIP DRLATAILNA AGRTSAFISG KAPEIKPSPG GNTPSGPSAD TSVRTISLLP AAGEAAAQGW SIKDGGIQLS DGVFKITRQS NKTWSLTHPV DDAITLLTQG GRLTCKFRLS GALTNNQFGL GIYLYTDAPV PDGVAMTGTG NPFLMSYFTQ TTDGRVNLMH HRKAGNTKLG EFGDYGNDWQ TLELVFTAGS AMVTPKLNGV AGPAFQVIKD SLTLGLNALT LTDVTKNAAY GVEIESLMLE INAPAA
|
| |