Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3155 |
Symbol | |
ID | 6969352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2921597 |
End bp | 2922646 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386978 |
Product | hypothetical protein |
Protein accession | YP_002271445 |
Protein GI | 209397571 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000300414 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0000000000395416 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGGGTAT TACTTCGACC TGTTCTGGTA CCGGAACTCG GTCTGGTTAT CGTTAAGCCA GGCCGTGAAT CAATGTCAGC ATTCCATAAC GGCAGAATAC TGGTGGAGCC GGAACCAAAA AGCATGCGAG CTCTGCCGTC CGGGGTTGTA CCTGCCGTTC ACCAGCCGCT GGCGGAAGAT AAATCACTAC TGCCATTTTT CAGCGATGAG CGGGTGATCC GTGCTGCGGG TGGCGCTGGT GCACTGTCTG ACTGGTTATT ACGTCACGTG AAATCCTGCC AGTGGCTACA CGGTGATTAT CATCACAGCG AAACCGTCAT TCACCGTTAC GGTACCGGCG CGATGGTGTT GTGCTGGCAC TGCGACAACC AGCTGCGGGA GCAGACATCT GATTCACTGG ATCAACTTGC TCAACAGAAT CTGGCCGCCT GGATGATTGA CATCATCCGT CACGCAATGA ATGGCGCACA GGAGCGTGAA TTATCTCTGG CTGAATTATC CTGGTGGGCG GCCTGCAATC AGGTGGTGGA TGCACTACCT GAGGCAGTAG CGCGTCGTTC TCTGGGATTA CCGGCGGAAA AAATCCGCTC CGTATACCGT GAAAGCGACA TCATACCGGG AGAACAGACC GCCACCAGCA TACTGAAGCA GCGCACAAAA AATATTGCGC TACCGCCTCA CACCCACCAG CAACAGAACC CACCACAGGA AAAGACGGTG GTCAGCATTG CCGTTGATCC GGAGTCTCCG GAATCCTTCA TGAAACGACC TAAACGTCGC CGCTGGGTAA ATGAGAAATA CACACGCTGG GTAAAGACAC AGCCGTGTGC GTGTTGTGGT AAGCCAGCCG ACGATCCGCA TCACCTGATT GGTCATGGTC AGGGCGGAAT GGGGACAAAA TCTCACGATA TTTTCACGCT ACCGCTGTGT CGGGAGCATC ACAACGAGCT TCATGCGGAT CCGCTGGCGT TCGAAGAAAA GCATGGTTCT CAGGTTGATT TAATTTTTCG TTTTCTTGAT CACGCCTTTG CAACCGGCGT GCTCGGGTAA
|
Protein sequence | MRVLLRPVLV PELGLVIVKP GRESMSAFHN GRILVEPEPK SMRALPSGVV PAVHQPLAED KSLLPFFSDE RVIRAAGGAG ALSDWLLRHV KSCQWLHGDY HHSETVIHRY GTGAMVLCWH CDNQLREQTS DSLDQLAQQN LAAWMIDIIR HAMNGAQERE LSLAELSWWA ACNQVVDALP EAVARRSLGL PAEKIRSVYR ESDIIPGEQT ATSILKQRTK NIALPPHTHQ QQNPPQEKTV VSIAVDPESP ESFMKRPKRR RWVNEKYTRW VKTQPCACCG KPADDPHHLI GHGQGGMGTK SHDIFTLPLC REHHNELHAD PLAFEEKHGS QVDLIFRFLD HAFATGVLG
|
| |