Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3203 |
Symbol | |
ID | 6972182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2954131 |
End bp | 2955633 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387022 |
Product | phage portal protein, lambda family |
Protein accession | YP_002271489 |
Protein GI | 209398345 |
COG category | [R] General function prediction only |
COG ID | [COG5511] Bacteriophage capsid protein |
TIGRFAM ID | [TIGR01539] phage portal protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00254824 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAATTA TTGATGATGT GATCGGCGTG TTTTCCCCCG GGTGGAAAGC AGCCAGACTG CGTTCAAGGG CGTTAATCAT GGCCTATGAG GCGGTGAAAC CGACCCGGAC ACATAAAGCC CGGCGGGAAA ATCGCTCTGC TGATCAGCTC AGTAAATACG GTGCGGTTTC CCTGCGGGAG CAGGCCCGTT TTCTGGATAT CAATCATGAC CTGGTGATTG GTGTGTTTGA CAAGCTGGAA GAGCGGGTGA TTGGTGCCAG GGGAATTATT GTGGAGCCTC AGCCATTACG AAAAAACGGG GAAATGGCGG CTGAGCTGGC TGCGGATATC CGCCGTTTGT GGGCTGAATG GTCCGTGAGT CCGGATGTGA CAGGGCAGTA TACCCGTCCT GTGCTTGAAC GTTTACTGCT GCGGACCTGG CTGCGGGATG GTGAAGTGTT TGCGCAGATG GTCAGTGGTG CGGGAAACGG TCTGGAACGG ACGGCGGGAG TGCCATTCTG GCTTGAGGCG ATGGAGCCGG ATTTTGTTCC CATGCGCACT GATGAATCCG CCGGACTGAA TCAGGGGGTT TTTCTTGATG AGTGGGGAAG ACCGAAAAAA TATCTGGTTT ATAAAAATTA TCCGGTCAGA GGCCGGCAGA GTGATACGAA AGAAATCGCT GCCGGAAAAA TGATCCACCT GAAGTTCACT CGTCGTCTGC ATCAGACGCG AGGCTCATCC ATGTTATCGG GGGTGCTGAT GCGGATCAGT GCCCTTAAGG AGTATGAGGA TGCGGAACTG ACAGCGGCGC GTATTGCTGC GGCGCTGGGA CTGTATATCC GTAAAGGTGA CGGACAGGAC TATGAAGATC CGGGGAGCAA AGAGACCGAG CGGGAAGTCC ATATCACCCC GGGTATTATT TATGACGATT TGCGCAAGGG CGAGGATATC GGCATGGTCA AATCTGACCG TCCCAATCCC AACCTTGAAA CTTTCCGCAA CGGCCAGTTG CGTGCAGTGG CAGCAGGCAG TCGTCTGAGT TTTTCCAGTG CGGCGCGTAA CTATAACGGC ACCTACAGCG CCCAGCGGCA GGAGTTGGTC GAGTCCACGG ATGGTTACCT GATCCTGCAG GACTGTTTTA TTGGCGCGGT AACCCGCCCG GTGTACCGGA CATGGCTGAA TATGGTGGTT GCGGCAGGTC TGCTGAAAAT TCCGGCGGAT GTGGAGATGA AAACGCTATA TAACGCGACG TATTCCGGTC CGGTGATGCC GTGGATCGAC CCGGTTAAGG AAGCTGAAGC CTGGAGAATT CAGATCCGGG GTGGTGCAGC GACAGAATCT GACTGGGTGC GTGCTGGTGG GCGCAATCCG GATGAGGTCA AACGTCGCCG CAAGGCTGAA ATTGATGAAA ACAGCAGACT GGGGCTGGTC TTTGATACTG ACCCCGTCAA CGACAAAGGA GGCAACAGTG CCGGAACTGA ACGACAGTAT CAGCGCGACA CCGAAAGCCA GCATGAAGAA TAA
|
Protein sequence | MAIIDDVIGV FSPGWKAARL RSRALIMAYE AVKPTRTHKA RRENRSADQL SKYGAVSLRE QARFLDINHD LVIGVFDKLE ERVIGARGII VEPQPLRKNG EMAAELAADI RRLWAEWSVS PDVTGQYTRP VLERLLLRTW LRDGEVFAQM VSGAGNGLER TAGVPFWLEA MEPDFVPMRT DESAGLNQGV FLDEWGRPKK YLVYKNYPVR GRQSDTKEIA AGKMIHLKFT RRLHQTRGSS MLSGVLMRIS ALKEYEDAEL TAARIAAALG LYIRKGDGQD YEDPGSKETE REVHITPGII YDDLRKGEDI GMVKSDRPNP NLETFRNGQL RAVAAGSRLS FSSAARNYNG TYSAQRQELV ESTDGYLILQ DCFIGAVTRP VYRTWLNMVV AAGLLKIPAD VEMKTLYNAT YSGPVMPWID PVKEAEAWRI QIRGGAATES DWVRAGGRNP DEVKRRRKAE IDENSRLGLV FDTDPVNDKG GNSAGTERQY QRDTESQHEE
|
| |