Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0641 |
Symbol | |
ID | 6970240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 662415 |
End bp | 665387 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643384679 |
Product | bacteriophage N4 receptor, outer membrane subunit |
Protein accession | YP_002269192 |
Protein GI | 209397557 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.369188 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGAGA ATAACCTTAA TCGCGTCATC GGATGGTCTG GTTTACTGCT GACGTCTTTA TTGAGTACCA GCGCACTCGC AGACAATATC GGCACCAGCG CAGAAGAGCT GGGGCTGAGC GATTATCGCC ATTTTGTTAT TTATCCCCGT CTCGATAAGG CGCTGAAGGC ACAGAAAAAT AACGACGAAG CAACCGCCAT CCGCGAATTT GAATATATAC ACCAGCAGGT GCCGGATAAT ATTCCGCTGA CTTTATACCT TGCGGAAGCC TATCGCCATT TTGGTCATGA TGACCGGGCA CGGCTGTTGC TTGAGGATCA ACTGAAACGT CACCCAGGAG ATGCCCGACT TGAGCGCAGT CTGGCGGCTA TTCCGGTTGA AGTGAAAAGC ATTACAACTG TTGAAGAACT GCTTGCCCAA CAAAAAGCGT GCGATGCTGC GCCGACCCTG CGTTGTCGCA GTGAAGTCGG GCAGAATGCC CTGCGGCTGG CACAGTTACC TGTCGCCAGA GCGCAACTGA ACGACGCGAC GTTTGCTGCA TCGCCGGAAG GAAAAACGCT GCGAACCGAT CTGCTGCAAC GGGCAATCTA CCTGAAACAA TGGTCCCAGG CAGATACGCT ATACAATGAA GCACGCCAGC AGAACACATT AAGCGCGGCA GAACGCCGTC AGTGGTTTGA CGTGCTTCTT GCCGGGCAGC TGGACGATCG GATCCTGGCA CTGCAATCAC AGGGGATCTT CACCGATCCT CAGTCATATA TTACTTACGC GACCGCGCTG GCTTATCGTG GCGAAAAAGC ACGCCTCCAG CATTATCTCA TTGAAAATAA GCCGCTGTTT ACCACGGACG CACAAGAGAA AAGTTGGCTC TATCTGTTAT CTAAATACAG CGCCAACCCC GTTCAGGCAT TGGCGAATTA TACGGTGCAG TTTGCCGATA ACCGCCAGTA TGTTGTTGGC GCGACGCTAC CGGTGCTGTT AAAAGAAGGT CAGTACGACG CAGCGCAAAA ACTGCTCGCC ACCCTCCCCG CCAATGAAAT GCTTGAGGAG CGTTATGCTG TCAGCGTGGC GACCCGTAAC AAGGCTGAAG CTCTGCGTCT GGCACGCTTG CTGTATCAGC AAGAATCGGC AAATCTTACC CGCCTGGATC AACTAACCTG GCAACTGATG CAGAACGAGC AGTCACGGGA AGCTGCCGAT TTATTGCTGC AACGCTATCC TTTCCAGGGC GATGCGCGTA TCAGCCAGAC TTTAATGGCG CGACTGGCGT CTCTGCTGGA AAGTCATCCT TACCTGGCAA CGCCGGCGAA GGTGGCGATT TTATCGAAAC CCTTACCGCT GGCGGAGCAA CGTCAGTGGC AAAGTCAGTT GCCGGGTATT GCAGATAATT GCCCGGCAAT AGTTCGCTTG CTGGGCGATA TGTCGCCTTC CTACGATGCC GCCGCCTGGA ACCGTCTGGC AAAGTGTTAT CGGGACACGC TACCCGGTGT GGCGTTGTAT GCATGGCTTC AGGCCGAACA ACGACAACCG AACGCCTGGC AACATCGTGC GGTAGCCTAT CAGGCGTATC AGGTTGAGGA CTACGCCACC GCACTGGCGG CCAGGCAGAA AATCAGTCTT CACGACATGA GCAATGAGGA TCTGCTTGCT GCTGCCAATA CCGCCCAGGC GGCAGGAAAT GGTGCGGCTC GCGATCGCTG GCTACAACAG GCAGAACAAC GTGGACTGGG AAGCAATGCC CTCTACTGGT GGCTGCATGC GCAACGTTAC ATTCCTGGTC AGCCGGAACT CGCACTGAAC GATCTCACGC GCTCAATCAA TATTGCGCCT TCTGCCAACG CTTACGTTGC GCGGGCGACA ATTTATCGCC AACGTCATAA TGTCCCGGCC GCGGTGAGTG ATTTGCGCGC CGCGCTGGAA CTGGAACCGA ATAATAGCAA CACCCAGGCA GCGCTTGGTT ACGCCTTGTG GGATAGCGGT GATATCGCAC AGTCGCGGGA AATGCTCGAA CAGGCGCATA AAGGGCTACC GGACGATCCG GCACTGATCC GACAACTGGC CTATGTGAAC CAGCGTCTGG ATGACATGCC TGCGACGCAG CACTACGCCC GGCTGGTGAT TGATGACATT GATAATCAGG CGCTGATAAC CCCACTGACC CCAGAGCAAA ATCAGCAACG CTTCAATTTC CGCCGTCTGC ATGAGGAGGT CGGTCGCCGC TGGACGTTCA GTTTCGATTC TTCCATCGGC TTGCGTTCCG GGGCAATGAG TACCGCTAAC AATAATGTTG GCGGCGCAGC GCCAGGGAAA AGCTATCGTA GCTACGGGCA ACTGGAAGCC GAATACCGCA TCGGACGCAA TATGCTGCTG GAAGGCGACC TGCTCTCAGT TTATAGCCGC TTCTTTGCCG ATACCGGAGA AAACGGGGTG ATGATGCCGG TGAAAAATCC GATGTCCGGC ACCGGTCTGC GCTGGAAGCC GCTGCGCGAT CAGATCTTTT TCCTCGCCGT CGAACAGCAG TTGCCGCTGA ACGGGCAAAA TGGCGCATCC GATACCATGC TGCGCGCCAG CGCCTCATTC TTTAATGGCG GCAAATACAG CGACGAATGG CACCCGAACG GTTCAGGCTG GTTTGCCCAA AACCTGTACC TCGATGCGGC GCAATATATC CGCCAGGATA TTCAGGCGTG GACGGCAGAT TATCGCGTCA GCTGGCATCA GAAGGTAGCT AACGGACAGA CTATTGAGCC TTACGCTCAC GTTCAGGACA ACGGCTATCG TGATAAAGGC ACTCAGGGCG CGCAGCTTGG CGGTGTCGGG GTCCGCTGGA ATATCTGGAC CGGCGAGACG CACTACGACG CCTGGCCGCA CAAAGTCAGT CTCGGCGTCG AGTATCAACA TACCTTTAAG GCGATTAATC AACGTAACGG AGAGCGCAAC AACGCGTTTC TCACCATTGG AGTGCACTGG TAA
|
Protein sequence | MKENNLNRVI GWSGLLLTSL LSTSALADNI GTSAEELGLS DYRHFVIYPR LDKALKAQKN NDEATAIREF EYIHQQVPDN IPLTLYLAEA YRHFGHDDRA RLLLEDQLKR HPGDARLERS LAAIPVEVKS ITTVEELLAQ QKACDAAPTL RCRSEVGQNA LRLAQLPVAR AQLNDATFAA SPEGKTLRTD LLQRAIYLKQ WSQADTLYNE ARQQNTLSAA ERRQWFDVLL AGQLDDRILA LQSQGIFTDP QSYITYATAL AYRGEKARLQ HYLIENKPLF TTDAQEKSWL YLLSKYSANP VQALANYTVQ FADNRQYVVG ATLPVLLKEG QYDAAQKLLA TLPANEMLEE RYAVSVATRN KAEALRLARL LYQQESANLT RLDQLTWQLM QNEQSREAAD LLLQRYPFQG DARISQTLMA RLASLLESHP YLATPAKVAI LSKPLPLAEQ RQWQSQLPGI ADNCPAIVRL LGDMSPSYDA AAWNRLAKCY RDTLPGVALY AWLQAEQRQP NAWQHRAVAY QAYQVEDYAT ALAARQKISL HDMSNEDLLA AANTAQAAGN GAARDRWLQQ AEQRGLGSNA LYWWLHAQRY IPGQPELALN DLTRSINIAP SANAYVARAT IYRQRHNVPA AVSDLRAALE LEPNNSNTQA ALGYALWDSG DIAQSREMLE QAHKGLPDDP ALIRQLAYVN QRLDDMPATQ HYARLVIDDI DNQALITPLT PEQNQQRFNF RRLHEEVGRR WTFSFDSSIG LRSGAMSTAN NNVGGAAPGK SYRSYGQLEA EYRIGRNMLL EGDLLSVYSR FFADTGENGV MMPVKNPMSG TGLRWKPLRD QIFFLAVEQQ LPLNGQNGAS DTMLRASASF FNGGKYSDEW HPNGSGWFAQ NLYLDAAQYI RQDIQAWTAD YRVSWHQKVA NGQTIEPYAH VQDNGYRDKG TQGAQLGGVG VRWNIWTGET HYDAWPHKVS LGVEYQHTFK AINQRNGERN NAFLTIGVHW
|
| |