Gene Information |
Locus tag | EcSMS35_0585 |
Symbol | |
ID | 6144465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 590591 |
End bp | 593563 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615477 |
Product | bacteriophage N4 receptor, outer membrane subunit |
Protein accession | YP_001742683 |
Protein GI | 170679633 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGAGA ATAACCTTAA TCGCGTCATC GGATGGTCTG GTTTACTGCT GACGTCTTTA TTGAGTACCA GCGCACTCGC AGACAATATC GGCACCAGCG CAGAAGAGCT GGGGCTGAGC GATTATCGCC ATTTTGTTAT TTATCCCCGT CTCGACAAGG CGCTGAAGGC ACAGAAAAAT AACGACGAAG CAACCGCCAT CCGCGAATTT GAATATATAC ACCAACAGGT GCCGGATAAT ATTCCACTAA CTTTATACCT TGCGGAAGCC TATCGCCATT TTGGTCATGA TGACCGGGCG CGGCTGTTGC TTGAGGATCA ACTGAAACGT CACCCAGGAG ATGCCCGACT TGAGCGCAGT CTGGCGGCTA TTCCAGTTGA AGTGAAAAGC GTTACGACTG TTGAAGAACT GCTTGCCCAG CAAAAAGCGT GCGACGCTGC GCCGACCCTG CGTTGTCGCA GTGAAGTTGG GCAGAATGCC CTGCGGCTGG CACAATTACC TGTCGCCAGA GCGCAACTGA ACGATGCGAC GTTTGCTGCA TCGCCGGAAG GAAAAACGCT GCGAACCGAT CTGCTGCAAC GCGCAATCTA CCTGAAACAA TGGTCCCAGG CAGATACGCT ATACAATGAA GCACGCCAGC AGAACACATT AAGCGCGGCA GAACGCCGCC AGTGGTTTGA TGTGCTTCTT GCCGGGCAGC TGGACGATCG GATCCTGGCG CTGCAATCAC AGGGGATCTT CACCGATCCG CAGTCATATA TTACTTACGC GACCGCGCTG GCTTATCGTG ACGAAAAAGC ACGCCTCCAG CGTTATCTCA TTGAAAATAA GCCGCTGTTT ACCACGGACG CACAAGAGAA AAGTTGGCTC TATCTGTTAT CTAAATACAG CGCCAACCCC GTTCCGGCGT TGGCGAATTA TACGGTACAG TTTGCCGATA ACCGCCAGTA TGTTGTTGGC GCGACGCTAC CGGTGCTGTT AAAAGAAGGT CAGTACGACG CAGCGCAAAA ACTGCTCGCC ACCCTCCCCG CCAATGAAAT GCTTGAGGAG CGTTATACCG TTAGCGTAGC GACCCATAAC AAGGCTGAAG CTCTGCGTCT GGCACGATTG CTGTATCAGC AAGAACCGGC AAATCTTACC CGCCTGGATC AACTGACCTG GCAACTGATG CAGAACGAGC AGTCACGCGA AGCTGCCGAT TTATTGCTGC AACGCTATCC TTTCCAGGGC GATGCGCGTG TCAGCCAGAC GTTAATGGCG CGACTGGCGT CTCTGCTGGA AAGTCATCCT TACCTGGCAA CGCCGGCGAA GGTGGCGATT TTATCGAAAC CCTTACCGCT GGCGGAGCAA CGTCAGTGGC AAAGTCAGTT GCCGGGTATT GCAGATAATT GCCCGGCTAT AGTTCGCTTG CTGGGCGATA TGTCGCCTTC CTACGATGCC GCCGCCTGGA ACCGTCTGGC AAAGTGTTAT CGGGACACGC TACCCGGTGT GGCGTTGTAT GCATGGCTTC AGGCCGAACA ACGACAACCG AACGCCTGGC AACATCGTGC GGTAGCCTAT CAGGCGTATC AGGTTGAGGA CTACGCCACC GCACTGGCGG CCTGGCAGAA AATCAGTCTT CACGACATGA GCAATGAGGA TCTGCTTGCT GCTGCCAATA CCGCCCAGGC GGCAGGAAAT GGCGCAACTC GCGATCGCTG GCTTCAGCAG GCAGAACAAC GTGGGCTGGG AAACAATGCC CTCTACTGGT GGCTGCATGC GCAACGTTAC ATTCCTGGTC AGCCGGAACT CGCACTGAAC 
GATCTCACGC GCTCAATCAA TATTGCGCCT TCTGCCAACG CTTACGTTGC GCGGGCGACA ATTTATCGCC AACGTCATAA TGTCCCGGCG GCGGTGAGTG ATTTGCGCGC CGCGCTGGAA CTGGAACCGA ATAATAGCAA CACCCAGGCA GCGCTCGGTT ACGCCTTGTG GGATAGCGGT GATATCGCAC AGTCGCGGGA AATGCTCGAA CAGGCGCATA AAGGGCTACC GGACGATCCG GCACTTATCC GACAACTGGC CTACGTGAAC CAGCGTCTGG ATGACATGCC TGCGACGCAG CACTACGCCC GGCTGGTGAT TGATGACATT GATAATCAGG CGCTGATAAC CCCACTGACC CCAGAGCAAA ATCAGCAACG CTTCAATTTC CGCCGTCTGC ATGAGGAGGT CGGTCGCCGC TGGACGTTCA GTTTCGATTC TTCTATCGGC TTGCGTTCCG GCGCAATGAG TACCGCTAAC AATAATGTCG GCGGCGCAGC GCCAGGGAAA AGCTATCGTA GCTACGGGCA ACTGGAAGCC GAGTATCGCA TCGGACGCAA TATGCTGCTG GAAGGCGACC TGCTCTCGGT TTACAGCCGT GTATTTGCCG ATACCGGAGA AAACGGGGTG ATGATGCCGG TGAAAAATCC GATGTCCGGC ACCGGTCTGC GCTGGAAGCC GCTGCGCGAT CAGATCTTTT TCCTCGCCGT CGAACAGCAG TTGCCGCTGA ACGGCCAAAA TGGTGCATCC GATACCATGC TGCGCGCCAG CGCCTCATTC TTTAATGGCG GCAAATACAG CGACGAATGG CACCCGAACG GTTCAGGCTG GTTTGCCCAA AACCTGTACC TCGATGCAGC GCAATATATC CGCCAGGATA TTCAGGCGTG GACGGCAGAT TATCGCGTCA GCTGGCATCA GAAGGTAGCT AACGGACAGA CTATTGAGCC TTACGCTCAC GTTCAGGACA ACGGCTATCG TGATAAAGGC ACTCAGGGCG CGCAGCTTGG CGGTGTCGGG GTCCGCTGGA ATATCTGGAC CGGCGAGACG CACTACGACG CCTGGCCGCA CAAAGTCAGT CTCGGCGTCG AGTATCAACA TACCTTTAAG GCGATTAATC AACGTAACGG AGAGCGCAAC AATGCGTTTC TCACCATTGG AGTGCACTGG TAA
|
Protein sequence | MKENNLNRVI GWSGLLLTSL LSTSALADNI GTSAEELGLS DYRHFVIYPR LDKALKAQKN NDEATAIREF EYIHQQVPDN IPLTLYLAEA YRHFGHDDRA RLLLEDQLKR HPGDARLERS LAAIPVEVKS VTTVEELLAQ QKACDAAPTL RCRSEVGQNA LRLAQLPVAR AQLNDATFAA SPEGKTLRTD LLQRAIYLKQ WSQADTLYNE ARQQNTLSAA ERRQWFDVLL AGQLDDRILA LQSQGIFTDP QSYITYATAL AYRDEKARLQ RYLIENKPLF TTDAQEKSWL YLLSKYSANP VPALANYTVQ FADNRQYVVG ATLPVLLKEG QYDAAQKLLA TLPANEMLEE RYTVSVATHN KAEALRLARL LYQQEPANLT RLDQLTWQLM QNEQSREAAD LLLQRYPFQG DARVSQTLMA RLASLLESHP YLATPAKVAI LSKPLPLAEQ RQWQSQLPGI ADNCPAIVRL LGDMSPSYDA AAWNRLAKCY RDTLPGVALY AWLQAEQRQP NAWQHRAVAY QAYQVEDYAT ALAAWQKISL HDMSNEDLLA AANTAQAAGN GATRDRWLQQ AEQRGLGNNA LYWWLHAQRY IPGQPELALN DLTRSINIAP SANAYVARAT IYRQRHNVPA AVSDLRAALE LEPNNSNTQA ALGYALWDSG DIAQSREMLE QAHKGLPDDP ALIRQLAYVN QRLDDMPATQ HYARLVIDDI DNQALITPLT PEQNQQRFNF RRLHEEVGRR WTFSFDSSIG LRSGAMSTAN NNVGGAAPGK SYRSYGQLEA EYRIGRNMLL EGDLLSVYSR VFADTGENGV MMPVKNPMSG TGLRWKPLRD QIFFLAVEQQ LPLNGQNGAS DTMLRASASF FNGGKYSDEW HPNGSGWFAQ NLYLDAAQYI RQDIQAWTAD YRVSWHQKVA NGQTIEPYAH VQDNGYRDKG TQGAQLGGVG VRWNIWTGET HYDAWPHKVS LGVEYQHTFK AINQRNGERN NAFLTIGVHW
|
| |
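
The length fields in this record can be cross-checked against each other. The sketch below is illustrative only and not part of the database: it assumes the Start bp and End bp values are 1-based inclusive coordinates (the record's own numbers are consistent with this) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length`, `protein_length`, `clean_sequence`, and `gc_fraction` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Values taken from the record above.
length_bp = gene_length(590591, 593563)
print(length_bp)                  # 2973, matching "Gene Length"
print(protein_length(length_bp))  # 990, matching "Protein Length"
```

To compare against the reported GC content, the full Gene sequence field can be passed through `clean_sequence` and then `gc_fraction`. Because the gene lies on the minus strand, the sequence as listed is presumably already given in coding orientation, since it begins with ATG and ends with TAA.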