Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3084 |
Symbol | |
ID | 6066207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3376741 |
End bp | 3379713 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641602500 |
Product | bacteriophage N4 receptor, outer membrane subunit |
Protein accession | YP_001726035 |
Protein GI | 170021081 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGAGA ATAACCTTAA TCGCGTCATC GGATGGTCTG GTTTACTGCT GACGTCTTTA TTGAGTACCA GCGCACTCGC AGACAATATC GGCACCAGCG CAGAAGAGCT GGGGCTGAGC GATTATCGCC ATTTTGTTAT TTATCCCCGT CTCGATAAGG CGCTGAAGGC ACAGAAAAAT AACGACGAAG CAACCGCCAT CCGCGAATTT GAATATATAC ACCAGCAGGT GCCGGATAAT ATTCCGCTGA CTTTATACCT TGCGGAAGCC TATCGCCATT TTGGTCATGA TGACCGGGCG CGGCTGTTGC TGGAGGATCA ACTGAAACGT CACCCAGGAG ATGCCCGACT TGAGCGCAGT CTGGCGGCTA TTCCAGTTGA AGTGAAAAGC GTTACGACTG TTGAAGAACT GCTTGCCCAG CAAAAAGCGT GCGATGCTGC GCCGACCCTG CGTTGTCGCA GTGAAGTCGG GCAGAATGCC CTGCGGCTGG CACAGTTACC TGTCGCCAGA GCGCAACTGA ACGATGCGAC GTTTGCTGCA TCGCCGGAAG GAAAAACGCT GCGAACCGAT CTGCTGCAAC GGGCAATCTA CCTGAAACAA TGGTCCCAGG CAGATACGCT ATACAATGAA GCACGCCAGC AGAACACATT AAGCGCGGCA GAACGCCGTC AGTGGTTTGA CGTGCTTCTT GCCGGGCAGC TGGACGATCG GATCCAGGCA CTGCAATCAC AGGGGATCTT CACCGATCCT CAGTCATATA TTACTTACGC GACCGCGCTG GCTTATCGTG GCGAAAAAGC ACGCCTCCAG CATTATCTCA TTGAAAATAA GCCGCTGTTT ACCACGGACG CACAAGAGAA AAGTTGGCTC TATCTGTTAT CTAAATACAG CGCCAACCCC GTTCAGGCAT TGGCGAATTA TACGGTGCAG TTTGCCGATA ACCGCCAGTA TGTTGTTGGC GCGACGCTAC CGGTGCTGTT AAAAGAAGGT CAGTACGACG CAGCGCAAAA ACTGCTCGCC ACCCTCCCCG CCAATGAAAT GCTTGAGGAG CGTTATGCTG TCAGCGTGGC GACCCGTAAC AAGGCTGAAT CTCTGCGTCT GGCACGATTG CTGTATCAGC AAGAACCGGC AAATCTTACC CGCCTGGATC AACTGACCTG GCAACTGATG CAGAACGAGC AGTCACGCGA AGCTGCCGAT TTATTGCTGC AACGCTATCC TTTCCAGGGC GATGCGCGTG TCAGCCAGAC TTTAATGGCG CGACTGGCGT CTCTGCTGGA AAGTCATCCT TACCTGGCAA CTCCAGCGAA GGTGGCGATT TTATCAAAAC CCTTGCCGCT GGCGGCGCAA CGTCAGTGGC AAAGCCAGTT GCCGGGTATT GCAGATAATT GCCCGGCAAT AGTTCGCTTG CTGGGCGATA TGTCGCCTTC CTACGATGCC GCCGCCTGGA ACCGTCTGGC AAAGTGTTAT CGGGACACGC TACCCGGTGT GGCCTTGTAT GCATGGCTTC AGGCCGAACA ACGACAACCG AACGCCTGGC AACATCGTGC GGTAGCCTAT CAGGCGTATC AGGTTGAGGA CTACGCCACC GCACTGGCGG CCTGGCAGAA AATCAGTCTT CACGACATGA GCAATGAGGA TCTGCTTGCT GCTGCCAATA CCGCCCAGGC GGCAGGAAAT GGTGCAGCTC GCGATCGCTG GCTACAACAG GCAGAACAAC GTGGACTGGG AAACAATGCC CTCTACTGGT GGCTGCATGC GCAACGTTAC ATTCCTGGTC AGCCGGAACT CGCACTGAGC GATCTCACGC GCTCAATCAA TATTGCGCCT TCTGCCAACG CTTACGTTGC GCGGGCGACA ATTTATCGCC AACGTCATAA TGTCCCGGCG GCGGTGAGTG ATTTGCGCGC CGCGCTGGAA CTGGAACCGA ATAATAGCAA CATCCAGGCA GCGCTTGGTT ACGCCTTGTG GGATAGCGGT GATATCGCAC AGTCGCGGGA AATGCTCGAA CAGGCGCATA AAGGGCTACC GGACGATCCG GCACTGATCC GACAACTGGC CTATGTGAAC CAGCGTCTGG ATGACATGCC TGCGACGCAG CACTACGCCC GCCTGGTGAT TGATGACATT GATAATCAGG CGCTGATAAA CCCACTCACC CCAGAGCAAA ATCAGCAACG CTTCAATTTC CGCCGTCTGC ATGAGGAGGT CGGTCGCCGC TGGACGTTCA GTTTCGATTC TTCTATCGGC TTGCGTTCCG GGGCAATGAG CACCGCTAAC AATAATGTCG GCGGCGCAGC GCCAGGGAAA AGCTATCGTA GCTACGGGCA ACTGGAAGCC GAGTACCGCA TCGGACGCAA TATGCTGCTG GAAGGCGACC TGCTCTCAGT TTATAGCCGC GTCTTTGCCG ATACCGGAGA AAACGGGGTG ATGATGCCGG TGAAAAATCC GATGTCCGGC ACCGGTCTAC GCTGGAAGCC GCTGCGCGAT CAGATCTTTT TCCTCGCCGT CGAACAGCAG TTGCCGCTGA ACGGGCAAAA TGGCGCATCC GATACCATGC TGCGCGCCAG CGCCTCATTC TTTAATGGCG GCAAATACAG CGACGAATGG CACCCGAACG GTTCAGGCTG GTTTGCCCAA AACCTGTACC TCGATGCGGC GCAATATATC CGCCAGGATA TTCAGGCGTG GACGGCAGAT TATCGCGTCA GCTGGCATCA GAAGGTAGCT AACGGACAGA CTATTGAGCC TTACGCTCAC GTTCAGGACA ACGGCTATCG TGATAAAGGC ACTCAGGGCG CGCAGCTTGG CGGTGTCGGG GTCCGCTGGA ATATCTGGAC CGGCGAGACG CACTACGACG CCTGGCCGCA CAAAGTCAGT CTCGGCGTCG AGTATCAACA TACCTTTAAG GCGATTAATC AACGTAACGG AGAGCGCAAC AACGCGTTTC TCACCATTGG AGTGCACTGG TAA
|
Protein sequence | MKENNLNRVI GWSGLLLTSL LSTSALADNI GTSAEELGLS DYRHFVIYPR LDKALKAQKN NDEATAIREF EYIHQQVPDN IPLTLYLAEA YRHFGHDDRA RLLLEDQLKR HPGDARLERS LAAIPVEVKS VTTVEELLAQ QKACDAAPTL RCRSEVGQNA LRLAQLPVAR AQLNDATFAA SPEGKTLRTD LLQRAIYLKQ WSQADTLYNE ARQQNTLSAA ERRQWFDVLL AGQLDDRIQA LQSQGIFTDP QSYITYATAL AYRGEKARLQ HYLIENKPLF TTDAQEKSWL YLLSKYSANP VQALANYTVQ FADNRQYVVG ATLPVLLKEG QYDAAQKLLA TLPANEMLEE RYAVSVATRN KAESLRLARL LYQQEPANLT RLDQLTWQLM QNEQSREAAD LLLQRYPFQG DARVSQTLMA RLASLLESHP YLATPAKVAI LSKPLPLAAQ RQWQSQLPGI ADNCPAIVRL LGDMSPSYDA AAWNRLAKCY RDTLPGVALY AWLQAEQRQP NAWQHRAVAY QAYQVEDYAT ALAAWQKISL HDMSNEDLLA AANTAQAAGN GAARDRWLQQ AEQRGLGNNA LYWWLHAQRY IPGQPELALS DLTRSINIAP SANAYVARAT IYRQRHNVPA AVSDLRAALE LEPNNSNIQA ALGYALWDSG DIAQSREMLE QAHKGLPDDP ALIRQLAYVN QRLDDMPATQ HYARLVIDDI DNQALINPLT PEQNQQRFNF RRLHEEVGRR WTFSFDSSIG LRSGAMSTAN NNVGGAAPGK SYRSYGQLEA EYRIGRNMLL EGDLLSVYSR VFADTGENGV MMPVKNPMSG TGLRWKPLRD QIFFLAVEQQ LPLNGQNGAS DTMLRASASF FNGGKYSDEW HPNGSGWFAQ NLYLDAAQYI RQDIQAWTAD YRVSWHQKVA NGQTIEPYAH VQDNGYRDKG TQGAQLGGVG VRWNIWTGET HYDAWPHKVS LGVEYQHTFK AINQRNGERN NAFLTIGVHW
|
| |