Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2111 |
Symbol | |
ID | 6067136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2305959 |
End bp | 2308982 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641601519 |
Product | prophage tail fibre domain-containing protein |
Protein accession | YP_001725078 |
Protein GI | 170020124 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0810598 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTAA AGATTTCAGG TGTACTGAAA GACGGCACAG GAAAACCGGT ACAGAACTGC ACAATCCAGC TGAAAGCAAA ACGTAACAGC ACCACGGTGG TGGTGAACAC GCTGGCCTCA GAAAATCCGG ATGAAGCCGG GCGTTACAGT ATGGACGTTG AGTACGGTCA GTACAGCGTT ATTCTGTTGG TGGAAGGATT CCCGCCGTCA CATGCCGGGA CCATCACCGT GTATGAAGAT TCCCGACCCG GTACGCTGAA TGATTTTCTC GGTGCCATGA CGGAGGATGA TGCCCGTCCG GAGGCACTGC GCCGTTTTGA GCTGATGGTG GAAGAGGTGG CGCGTAACGC GTCCGCGGTG GCACAGAACA CGGCAGCCGC GAAGAAGTCA GCCGGCGATG CCGGCACATC AGCCCGTGAG GCGGCAACCC ATGCGACTGA TGCTGCAGGC TCAGCACGTG CAGCCAGCAC ATCAGCCGGG CAGGCCGCGA CGTCGGCTCA GTCAGCGTCT TCCAGCGCAG GAACGGCATC AACAAAGGCT ACTGAAGCAT CAAAAAGTGC TGCCGCTGCA GAGTCCTCAA AAAGCGCGGC AGCTACCAGT GCCGGTGCGG CGAAAACGTC AGAAACGAAT GCGGCAGCGT CACAAAAATC TGCAGCCACT TCTGCATCCG CAGCGACCAC AAAGGCGTCA GAAGCTGCCA CCTCAGCCCG GGATGCGGCG GCCTCAAAAG AGGCAGCGAA ATCATCAGAA ACGAACGCAT CATCAAGCGC CAGTAGTGCC GCTTCCTCGG CAACGGCGGC AGGAAATTCC GCGAAGGCGG CAAAGACGTC CGAGACGAAC GCCAGGTCTT CTGAAACGGC AGCGGGACAG AGTGCCTCAG CTGCGGCAGG CTCAAAAACA GCGGCAGCAT CATCTGCCAG TGCCGCGTCA ACAAGTGCCG GGCAGGCCTC AGCCAGTGCC ACCGCCGCCG GAAAATCGGC AGAAAGCGCC GCATCGTCTG CTTCAACAGC CACAACGAAG GCTGGCGAAG CCACTGAACA GGCCAGCGCA GCAGCGAGGT CTGCTTCCGC AGCGAAGACA TCCGAAACGA ACGCGAAAGC GTCGGAAACC AGCGCAGAAT CCTCAAAAAC GGCTGCCGCA TCGTCAGCCA GTTCGGCGTC GTCATCGGCA TCATCTGCGT CTGCTTCAAA AGATGAGGCG ACCAGACAGG CGTCAGCAGC GAAGGGTAGC GCCACGACGG CATCCACGAA GGCGACAGAG GCAGCTGGCA GTGCGACGGC AGCAGCTCAG AGCAAAAGTA CGGCGGAATC TGCAGCAACG CGCGCTGAGA CAGCGGCAAA ACGGGCAGAG GATATTGCAT CCGCCGTGGC GCTTGAGGAT GCGAGCACGA CGAAAAAGGG GATAGTACAG CTCAGCAGTG CGACCAACAG CACTTCCGAG TCACTGGCGG CAACGCCAAA AGCCGTTAAG GCCGCGTATG ACCTGGCTAA CGGGAAATAC ACCGCACAGG ATGCAACGAC AGCACAGAAA GGGATAATCC AGCTAAGCAG CGCGACCAAC AGCACGTCTG AAACGCTGGC GGCAACGCCA AAGGCAGTAA AAGCAGCCAA TGACAATGCT GAGAAACGTC TGCAGAAAGA TCAGAACGGT GCGGATATCC CTGGCAAAGA CACCTTTACG AAAAATATTG GTGCCTGCCG TGCCTTCGGT GGGTCAGTAA GCACAACAAC AGGAAACTGG ACGACTGCAC AGTTTATCGA GTGGCTGGAT TCTCAGGGAG CATTTAACCA TCCATACTGG ATGTGCAAGG GTTCCTGGTC TTATGGCAAT AATAAAATCA TTACTGATAC TGGCTGCGGT AATATTCATC TCGCCGGAGC TGTCATTGAA GTAATGGGGA TAAAGTCAGC GATGACGATC CGCATTACCA CACCGACCAC CTCCACTGGT GGTGGAACAA CTAACGCCCA GTTTACCTAT ATTAATCACG GAACAGATTA TTCACCTGGC TGGCGAAGGG ACTATAACTC CAGAAATAAG CCAACGGCAT CAGAGATCGG GGCGTTACCG TCAGGTGGAA CAGCAGTATC ATCAGTTAAT CTGTCTTCAA AAGGTCGGGT AACCGCGCTG ACAGACAATA CGCAGGGGGC AACAGGTCTT GAGTTATACG AGGTGTATAA CAACGGATAT CCAACAGCGT ATGGAAATAT CATTCACCTG AAAGGGATGA CAGCCGTTGG CGAAGGTGAG TTACTCATCG GCTGGAGTGG TACAAGCGGT GCTCATGCTC CGGCATTTAT TCGTTCACGA CGGGATACGA CCGACGCAAA CTGGTCGCCG TGGGCGCAGC TTTACACCTC GGCTCATCCT CCTGCAGAGT TTTATCCAGT CGGTGCACCA ATCCCGTGGC CATCAGATAC CGTTCCGTCT GGTTATGCCC TGATGCAGGG GCAGACTTTT GACAAATCTG CTTACCCGAA ACTTGCAGCC GCTTATCCGT CAGGCGTGAT CCCTGATATG CGTGGCTGGA CGATTAAGGG CAAGCCCGCC AGTGATCGAG CCGTATTGTC TCAGGAACAG GACGGCATTA AATCACACAC CCACAGCGCC AGCGTATCCA GTACGGATTT GGGGACGAAA ACCACATCGT CGTTTGATTA CGGCACTAAA TCCACGAATA ACACTGGTGC GCATACCCAT AGTGTTAGCG GTACGGCTGC TTCAGCCGGT GCACATACCC ATTCGATGAC ATTTGTTTCA GGTGGTTCCA GTGGTGCTCC GGGAAGTGGA TCACCTGATT ATTCTAAATA CAGTGTTAAC ACTTCTTCTG CAGGCGCTCA TACGCACTCT GTATCGGGTA CTGCTGCAAG CGCAGGTGCA CACGCACATA CTGTCGGTAT TGGTGCTCAT ACGCACTCCG TTGCGATTGG TTCACATGGA CATACCATCA CCGTTAACGC TGCTGGTAAC GCGGAAAACA CCGTCAAAAA CATCGCATTT AACTATATTG TGAGGCTTGC ATAA
|
Protein sequence | MAVKISGVLK DGTGKPVQNC TIQLKAKRNS TTVVVNTLAS ENPDEAGRYS MDVEYGQYSV ILLVEGFPPS HAGTITVYED SRPGTLNDFL GAMTEDDARP EALRRFELMV EEVARNASAV AQNTAAAKKS AGDAGTSARE AATHATDAAG SARAASTSAG QAATSAQSAS SSAGTASTKA TEASKSAAAA ESSKSAAATS AGAAKTSETN AAASQKSAAT SASAATTKAS EAATSARDAA ASKEAAKSSE TNASSSASSA ASSATAAGNS AKAAKTSETN ARSSETAAGQ SASAAAGSKT AAASSASAAS TSAGQASASA TAAGKSAESA ASSASTATTK AGEATEQASA AARSASAAKT SETNAKASET SAESSKTAAA SSASSASSSA SSASASKDEA TRQASAAKGS ATTASTKATE AAGSATAAAQ SKSTAESAAT RAETAAKRAE DIASAVALED ASTTKKGIVQ LSSATNSTSE SLAATPKAVK AAYDLANGKY TAQDATTAQK GIIQLSSATN STSETLAATP KAVKAANDNA EKRLQKDQNG ADIPGKDTFT KNIGACRAFG GSVSTTTGNW TTAQFIEWLD SQGAFNHPYW MCKGSWSYGN NKIITDTGCG NIHLAGAVIE VMGIKSAMTI RITTPTTSTG GGTTNAQFTY INHGTDYSPG WRRDYNSRNK PTASEIGALP SGGTAVSSVN LSSKGRVTAL TDNTQGATGL ELYEVYNNGY PTAYGNIIHL KGMTAVGEGE LLIGWSGTSG AHAPAFIRSR RDTTDANWSP WAQLYTSAHP PAEFYPVGAP IPWPSDTVPS GYALMQGQTF DKSAYPKLAA AYPSGVIPDM RGWTIKGKPA SDRAVLSQEQ DGIKSHTHSA SVSSTDLGTK TTSSFDYGTK STNNTGAHTH SVSGTAASAG AHTHSMTFVS GGSSGAPGSG SPDYSKYSVN TSSAGAHTHS VSGTAASAGA HAHTVGIGAH THSVAIGSHG HTITVNAAGN AENTVKNIAF NYIVRLA
|
| |