Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2761 |
Symbol | |
ID | 6064783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3032686 |
End bp | 3034107 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641602167 |
Product | Phage-related tail fibre protein-like protein |
Protein accession | YP_001725716 |
Protein GI | 170020762 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000146367 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCACAA AATTTTATAC CCTGCTGACG GATATTGGCG CGGCGAAACT TGCCAGCGCC GCCGCGCTCG GTGTGCCGCT AAAAATTACC CATATGGCGG TGGGCGATGG CGGTGGAGTA TTGCCAACGC CGGACGCAAA GCAGACGGCA CTGGTAAATG AGAAACGCCG GGCTGCGCTG AATATGCTTT ATATCGACCC GCAGAACAGC AGCCAGATTA TTGCCGAACA GGTGATCCCT GAAAACGAGG GCGGTTGGTG GATACGTGAA GTGGGCTTGT TTGATGAGTC CGGGGCATTG ATTGCCGTGG GCAACTGCCC GGAAAGCTAT AAGCCGCAAC TGGCTGAAGG TAGCGGGCGC ACTCAGACCG TGCGCATGGT GCTGATTACC AGCAGCACGG ACAATATCAC CCTGAAAATC GACCCTGCTG TAGTGCTGGC AACCCGCAAG TATGTGGATG ACAAGGCACT GGAGCTGAAG GTGTACGCGG ATGATCAGAT GGCAAAACAT CTTGCCGCAC CGGACCCGCA TTCACAGTAC GCGCCAAAAG CCAGCCCGAC ATTTACCGGA ACCCCCAAAG CGCCAACGCC AGCGGCGGGG AATAATACCA CGCAGGTTGC GACCACTGCG TTTGTACAGG CGGCACTGAC GGCCCTTATT AATGGTGCGC CAGCCACGCT GGACACGCTG AAAGAAATAG CCGCAGCCAT TAACAATGAT CCGAATTTCA GTACCACCAT TAACAATGCG CTGGCACTAA AAGCACCGTT GTCGAGTCCG GCACTCACCG GAACGCCAAC AGCCCCCACG GCGGCGCAGT CGGTCAACAA TACACAGATT GCCACCACGG CATTTGTGAA ATCGGCGATT GCGGCAATGG TGGGTTCTGC ACCTGCGGCA CTGGATACAC TGAACGAACT GGCGGCGGCA CTGGGGAATG ATCCGAACTT TGCCACGACA ATGCTTAATG CGCTGTCAGG TAAACAACCG CTGGACAATA CGCTTACCAA TTTGAGTGGA AAGGATGTAG CTGGTCTTCT CACATACCTT GGTTTGGGAG AGGCGGCAAA ACGGGATGTG GGCACAGGGG AAAATCAGAT ACCGGACATG GTTTCATTTA GTGGGGTGAG GGATTATTAC GGAAAACAAC TTTTGCCAGG AGGGTTGATA CTCCAGTGGC TGACGATTCC ATCAAGTGCA GCAGCCAAAG CTGTAACACT GAATAATGGT AATTATCAGC TGTCAGGCTA TAAATGGCCC CAGTCATTTG GTGTCCTGTT TGCTGTGTTT GCTACAAAAG TTTCTGGCTC GACTAACGAA GCATACGCAA TCTCAGTTAA TCGTCACTCT ACCGATGTAA TTGTCACCTG GAATGCCCGT AAGGCTGATG ACGTCCACAT TTTAGGAATT GGGAAATTAT GA
|
Protein sequence | MSTKFYTLLT DIGAAKLASA AALGVPLKIT HMAVGDGGGV LPTPDAKQTA LVNEKRRAAL NMLYIDPQNS SQIIAEQVIP ENEGGWWIRE VGLFDESGAL IAVGNCPESY KPQLAEGSGR TQTVRMVLIT SSTDNITLKI DPAVVLATRK YVDDKALELK VYADDQMAKH LAAPDPHSQY APKASPTFTG TPKAPTPAAG NNTTQVATTA FVQAALTALI NGAPATLDTL KEIAAAINND PNFSTTINNA LALKAPLSSP ALTGTPTAPT AAQSVNNTQI ATTAFVKSAI AAMVGSAPAA LDTLNELAAA LGNDPNFATT MLNALSGKQP LDNTLTNLSG KDVAGLLTYL GLGEAAKRDV GTGENQIPDM VSFSGVRDYY GKQLLPGGLI LQWLTIPSSA AAKAVTLNNG NYQLSGYKWP QSFGVLFAVF ATKVSGSTNE AYAISVNRHS TDVIVTWNAR KADDVHILGI GKL
|
| |