Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_00851 |
Symbol | ybl47 |
ID | 8115118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 887859 |
End bp | 889481 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644847115 |
Product | hypothetical protein |
Protein accession | YP_002998688 |
Protein GI | 251784384 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAA AATTTTATAC CCTGCTGACG GATATTGGCG CGGCGAAACT TGCCAGCGCC GCCGCGCTCG GTGTGCCTTT AAAAATTACC CATATGGCGG TCGGCGATGG CGGCGGAGTA TTGCCAACGC CGGATGCAAA GCAGACTGCA CTGGTGAATG AGAAACGCCG GGCTGCGCTG AATATGCTCT ATATCGACCC GCAGAACAGT AGCCAGATTA TTGCTGAACA GGTAATCCCT GAAAATGAGG GCGGTTGGTG GATACGTGAA GTGGGCCTGT TTGATGAGTC CGGGGCATTG ATTGCCGTGG GAAACTGCCC GGAAAGCTAT AAGCCGCAAC TGGCTGAAGG CAGTGGGCGT ACCCAGACCG TGCGCATGGT GCTGATTACC AGCAGCACGG ACAATATCAC CCTGAAAATC GACCCTGCCG TCGTGCTGGC AACCCGCAAG TATGTGGATG ACAAGGTACT GGAGCTGAAG GTGTTCGTGG ATGATAAGAT GGCAAAACAT CTTGCCGCAC CGGACCCGCA TTCACAGTAT GCACCCAAAG AAAGCCCGAC ATTGACCGGA ACACCCAAAG CGCCAACGCC AGCGGAGGGG AATAACACCA CGCAGATTGC GACCACCGCG TTTGTTCAGG CGGCACTGAT GGCCCTTATT AATGGTGCGC CAGCCACACT GGATACGATG AAAGAAATTG CCGCTGCCAT TAATAATGAC CCGAAATTCA GTACCACCAT TAACAATGCG CTGGCACTGA AAGCGCCGCT GTTAAGTCCG GCATTCACCG GAACGCCAAC AGCCCCCACT GCCGCACAGT CGGTTAACAA TACACAGATT GCCACCACGG CTTTTGTGAA ATCGGCAATT GCGGCAATGG TGGGGTCTGC ACCTGCTGCA CTGGATACAC TGAACGAACT GGCTGCGGCG CTGGGGAATG ACCCGAACTT TGCCACGACA ATGCTTAATG CGCTGGCAGG TAAACAACCG TTGGACAATA CGCTGACTAA TTTGAGTGGA AAGGATGTAG CTGGTCTTCT CGCATACCTT GGTTTGGGAG ATGCATTAAT TGGTGATGAA TGTAAAATTG CAGGGTTTGA CAGTAGTAAC GTCAATGCCC CGTATATGCG ATTCGCCAGA ACAAATACAG TAGTTCGTCT GGCAACAAAA GACTATGCGC AACCAAAAGA CCAGACACTG ACAGATTTAA GCGGTAAGGA TAAGGCTGAA CTAAGAACTT ATCTTGATCT GAAAAGTGCG GCTCAAAGGG ATGTTGGCTC AGGGGCAAAT CAGATTCCGG ATATGAATGA CTTCACATCC AGCCTGACCA GCCCTGGCTG GCAAAAATTA CCGTCAGGTC TGATTATTCA GTGGGGGGCA GCCAATCCAT CATCAACTGG AGAGATCTTT ATTACGTTTC CTGTCGCGTT CTCTGCATAC CCGATGTATG TGGGATTTGG TCCTCAGCAG GCATCGCTTC CTAACGTAGT TCAGTCGCCA GTAATTTCAG CGCCAACGAT AACTAATTTA GGATGCGGCG TCCGAAATCT GATGATTCCA ACAGCGGGCG GAGCACCAGT AGCCAGCATG AGTTCATTTT TCTGGATTGC GGTAGGGAAA TAA
|
Protein sequence | MSTKFYTLLT DIGAAKLASA AALGVPLKIT HMAVGDGGGV LPTPDAKQTA LVNEKRRAAL NMLYIDPQNS SQIIAEQVIP ENEGGWWIRE VGLFDESGAL IAVGNCPESY KPQLAEGSGR TQTVRMVLIT SSTDNITLKI DPAVVLATRK YVDDKVLELK VFVDDKMAKH LAAPDPHSQY APKESPTLTG TPKAPTPAEG NNTTQIATTA FVQAALMALI NGAPATLDTM KEIAAAINND PKFSTTINNA LALKAPLLSP AFTGTPTAPT AAQSVNNTQI ATTAFVKSAI AAMVGSAPAA LDTLNELAAA LGNDPNFATT MLNALAGKQP LDNTLTNLSG KDVAGLLAYL GLGDALIGDE CKIAGFDSSN VNAPYMRFAR TNTVVRLATK DYAQPKDQTL TDLSGKDKAE LRTYLDLKSA AQRDVGSGAN QIPDMNDFTS SLTSPGWQKL PSGLIIQWGA ANPSSTGEIF ITFPVAFSAY PMYVGFGPQQ ASLPNVVQSP VISAPTITNL GCGVRNLMIP TAGGAPVASM SSFFWIAVGK
|
| |