Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03389 |
Symbol | ybl149 |
ID | 8116248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3613469 |
End bp | 3615472 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644849562 |
Product | hypothetical protein |
Protein accession | YP_003001135 |
Protein GI | 251786831 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACAGCC TTAAGGAGCA CAAGATGAAC ATTTCGGAAG TCGATCTGCG TAAACTGACG GTCAGCGATC CGTTCCTCGG TCAGTACCAA CAACTGGTCC GCGACGTGGT GATTTCTTAT CAATGGGATG CCTTGAACGA TCGTATCCCA GAAGCGGAAC CCAGCCATGC GATTGAAAAC TTTCGCATTG CTGCCGGACT TCAGGAGGGT GAATTTTACG GGATGGTGTT TCAGGACAGC GACGTCGCCA AATGGCTGGA AGCGGTAGCC TGGTCGCTGT GCCAGAAGCC GGACGCCGAA CTGGAAAAAA CCGCCGACGA GGTAATCGAA CTGATCGCCT CCGCCCAATG TGAAGACGGC TATCTCAATA CTTACTTTAC GGTAAAAGCA CCCGAAGAAC GCTGGAGCAA TCTTGCGGAG TGTCATGAAC TTTACTGCGC CGGTCATCTG ATTGAAGCCG GAGTCGCCTT CTTCCAGGCC ACGGGAAAAC GACGCTTGCT GGAGGTGGTT TGCCGTCTGG CCGATCATAT CGACCGCGTA TTTGGTCCAG ATGAAAGTAA GTTACACGGT TATCCTGGTC ACCCGGAAAT TGAACTGGCA CTAATGCGCC TGTATGAAGT GACTGAAGAG CCGCGCTACC TGGCGCTGAC GAACTATTTT GTCGAACAGC GTGGTGCGCA ACCGCACTAT TACGACCAAG AATATGAAAA GCGCGGGCAG ACATCGCACT GGCACACCTA CGGCCCGGCG TGGATGGTGA AAGACAAAGC CTACAGCCAG GCACATTTGT CCCTTGCGCA ACAGCAAACC GCCATCGGTC ACGCGGTACG TTTTGTCTAC CTGATGACCG GCGTCGCGCA TCTCGCGCGT TTAAGTCACG ATGACAGCAA GCGTCAGGAC TGCCTGAGGC TGTGGAACAA TATGGCCCAG CGTCAGTTAT ATATTACCGG CGGCATTGGC TCGCAAAGCA GCGGCGAAGC GTTCACTAGC GATTACGATC TGCCGAATGA CACGGTTTAC GCCGAAAGTT GTGCTTCCAT CGGCCTGATG ATGTTCGCCC GGCGAATGCT GGAAATGGAA GGCGACAGTC AATATGCCGA TGTGATGGAG CGCGCGCTGT ACAACACCGT GCTCGGCGGC ATGGCGCTGG ATGGCAAACA TTTCTTCTAT GTGAATCCGC TGGAAGTACA TCCAAAATCG CTGAAATTCA ACCATATCTA CGATCACGTT AAACCGATCC GCCAGCGTTG GTTTGGCTGC GCTTGTTGTC CGCCAAATAT CGCCCGCGTG CTGACCTCGA TTGGTCATTA TCTCTACACG CCGCGTGAAG ATGCGTTGTA TATCAACATA TACGCAGGAA ACAGCATGGA AGTGCCGGTA GAAAATGGCA CGCTGCGCCT GCGGGTTAGC GGGAACTATC CGTGGCAGGA GCAGGTGACG ATTGCGGTTG AATCGCCCCA GCCGGTACGT CATACGCTGG CTTTACGTCT GCCGGACTGG TGCACACAGC CGCAGATCAT ATTGAATGGG GAAGAGGTCG AGCAGGATAT TCGTAAAGGG TATTTGCACA TTACCCGCGA ATGGCAGGAG GGCGATACGC TGAATCTGAC TTTGCCGATG CCGGTACGCC GCGTTTACGG TAACCCGCTG GTGCGTCACG TCGCCGGAAA AGTGGCGATT CAGCGCGGCC CGCTGGTGTA TTGCCTGGAA CAGGCCGACA ACGGCGAGTC ACTGCATAAT CTGTGGCTGC CCACCGATGC GCCATTTACG ACATTTGAAG GCAAGGGATT GTTTAGCCAT AAGATCTTAA TCCAGGCACC GGGTTACCGG TATGAACAGA GCAATCCAGA GCAGCAACCG CTGTGGCATT ACGACAGCGC GCCAGCCAAA CGCCAGCCGC AAACTCTGAC GTTTATCCCG TGGTTTAGCT GGGCTAACCG GGGCGAAGGC GAAATGCGGA TCTGGGTGAA TGAGGAAAAG CATCGCCATC CGGAGGTTGG ATAA
|
Protein sequence | MYSLKEHKMN ISEVDLRKLT VSDPFLGQYQ QLVRDVVISY QWDALNDRIP EAEPSHAIEN FRIAAGLQEG EFYGMVFQDS DVAKWLEAVA WSLCQKPDAE LEKTADEVIE LIASAQCEDG YLNTYFTVKA PEERWSNLAE CHELYCAGHL IEAGVAFFQA TGKRRLLEVV CRLADHIDRV FGPDESKLHG YPGHPEIELA LMRLYEVTEE PRYLALTNYF VEQRGAQPHY YDQEYEKRGQ TSHWHTYGPA WMVKDKAYSQ AHLSLAQQQT AIGHAVRFVY LMTGVAHLAR LSHDDSKRQD CLRLWNNMAQ RQLYITGGIG SQSSGEAFTS DYDLPNDTVY AESCASIGLM MFARRMLEME GDSQYADVME RALYNTVLGG MALDGKHFFY VNPLEVHPKS LKFNHIYDHV KPIRQRWFGC ACCPPNIARV LTSIGHYLYT PREDALYINI YAGNSMEVPV ENGTLRLRVS GNYPWQEQVT IAVESPQPVR HTLALRLPDW CTQPQIILNG EEVEQDIRKG YLHITREWQE GDTLNLTLPM PVRRVYGNPL VRHVAGKVAI QRGPLVYCLE QADNGESLHN LWLPTDAPFT TFEGKGLFSH KILIQAPGYR YEQSNPEQQP LWHYDSAPAK RQPQTLTFIP WFSWANRGEG EMRIWVNEEK HRHPEVG
|
| |