Gene Information |
Locus tag | B21_02004 |
Symbol | yehI |
ID | 8116160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 2094454 |
End bp | 2098086 |
Gene Length | 3633 bp |
Protein Length | 1210 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644848216 |
Product | hypothetical protein |
Protein accession | YP_002999789 |
Protein GI | 251785485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
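The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon. The record does not state either convention, so both are assumptions here. A minimal Python sketch of the arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2094454, 2098086)
print(length_bp)                  # 3633
print(protein_length(length_bp))  # 1210
```

Both printed values match the Gene Length and Protein Length fields of this record.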
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAGG AATTACCGTG GCTGGCGGAT AACGCCCAAC TGGAACTGAA ATATAAAAAA GGCAAAACGC CGCTCAGTCA TCGTCGCTGG CCGGGCGAAC CAGTGTCCGT TATCACTGGA AGTCTCATCC AGACATTGGG TGATGAATTG CTACAAAAAG CTGAGAAGAA AAAAAACATT GTCTGGCGTT ATGAGAATTT TTCACTGGAG TGGCAGTCCG CCATCACGCA GGCCATCAAC TTGATCGGCG AACACAAACC CTCAGTCCCG GCCCAGACAA TGGCGGCGCT AGCCTGTATC GCGCAAAATG ACAGCCAACA GTTGCTCGAC GAAATCGTCC AACAAGAGGG GCTGGAATAT GCGACTGAGG TGGTGATTGC ACGCCAGTTT ATTGCGCGGT GTTATGAGAG TGATCCTCTG GTAGTGACAT TGCAGTATCA GGACGAGGAT TATGGCTATG GTTATCGCTC AGAAACCTAT AACGAATTCG ATCTCCGACT GCGTAAGCAT CTCTCTCTGG CAGAGGAAAG CTGCTGGCAG CGTTGCGCCG ACAAACTCAT TGCCGCACTA CCAGGAATAA CCAAAGTTCG CCGCCCTTTT ATTGCGCTGA TCCTCCCGGA AAAACCAGAA ATAGCCAATG AGTTGGTAGG CCTTGAATGC CCGCGAACTC ATTTTCATTC TAAGGAGTGG TTAAAAGTTG TTGCTAATGA CCCCACAGCG GTGAGAAAAC TCGAACACTA CTGGAGCCAG GATATATTTA GCGATCGAGA AGCCAGCTAC ATGTCGCATG AAAACCACTT CGGCTACGCG GCCTGCGCCG CCCTTTTGCG CGAACAAGGA CTGGCAGCCA TTCCGCGCCT CGCGATGTAT GCCCATAAAG AAGATTGCGG CAGTCTGCTG GTACAAATTA ACCATCCGCA AGTCATCCGC ACCTTGCTAC TGGTGGCTGA TAAAAACAAA CCCAGCCTGC AACGTGTAGC TAAATACCAT AAAAACTTCC CCCATGCGAC GCTCGCCGCA CTGGCAGAAC TGCTGGCGTT AACAGAACCA CCAGCCCGCC CTGGTTATCC AATCATCGAA GACAAAAAGC TGCCTGCACA GCAAAAAGCA CGCGATGAAT ACTGGCGTAC GCTGTTACAG ACGCTGATGG CATCGCAGCC ACAACTGGCA GAAGAGGTGA TGCAGTGGTT AAGTACTCAA GCCAGGGCAG TGCTGAATAG TTATTTATCG GCACCGCCCA AACCGGTTAT TGATAGTACC GATAACAGCA ATCTGCCTGA AATCCTCGTT TCGCCACCGT GGCGTAGTAA GAAAAAAATG ACAGCTCCAC GTCTTGATTT GGCACCGCTC GAATTAACTC CGCAAGTTTA CTGGCAACCA GGCGAACAAG AGAGGCTTGC CGCCACTGAG TCTGCCCGTT ATTTCAGCAC GGAATCTCTT GCGCAACGCA TGGAACAAAA AAGTGGACGA GTTGTATTAC AGGAACTGGG TTTTGGGGAT GATGTATGGC TGTTTCTGAA TTATATACTC CCCGGAAAAC TGGATGCTGC ACGCAATTCA CTCATTGTTC AGTGGCATTA CTACCAGGGG CGGGTTGAAG AGATCCTGAA TGGCTGGAAC TCCCCGGAAG CACAATTAGC AGAACAGGCG CTCCGCAGCG GTCACATAGA AGCGTTAATT AACATATGGG AAAATGACAA CTACTCACGT TATCGTCCGG AAAAGAGTGT CTGGAACCTG TATTTATTGG CACAGTTGCC GCGTGAGATG GCTTTGACCT TCTGGCTGCG TATCAATGAG 
AAAAAGCATC TGTTCGCGGG TGAGGACTAT TTTCTCAGTA TCCTCGGATT GGATGCGCTA CCAGGTCTGC TGTTGGCTTT TTCACATCGT CCAAAAGAAA CATTTCCGTT AATTTTAAAT TTCGGCGCAA CAGAACTGGC CCTGCCCGTT GCCCGCGTCT GGCACCGTTT TGCGGGCCAG CGTAATCTGG CTCGCCAGTG GATTTTACAA TGGCCGGAAC ATACGGCTAC TGCACTTATT CCACTCGTCT TTGTTAAACC CTGCGACAAC AGCGAAGCGG CATTATTTGC CCTTCGTTTA CTGTATGAAC AAGGACATAG TGAATTACTG CAAACGGTTG CAAACCGCTG GGATCGCGCT GATATGTGGC CAGCCCTGGA AAAAATACTT ACCCAGAACC CGATGGAAAT TTACCCGGCA CGCATTCCAA AAGCCCCTGA TTTCTGGCAT CCGCAAATGT GGTCCAGGCC GCGCCTTATC ACTAATAATC AAACTGTTAC CAATGACGCT CTGGAAATTA TCGGCGAAAT GCTGCGCTTT ACCCAGGGGG GACGTTTTTA TAGCGGGCTG GAACAACTGA AAACGTTCTG CCAGCCACAA ACGCTGGCAG CTTTTGCTTG GGATCTCTTC ACTGCGTGGC AACAAGCTGG TGCCCCCGCA AAAGACAACT GGACATTTCT GGCGTTGAGT CTCTTTGGTG ACGAAAGCAC GGCACGGGAT CTAACGACAC AGATCCTCGC CTGGCCACAA GAAGGCAAAT CTGCCCGTGC GGTCAGTGGC CTGAACATCC TTACCCTGAT GAATAATGAT ATGGCGCTGA TACAGCTGCA TCATATATCG CAACGGGCGA AATCCTCTTC ATTACGTGAA AACGCAGCGG AATTTCTTCA AGTGGTCGCA GAAAATCGCG GGCTAAGCCA GGAAGAGTTA GCGGACAGAT TAGTCCCAAC CCTGGGCCTT GATGATCCGC AGGCGTTGAT TTTTGATTTT GGTCCCCGGC AGTTTACCGT TCGCTTCGAT GAAAACCTCA ATCCGGTTAT CTTTGATCAG CAAAACGTTC GCCAGAAAAG CGTTCCCCGG TTGCGCGCCG ATGACGATCA ACTGAAAGCG CCCGAGGCAC TGGCCCGACT AAAAGGGTTA AAAAAAGATG CTACTCAGGT GAGCAAAAAC CTGCTCCCGC GTCTTGAAGC TGCCCTACGT ACCATCCGAC GCTGGTCGCT GGCAGATTTT CATACTCTGT TTGTTAATCA TCCCTTTACC CGCCTGGTTA CCCAGCGATT AATATGGGGC GTGTATCCGG CAAATGAACC GCGTTGTTTA CTCAACGCCT TTCGTGTGGC CGCAGAGGGG GAGTTCTGCA ATGCGCAAGA TGAGCCAATT GGCCTGCCTG CGGATGCTCT GATTGGCATT GCCCACCCGT TAGAAATGAC AGCAGAAATG CGCAGTGAAT TTGCACAGCT TTTTGCCGAT TACGAAATTA TGCCGCCTTT TCGCCAGTTG TCGCGCCGCA CGGTGCTGCT CACACCTGAC GAGTCAACCA GTAACAGCCT GACTCGCTGG GAAGGTAAAT CCGCTACCGT TGGGCAACTT ATGGGAATGC GATACAAAGG CTGGGAGTCA GGCTATGAGG ACGCATTTGT CTATAACCTG GGTGAGTACC GGCTGGTCCT TAAGTTTTCA CCCGGTTTTA ACCACTACAA TGTTGATAGC AAAGCGCTAA TGAGCTTCCG TTCTCTTCGA GTGTACCGTG ACAATAAATC CGTCACTTTT GCCGAACTTG ATGTGTTTGA TTTGAGTGAG GCGTTAAGCG 
CACCTGACGT CATTTTCCAT TAA
|
Protein sequence | MDKELPWLAD NAQLELKYKK GKTPLSHRRW PGEPVSVITG SLIQTLGDEL LQKAEKKKNI VWRYENFSLE WQSAITQAIN LIGEHKPSVP AQTMAALACI AQNDSQQLLD EIVQQEGLEY ATEVVIARQF IARCYESDPL VVTLQYQDED YGYGYRSETY NEFDLRLRKH LSLAEESCWQ RCADKLIAAL PGITKVRRPF IALILPEKPE IANELVGLEC PRTHFHSKEW LKVVANDPTA VRKLEHYWSQ DIFSDREASY MSHENHFGYA ACAALLREQG LAAIPRLAMY AHKEDCGSLL VQINHPQVIR TLLLVADKNK PSLQRVAKYH KNFPHATLAA LAELLALTEP PARPGYPIIE DKKLPAQQKA RDEYWRTLLQ TLMASQPQLA EEVMQWLSTQ ARAVLNSYLS APPKPVIDST DNSNLPEILV SPPWRSKKKM TAPRLDLAPL ELTPQVYWQP GEQERLAATE SARYFSTESL AQRMEQKSGR VVLQELGFGD DVWLFLNYIL PGKLDAARNS LIVQWHYYQG RVEEILNGWN SPEAQLAEQA LRSGHIEALI NIWENDNYSR YRPEKSVWNL YLLAQLPREM ALTFWLRINE KKHLFAGEDY FLSILGLDAL PGLLLAFSHR PKETFPLILN FGATELALPV ARVWHRFAGQ RNLARQWILQ WPEHTATALI PLVFVKPCDN SEAALFALRL LYEQGHSELL QTVANRWDRA DMWPALEKIL TQNPMEIYPA RIPKAPDFWH PQMWSRPRLI TNNQTVTNDA LEIIGEMLRF TQGGRFYSGL EQLKTFCQPQ TLAAFAWDLF TAWQQAGAPA KDNWTFLALS LFGDESTARD LTTQILAWPQ EGKSARAVSG LNILTLMNND MALIQLHHIS QRAKSSSLRE NAAEFLQVVA ENRGLSQEEL ADRLVPTLGL DDPQALIFDF GPRQFTVRFD ENLNPVIFDQ QNVRQKSVPR LRADDDQLKA PEALARLKGL KKDATQVSKN LLPRLEAALR TIRRWSLADF HTLFVNHPFT RLVTQRLIWG VYPANEPRCL LNAFRVAAEG EFCNAQDEPI GLPADALIGI AHPLEMTAEM RSEFAQLFAD YEIMPPFRQL SRRTVLLTPD ESTSNSLTRW EGKSATVGQL MGMRYKGWES GYEDAFVYNL GEYRLVLKFS PGFNHYNVDS KALMSFRSLR VYRDNKSVTF AELDVFDLSE ALSAPDVIFH
|
| |
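The gene sequence above is shown in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below is one possible way to strip the whitespace, estimate GC content, and translate the CDS for comparison with the listed protein sequence. The function names are hypothetical. The codon table is the standard set of amino acid assignments, which translation table 11 shares; table 11 differs only in which codons may act as start codons, and that does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
first_blocks = "ATGGACAAGG AATTACCGTG GCTGGCGGAT"
print(translate(first_blocks))  # MDKELPWLAD
```

The output matches the first ten residues of the protein sequence field. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same check against the whole protein and the GC content field.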