Gene Information |
Locus tag | B21_01965 |
Symbol | yegI |
ID | 8114654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2047352 |
End bp | 2049247 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644848179 |
Product | hypothetical protein |
Protein accession | YP_002999752 |
Protein GI | 251785448 |
COG category | [R] General function prediction only |
COG ID | [COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains |
TIGRFAM ID | |
|
|
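The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon; the function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for locus tag B21_01965.
gene_len = gene_length_bp(2047352, 2049247)
print(gene_len, protein_length_aa(gene_len))  # prints "1896 631"
```

Under these assumptions the result agrees with the listed Gene Length (1896 bp) and Protein Length (631 aa).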
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGCCGTGAAC TGGGCAAAGG CGGTGAAGGT GCGGTTTATG ATATCGAGGA GTTTGCCGAT AGCGTCGCCA AGATTTATCA CACGCCGCCA CCCGCCTTAA AACAGGACAA ACTTGCCTTT ATGGCTGCGA CAGCTGACGC GCAGTTGTTG AATTATGTCG CCTGGCCGCA GGCAACGCTT CACGGTGGGC GAGGCGGAAA AGTTATCGGT TTTATGATGC CAAAAGTTTC TGGTAAAGAA CCGATTCATA TGATCTATAG CCCGGCACAT CGTCGCCAGA GTTACCCTCA TTGTGCGTGG GATTTTCTAC TCTATGTTGC GCGCAATATT GCTTCATCTT TTGCTACGGT TCACGAGCAC GGGCACGTCG TGGGGGACGT AAACCAGAAC AGCTTTATGG TAGGTCGCGA CAGCAAAGTG GTGTTGATCG ATAGTGACTC CTTTCAGATT AACGCCAATG GCACACTGCA TTTATGCGAA GTCGGCGTGT CGCATTTTAC GCCGCCAGAG CTACAAACCT TGCCATCATT TGTCGGTTTT GAACGTACCG CGAATCACGA TAATTTTGGC CTTGCGTTGC TGATTTTTCA CGTCTTGTTT GGTGGGCGGC ATCCTTATTC TGGTGTGCCG CTTATCTCTG ATGCGGGTAA TGCGCTGGAG ACGGATATTG CCCATTTCCG TTATGCCTAC GCGTCAGATA ATCAGCGACG TGGTTTAAAA CCGCCGCCAC GATCGATTCC GCTGTCGATG TTACCGGGCG ATGTTGAAGC CATGTTTCAG CAGGCGTTTA CGGAAAGTGG TGTAGCAACC GGGCGTCCGA CGGCTAAAGC GTGGGTAGCA GCACTGGATT CTCTACGCCA ACAGTTAAAG AAATGTACCG TTTCGGCAAT GCATGTTTAT CCCGCTCATT TGACCGACTG CCCGTGGTGT ACGCTGGATA ATCAAGGCGT TATCTATTTT ATTGATCTCG GCGAAGAGGT CATTACCACC GGCGGTGATT TTGTGCTGGC GAAAGTCTGG GCGATGGTGA TGGCGTCAGT AGCACCGCCA GCAGTGCAAT TGCCATTACC CGATCATTTC CAACCGACTG GCAGGCCGCT TCCTTTAGGC CTGTTACGGC GCGAATACAT CATTCTGCTT GAGATCGCAC TGTCAGCGTT ATCGCTGTTG CTTTGCGGCC TTCAGGCAGA ACCGCGTTAT ATTATTTTGG TTCCTGTGCT GGCGGCTATC TGGATTATTG GCAGTCTGAC AAGCAAAGCT TATAAAGCAG AAATCCAGCA ACGCCGTGAG GCATTTAATC GCGCGAAAAT GGACTATGAC CATTTAGTCA GCCAGATCCA ACAGTTGGGC GGGCTGGAAG GTTTTATCGC CAAACGGACG ATGCTCGAAA AAATGAAGGA CGAAATGCTC GGGTTACCGG AGGAAGAAAA ACGTGCTCTG GCAGCACTTC ACGACACCGC AAGGGAACGG CAGAAGCAGA AGTTTCTGGA GGGATTTTTT ATTGATGTTG CCTCTATTCC CGGTGTTGGC CCTGCGCGTA AAGCGGCGTT ACGGTCCTTT GGTATTGAAA CAGCAGCGGA TGTTACCCGT CGTGGGGTTA AGCAAGTTAA AGGGTTTGGT GATCATCTGA CCCAGGCGGT CATCGACTGG AAAGCGAGCT GTGAACGCCG TTTTGTGTTC AGGCCGAACG AAGCGGTAAC GCCTGCAGAC AGACAAGCGG TAATGGCGAA AGTGGCCGCC AAACGACATC GGCTGGAATC GGCGTTGACT GTCGGCGCGA CAGAGTTGCA GCGATTCCGC 
CTTCATGCTC CAGCACGGAC CATGCCGTTG ATGGAACCGT TACGTCAGGC GGCAGAAAAA CTGGCTCAGG CGCAGGCAGA TTTAAGCCGC TGCTGA
|
Protein sequence | GRELGKGGEG AVYDIEEFAD SVAKIYHTPP PALKQDKLAF MAATADAQLL NYVAWPQATL HGGRGGKVIG FMMPKVSGKE PIHMIYSPAH RRQSYPHCAW DFLLYVARNI ASSFATVHEH GHVVGDVNQN SFMVGRDSKV VLIDSDSFQI NANGTLHLCE VGVSHFTPPE LQTLPSFVGF ERTANHDNFG LALLIFHVLF GGRHPYSGVP LISDAGNALE TDIAHFRYAY ASDNQRRGLK PPPRSIPLSM LPGDVEAMFQ QAFTESGVAT GRPTAKAWVA ALDSLRQQLK KCTVSAMHVY PAHLTDCPWC TLDNQGVIYF IDLGEEVITT GGDFVLAKVW AMVMASVAPP AVQLPLPDHF QPTGRPLPLG LLRREYIILL EIALSALSLL LCGLQAEPRY IILVPVLAAI WIIGSLTSKA YKAEIQQRRE AFNRAKMDYD HLVSQIQQLG GLEGFIAKRT MLEKMKDEML GLPEEEKRAL AALHDTARER QKQKFLEGFF IDVASIPGVG PARKAALRSF GIETAADVTR RGVKQVKGFG DHLTQAVIDW KASCERRFVF RPNEAVTPAD RQAVMAKVAA KRHRLESALT VGATELQRFR LHAPARTMPL MEPLRQAAEK LAQAQADLSR C
|
| |
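The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is an illustrative sketch, not the database's own pipeline: it assumes the gene sequence is shown in coding orientation, it ignores start-codon handling, and it relies on translation table 11 assigning the same amino acids to codons as the standard code. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids
# ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is removed
    first; codons with unexpected characters are reported as "X".
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last three codons of the gene sequence in the record.
print(translate("GGCCGTGAAC TGGGCAAAGG C"))  # prints "GRELGKG"
print(translate("CGC TGCTGA"))               # prints "RC"
```

The two fragments reproduce the beginning (GRELGKG) and the end (RC) of the listed protein sequence.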