Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03712 |
Symbol | yihQ |
ID | 8114944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 3972013 |
End bp | 3973977 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644849873 |
Product | hypothetical protein |
Protein accession | YP_003001446 |
Protein GI | 251787142 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACAACGTC TTATTTTAAC CCATAGCAAA GATAATCCTT GTTTATGGAT TGGCTCAGGT ATAGCGGATA TCGATATGTT CCGCGGTAAT TTCAGCATTA AAGATAAACT ACAGGAGAAA ATTGCGCTTA CCGACGCCAT CGTCAGCCAG TCACCGGATG GTTGGTTAAT TCATTTCAGC CGTGGTTCTG ACATTAGCGC CACGCTGAAT ATCTCTGCCG ACGATCAGGG GCGTTTATTG CTGGAACTAC AAAACGACAA CCTTAACCAC AACCGTATCT GGCTGCGCCT TGCCGCTCAA CCAGAGGACC ATATCTACGG CTGCGGCGAA CAGTTTTCCT ACTTCGATCT GCGTGGCAAA CCGTTCCCGC TATGGACCAG TGAACAAGGC GTTGGTCGCA ACAAACAAAC CTATGTCACC TGGCAGGCCG ACTGCAAAGA AAATGCGGGC GGCGACTATT ACTGGACTTT CTTCCCACAG CCTACGTTTG TCAGCACGCA GAAGTATTAC TGCCATGTTG ATAACAGTTG CTATATGAAC TTCGACTTTA GTGCCCCGGA ATACCATGAA CTGGCGCTGT GGGAAGACAA AGCAACGCTG CGTTTTGAAT GTGCTGACAC ATACATTTCC CTGCTGGAAA AATTAACCGC CCTGCTGGGA CGCCAGCCAG AACTGCCCGA CTGGATTTAT GACGGAGTAA CGCTCGGCAT TCAGGGCGGG ACGGAAGTGT GCCAGAAGAA ACTGGACACC ATGCGTAACG CGGGCGTGAA GGTCAACGGC ATCTGGGCGC AGGACTGGTC CGGTATTCGT ATGACCTCTT TTGGCAAACG CGTGATGTGG AACTGGAAGT GGAACAGCGA AAACTACCCG CAACTGGATT CACGCATTAA GCAGTGGAAT CAGGAGGGCG TGCAGTTCCT GGCCTATATC AACCCGTATG TTGCCAGCGA TAAAGATCTC TGCGAAGAAG CGGCACAACA CGGCTATCTG GCAAAAGATG CCTCTGGCGG TGACTATCTG GTGGAGTTTG GCGAGTTTTA CGGCGGCGTT GTCGATCTCA CTAATCCAGA AGCCTACGCC TGGTTCAAGG AAGTGATCAA AAAGAACATG ATTGAACTCG GCTGCGGCGG CTGGATGGCT GACTTCGGCG AGTATCTGCC CACCGACACG TACCTGCATA ACGGCGTCAG TGCCGAAATT ATGCATAACG CCTGGCCTGC GCTGTGGGCG AAGTGTAACT ACGAAGCCCT TGAAGAAACG GGCAAGCTCA GCGAGATCCT TTTCTTTATG CGCGCCGGTT CTACCGGTAG CCAGAAATAC TCCACCATGA TGTGGGCGGG CGACCAGAAC GTCGACTGGA GTCTCGACGA TGGCCTGGCG TCGGTTGTCC CGGCGGCGCT GTCGCTGGCA ATGACCGGAC ATGGCCTGCA CCACAGCGAC ATTGGCGGTT ACACCACCCT GTTTGAGATG AAGCGCAGCA AAGAGCTGCT GCTGCGCTGG TGCGATTTCA GCGCCTTCAC GCCGATGATG CGCACCCACG AAGGTAACCG TCCTGGCGAC AACTGGCAGT TTGACGGCGA CGCAGAAACC ATCGCCCATT TCGCCCGTAT GACCACCGTC TTCACCACCC TGAAACCTTA CCTGAAAGAG GCCGTCGCGC TGAATGCGAA GTCCGGCCTG CCGGTTATGC GCCCGCTGTT CCTGCATTAC GAAGACGATG CGCACACTTA CACCCTGAAA TATCAGTACC TGTTAGGTCG CGACATTCTG GTCGCTCCGG TGCATGAAGA AGGCCGTAGC GACTGGACGC TCTATCTGCC GGAGGATAAC TGGGTCCACG CCTGGACGGG TGAAGCGTTC CGGGGCGGGG AAGTTACCGT TAATGCGCCC ATCGGCAAGC CGCCGGTCTT TTATCGCGCC GATAGCGAAT GGGCGGCACT GTTCGCGTCG TTAAAAAGCA TCTAA
|
Protein sequence | QQRLILTHSK DNPCLWIGSG IADIDMFRGN FSIKDKLQEK IALTDAIVSQ SPDGWLIHFS RGSDISATLN ISADDQGRLL LELQNDNLNH NRIWLRLAAQ PEDHIYGCGE QFSYFDLRGK PFPLWTSEQG VGRNKQTYVT WQADCKENAG GDYYWTFFPQ PTFVSTQKYY CHVDNSCYMN FDFSAPEYHE LALWEDKATL RFECADTYIS LLEKLTALLG RQPELPDWIY DGVTLGIQGG TEVCQKKLDT MRNAGVKVNG IWAQDWSGIR MTSFGKRVMW NWKWNSENYP QLDSRIKQWN QEGVQFLAYI NPYVASDKDL CEEAAQHGYL AKDASGGDYL VEFGEFYGGV VDLTNPEAYA WFKEVIKKNM IELGCGGWMA DFGEYLPTDT YLHNGVSAEI MHNAWPALWA KCNYEALEET GKLSEILFFM RAGSTGSQKY STMMWAGDQN VDWSLDDGLA SVVPAALSLA MTGHGLHHSD IGGYTTLFEM KRSKELLLRW CDFSAFTPMM RTHEGNRPGD NWQFDGDAET IAHFARMTTV FTTLKPYLKE AVALNAKSGL PVMRPLFLHY EDDAHTYTLK YQYLLGRDIL VAPVHEEGRS DWTLYLPEDN WVHAWTGEAF RGGEVTVNAP IGKPPVFYRA DSEWAALFAS LKSI
|
| |