Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03672 |
Symbol | ybl194 |
ID | 8116298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 3920310 |
End bp | 3921314 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644849833 |
Product | hypothetical protein |
Protein accession | YP_003001406 |
Protein GI | 251787102 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3734] 2-keto-3-deoxy-galactonokinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGGAG CACCCTCATT TATTGCCCTG GACTGGGGAA CGTCATCGCT TCGGGCATGG CGTTTTGGCG AATCACCAAA TCCACAAGAA AAACGTGAAT TCCCCTGGGG GATCATGAAA TTACCCTCCC AGGCAGCAAC ACGTGAGGAC GCTTTCCACG ATACGTTTTT ACGTGTCTGC GGCGACTGGC TGGCACAAAC CCCATGCCCG GTACTGGCTT GCGGCATGCT CGGCAGTGCG CAAGGGTGGC AACCAGCAGC TTATTTACCT TGCCCGGTCA CTCTCGAGGG TCTGGCGAAG CAGTTGACGC CTGTTATCCA CCAGCAGCAG ACAATGCTGC ACATCATTCC TGGCGTGATT AAAGAAGGTG AAATGCCCGA AGTGATGCGT GGCGAAGAGA CACAAATCTT CGGTGCTATC TCGATGGAAC CGGCCCTGCA AAACGCGATC CATCAGGGTA TGCCAGTGCT GATAGGCTTA CCCGGCACAC ATGCGAAATG GGCAGTAGTT GAAAACAACA CCATTACCGA TTTCCGAACC TTTATGACGG GCGAGTTATT TGATGTTTTA TCCCGCCATT CGATTCTCGG TGCCACCATG CATCCTGGAG ATGAACCGCA CTGGGATGCC TTCACCCACG GGCTGACAGC AGCACAAGAG CATCATCAGA CCGGATTATT ATCGACGCTG TTTTCGACCC GTTCGCGCCT GCTGACCAGT AATCTTACGT CGTCCTCGCA GGGGGATTAC CTTTCCGGAT TACTGATAGG CCATGAATTA TGTGGTCTGG CATCCAGTTT GCTACGTGAT GTACCCGCCA CAACACCGAT CGCCTTAATC GGTAGCGCAA ATCTGAACAG CCGCTATTCA CAAGCATTCA ACCACGTTTT TCCCGACAGA CAGATACACG CCATTCCCAA TGCCACTGAA CAGGGATTAT GGCGAATCGC CCACGCTGCC GGGTTACTGT CCACCAACGC CAGGGAATGT ACCCATGCCA TTTAA
|
Protein sequence | MPGAPSFIAL DWGTSSLRAW RFGESPNPQE KREFPWGIMK LPSQAATRED AFHDTFLRVC GDWLAQTPCP VLACGMLGSA QGWQPAAYLP CPVTLEGLAK QLTPVIHQQQ TMLHIIPGVI KEGEMPEVMR GEETQIFGAI SMEPALQNAI HQGMPVLIGL PGTHAKWAVV ENNTITDFRT FMTGELFDVL SRHSILGATM HPGDEPHWDA FTHGLTAAQE HHQTGLLSTL FSTRSRLLTS NLTSSSQGDY LSGLLIGHEL CGLASSLLRD VPATTPIALI GSANLNSRYS QAFNHVFPDR QIHAIPNATE QGLWRIAHAA GLLSTNAREC THAI
|
| |