Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02036 |
Symbol | mglC |
ID | 8114684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2130960 |
End bp | 2131970 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644848248 |
Product | hypothetical protein |
Protein accession | YP_002999821 |
Protein GI | 251785517 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4211] ABC-type glucose/galactose transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.433899 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCGT TAAATAAGAA AAGTTTTCTT ACTTACCTGA AAGAGGGCGG TATTTACGTC GTTCTTTTAG TTTTGCTGGC GATTATTATT TTCCAGGACC CAACATTTTT AAGTCTGTTG AACTTAAGTA ATATTCTCAC CCAGTCATCG GTGCGTATTA TTATCGCGCT CGGTGTGGCA GGGTTAATTG TCACCCAGGG GACCGATCTT TCTGCTGGTC GTCAGGTAGG GCTGGCGGCA GTGGTGGCTG CGACATTATT GCAGTCCATG GATAACGCCA ACAAAGTGTT CCCGGAAATG GCGACGATGC CGATTGCGCT GGTTATTCTG ATTGTCTGTG CCATTGGTGC GGTGATCGGT TTGATCAACG GTCTGATTAT CGCTTATCTC AACGTGACGC CGTTCATTAC CACGCTCGGC ACGATGATCA TCGTCTATGG CATCAACTCG CTCTATTACG ACTTTGTCGG GGCGTCGCCA ATTTCTGGTT TTGACAGTGG CTTCTCTACC TTTGCTCAGG GCTTTGTCGC GCTGGGGAGT TTCCGTCTCT CTTACATCAC CTTCTACGCG TTGATTGCGG TGGCGTTCGT CTGGGTGTTG TGGAACAAAA CCCGCTTCGG TAAGAACATT TTTGCCATTG GCGGTAACCC GGAAGCGGCA AAAGTATCTG GTGTCAACGT CGGCCTGAAC CTGCTGATGA TCTACGCGTT GTCTGGCGTG TTCTATGCCT TTGGCGGGAT GTTAGAAGCC GGACGTATCG GCTCTGCCAC CAACAACCTC GGCTTTATGT ATGAGCTGGA TGCTATCGCG GCGTGCGTGG TAGGCGGTGT ATCGTTCAGC GGCGGTGTGG GGACGGTGAT TGGCGTGGTG ACCGGGGTAA TTATTTTTAC CGTCATCAAC TATGGCCTGA CGTATATCGG CGTAAACCCA TACTGGCAGT ACATCATCAA AGGGGCGATT ATTATCTTCG CCGTAGCGCT GGATTCACTG AAATACGCGC GTAAGAAATG A
|
Protein sequence | MSALNKKSFL TYLKEGGIYV VLLVLLAIII FQDPTFLSLL NLSNILTQSS VRIIIALGVA GLIVTQGTDL SAGRQVGLAA VVAATLLQSM DNANKVFPEM ATMPIALVIL IVCAIGAVIG LINGLIIAYL NVTPFITTLG TMIIVYGINS LYYDFVGASP ISGFDSGFST FAQGFVALGS FRLSYITFYA LIAVAFVWVL WNKTRFGKNI FAIGGNPEAA KVSGVNVGLN LLMIYALSGV FYAFGGMLEA GRIGSATNNL GFMYELDAIA ACVVGGVSFS GGVGTVIGVV TGVIIFTVIN YGLTYIGVNP YWQYIIKGAI IIFAVALDSL KYARKK
|
| |