Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1636 |
Symbol | bglH |
ID | 6142642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1625731 |
End bp | 1627404 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641616512 |
Product | glucoside specific outer membrane porin BglH |
Protein accession | YP_001743690 |
Protein GI | 170680584 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4580] Maltoporin (phage lambda and maltose receptor) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATAA AGACGCTTAA CGTCAGCCTT TTGTCTTTTT CTATTATTAC AGCATTGTTT CCATTGAACG CGATGGCAAC AAAATTAACC ATAGAGCAGC GCCTTGAACT GCTTGAAAAT GAATTGTCGC AAAATAAACA AGAGCTGAGA GCAACACAGA ATGAACTAGG AGTATATAAA TTCCGACTTT CGACATTACA AAAAAGCATC ACAGAAAATA AATATCAATC GGCCTCGCTT GCCGAAATAT CAGCTCCATC TCCCGTTGCT GATAACATCA AAAATGAAAA CGGTGAACAG AACTCGTCTG CCGCAGCACA TACTATAAAT GGATCGCAGC AAATTGCCGT TATTGAAAGT AAAGGCGATA AAACCACTAT CGAAAGCGTG ACCCTGAAAG ATATCAGTAA ATATATAAAA GATGATATTG GGTTCAGCTA TCAGGGGTAC TTTCGCTCAG GTTGGGGTAC CGGAAATCAT GGCTCACCAC AAACTTATGC AGCGGGTTCT CTGGGACGTT TTGGTAACGA GATGAGCGGT TGGTTTGACT TAACCTTAAA TCAGCGTGTT TATAATCAGG ACGGTAAAAC GGCAAATGCG GTCGTTACCT ATGATGGCAA CGTAGGTGAG CAGTATAACG ATGCCTGGTT TGGTGACAGT GCCAATGAAA ATATCATGCA GTTCAGTGAT ATTTATCTGA CAACGCGAGG TTTTTTACCC TTCGCGCCAG AGGCAGACTT CTGGGTAGGC AAACATAAAC TCCCGCAATA TGAGATCCAA ATGCTGGACT GGAAAACCTT AACCACGGAT GTCGCTGCGG GTGTGGGGAT TGAAAACTGG GCACTTGGTG TAGGGCTGTT TGATATGTCC TTAAGCCGAG ATGATGTCGA TGTTTACTCC CGTGATTTTA CGCGTACCAG TCAGATGAAT ACTAATTCTG TGGATGTTCG TTATCGCAAT ATCCCGTTAT GGGATGATGC AACATTATCA TTAATGGCTA AATATTCCGC ACCTAATAAA ACGGATCAAC AACAAGATAA TGAAAATGAC GACAGTTATT TTGAAATGAA AGATAGCTGG ATGCTGACTT CTGTTTTACG GCAAAAACTG CAACGCGATA CGTTTAATGA ATTTACGTTA CAGGTTGCCA ATAATTCCTA TGCCAGCAGT TTTGCCAGTT TCTCAGATGC CAGTAACACG ATGGCGCATG GTCGCTATTA CTATGGTGAC CATACCAATG GGATCGCCTG GCGTTTAATC TCTCAGGGCG AGATGTATCT TACTGACAAT ATTATTATGG CTAACGCGCT TGTCTATTCT CATGGCGAAG ATGTTTATAG TTATGAAAGT GGCGCTCATA GTGATTTTGA CAGTATTCGC ACCGTAATAA GACCGGCCTG GATCTGGAAT ACATGGAATC AGACGGGGCT TGAATTAGGC TGGTTTAAGC AACAGAACAA AGCACAGCAG GGTGTAACAC TAAATGAATC GGCTTATAAA ACGACACTCT GGCATGCATT GAAAGTGGGT GAAAGCATTT TAGGTTCACG ACCAGAAATT CGCTTCTATG GCACGTATAT CAATATTCTG GATAACGAAT TATCTAATTT TAAGTTTAAT GAGAACAGCA AAGACGAATT TATGGCCGGC ATCCAGGCGG AAGTCTGGTG GTAA
|
Protein sequence | MNIKTLNVSL LSFSIITALF PLNAMATKLT IEQRLELLEN ELSQNKQELR ATQNELGVYK FRLSTLQKSI TENKYQSASL AEISAPSPVA DNIKNENGEQ NSSAAAHTIN GSQQIAVIES KGDKTTIESV TLKDISKYIK DDIGFSYQGY FRSGWGTGNH GSPQTYAAGS LGRFGNEMSG WFDLTLNQRV YNQDGKTANA VVTYDGNVGE QYNDAWFGDS ANENIMQFSD IYLTTRGFLP FAPEADFWVG KHKLPQYEIQ MLDWKTLTTD VAAGVGIENW ALGVGLFDMS LSRDDVDVYS RDFTRTSQMN TNSVDVRYRN IPLWDDATLS LMAKYSAPNK TDQQQDNEND DSYFEMKDSW MLTSVLRQKL QRDTFNEFTL QVANNSYASS FASFSDASNT MAHGRYYYGD HTNGIAWRLI SQGEMYLTDN IIMANALVYS HGEDVYSYES GAHSDFDSIR TVIRPAWIWN TWNQTGLELG WFKQQNKAQQ GVTLNESAYK TTLWHALKVG ESILGSRPEI RFYGTYINIL DNELSNFKFN ENSKDEFMAG IQAEVWW
|
| |