Gene EcSMS35_1636 details


Gene Information

Locus tag: EcSMS35_1636
Symbol: bglH
ID: 6142642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1625731
End bp: 1627404
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 42%
IMG OID: 641616512
Product: glucoside specific outer membrane porin BglH
Protein accession: YP_001743690
Protein GI: 170680584
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4580] Maltoporin (phage lambda and maltose receptor)
TIGRFAM ID:
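
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1625731
END_BP = 1627404


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1674 557
```

Under those assumptions the result agrees with the listed Gene Length (1674 bp) and Protein Length (557 aa).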


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATAA AGACGCTTAA CGTCAGCCTT TTGTCTTTTT CTATTATTAC AGCATTGTTT 
CCATTGAACG CGATGGCAAC AAAATTAACC ATAGAGCAGC GCCTTGAACT GCTTGAAAAT
GAATTGTCGC AAAATAAACA AGAGCTGAGA GCAACACAGA ATGAACTAGG AGTATATAAA
TTCCGACTTT CGACATTACA AAAAAGCATC ACAGAAAATA AATATCAATC GGCCTCGCTT
GCCGAAATAT CAGCTCCATC TCCCGTTGCT GATAACATCA AAAATGAAAA CGGTGAACAG
AACTCGTCTG CCGCAGCACA TACTATAAAT GGATCGCAGC AAATTGCCGT TATTGAAAGT
AAAGGCGATA AAACCACTAT CGAAAGCGTG ACCCTGAAAG ATATCAGTAA ATATATAAAA
GATGATATTG GGTTCAGCTA TCAGGGGTAC TTTCGCTCAG GTTGGGGTAC CGGAAATCAT
GGCTCACCAC AAACTTATGC AGCGGGTTCT CTGGGACGTT TTGGTAACGA GATGAGCGGT
TGGTTTGACT TAACCTTAAA TCAGCGTGTT TATAATCAGG ACGGTAAAAC GGCAAATGCG
GTCGTTACCT ATGATGGCAA CGTAGGTGAG CAGTATAACG ATGCCTGGTT TGGTGACAGT
GCCAATGAAA ATATCATGCA GTTCAGTGAT ATTTATCTGA CAACGCGAGG TTTTTTACCC
TTCGCGCCAG AGGCAGACTT CTGGGTAGGC AAACATAAAC TCCCGCAATA TGAGATCCAA
ATGCTGGACT GGAAAACCTT AACCACGGAT GTCGCTGCGG GTGTGGGGAT TGAAAACTGG
GCACTTGGTG TAGGGCTGTT TGATATGTCC TTAAGCCGAG ATGATGTCGA TGTTTACTCC
CGTGATTTTA CGCGTACCAG TCAGATGAAT ACTAATTCTG TGGATGTTCG TTATCGCAAT
ATCCCGTTAT GGGATGATGC AACATTATCA TTAATGGCTA AATATTCCGC ACCTAATAAA
ACGGATCAAC AACAAGATAA TGAAAATGAC GACAGTTATT TTGAAATGAA AGATAGCTGG
ATGCTGACTT CTGTTTTACG GCAAAAACTG CAACGCGATA CGTTTAATGA ATTTACGTTA
CAGGTTGCCA ATAATTCCTA TGCCAGCAGT TTTGCCAGTT TCTCAGATGC CAGTAACACG
ATGGCGCATG GTCGCTATTA CTATGGTGAC CATACCAATG GGATCGCCTG GCGTTTAATC
TCTCAGGGCG AGATGTATCT TACTGACAAT ATTATTATGG CTAACGCGCT TGTCTATTCT
CATGGCGAAG ATGTTTATAG TTATGAAAGT GGCGCTCATA GTGATTTTGA CAGTATTCGC
ACCGTAATAA GACCGGCCTG GATCTGGAAT ACATGGAATC AGACGGGGCT TGAATTAGGC
TGGTTTAAGC AACAGAACAA AGCACAGCAG GGTGTAACAC TAAATGAATC GGCTTATAAA
ACGACACTCT GGCATGCATT GAAAGTGGGT GAAAGCATTT TAGGTTCACG ACCAGAAATT
CGCTTCTATG GCACGTATAT CAATATTCTG GATAACGAAT TATCTAATTT TAAGTTTAAT
GAGAACAGCA AAGACGAATT TATGGCCGGC ATCCAGGCGG AAGTCTGGTG GTAA
 
Protein sequence
MNIKTLNVSL LSFSIITALF PLNAMATKLT IEQRLELLEN ELSQNKQELR ATQNELGVYK 
FRLSTLQKSI TENKYQSASL AEISAPSPVA DNIKNENGEQ NSSAAAHTIN GSQQIAVIES
KGDKTTIESV TLKDISKYIK DDIGFSYQGY FRSGWGTGNH GSPQTYAAGS LGRFGNEMSG
WFDLTLNQRV YNQDGKTANA VVTYDGNVGE QYNDAWFGDS ANENIMQFSD IYLTTRGFLP
FAPEADFWVG KHKLPQYEIQ MLDWKTLTTD VAAGVGIENW ALGVGLFDMS LSRDDVDVYS
RDFTRTSQMN TNSVDVRYRN IPLWDDATLS LMAKYSAPNK TDQQQDNEND DSYFEMKDSW
MLTSVLRQKL QRDTFNEFTL QVANNSYASS FASFSDASNT MAHGRYYYGD HTNGIAWRLI
SQGEMYLTDN IIMANALVYS HGEDVYSYES GAHSDFDSIR TVIRPAWIWN TWNQTGLELG
WFKQQNKAQQ GVTLNESAYK TTLWHALKVG ESILGSRPEI RFYGTYINIL DNELSNFKFN
ENSKDEFMAG IQAEVWW
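
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the sequence is pasted in the 10-base blocks shown above, and it uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons, which this sketch ignores because the gene starts with ATG. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
# Standard codon assignments in NCBI "TCAG" order (assumed here for table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        return 0.0
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
first_blocks = "ATGAATATAA AGACGCTTAA CGTCAGCCTT"
print(translate(first_blocks))  # MNIKTLNVSL
```

Pasting the full gene sequence into `translate` and `gc_percent` would let a reader compare the results with the listed protein sequence and GC content.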