Gene EcHS_A3934 details


Gene Information

Locus tag: EcHS_A3934
Symbol: bglH
ID: 5592146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3927784
End bp: 3929400
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 45%
IMG OID: 640923041
Product: glucoside specific outer membrane porin BglH
Protein accession: YP_001460518
Protein GI: 157163200
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4580] Maltoporin (phage lambda and maltose receptor)
TIGRFAM ID:
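
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3927784
END_BP = 3929400
GENE_LENGTH_BP = 1617
PROTEIN_LENGTH_AA = 538


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1617, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 538, matches Protein Length
```

Under these assumptions the three fields agree: 3929400 - 3927784 + 1 = 1617 bp, and 1617 / 3 = 539 codons, of which one is the stop codon.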


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value: 0.551157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAGAC GAAATCTTAT TACCTCTGCC ATCTTATTAA TGGCACCGTT AGCCTTTTCT 
GCACAATCAT TGGCTGAATC ATTAACGGTG GAACAACGCC TTGAGTTATT AGAAAAGGCG
TTAAGAGAAA CGCAAAGCGA ACTCAAAAAG TATAAAGATG AAGAGAAGAA AAAGTATACG
CCAGCGACGG TGAATCGTAG CGTAAGTACG AATGATCAAG GGTATGCCGC CAATCCGTTC
CCGACCAGTA GTGCCGCAAA ACCTGATGCT GTACTGGTCA AAAATGAAGA GAAAAATGCC
AGTGAGACAG GCTCGATTTA TTCTTCCATG ACTCTGAAAG ATTTCAGTAA ATTTGTGAAA
GATGAAATTG GCTTTAGTTA CAACGGCTAT TACCGTTCTG GTTGGGGGAC CGCCTCTCAT
GGTTCACCTA AATCATGGGC GATTGGTTCT CTGGGCCGCT TTGGTAACGA ATACTCCGGC
TGGTTTGATT TGCAGTTAAA ACAACGTGTC TACAACGAAA ACGGCAAACG GGTTGATGCC
GTTGTGATGA TGGATGGTAA CGTTGGTCAG CAGTACTCTA CCGGCTGGTT TGGCGATAAC
GCCGGTGGCG AGAACTATAT GCAGTTCTCC GATATGTACG TTACCACCAA AGGTTTCCTG
CCCTTTGCGC CAGAGGCTGA TTTCTGGGTG GGTAAACACG GTGCGCCGAA AATTGAAATC
CAGATGCTTG ACTGGAAAAC GCAGCGTACT GATGCCGCAG CGGGTGTAGG TCTGGAAAAC
TGGAAAGTCG GTCCGGGTAA AATTGATATC GCGCTGGTTC GCGAAGATGT CGATGATTAC
GATCGCAGCC TGCAAAACAA ACAGCAGATT AATACCAATA CCATTGATTT ACGCTATAAA
GATATCCCGT TATGGGATAA AGCCACCTTA ATGGTAAGTG GTCGTTATGT CACGGCAAAC
GAAAGCGCAT CGGAAAAAGA TAATCAGGAT AATAACGGGT ATTATGACTG GAAAGATACC
TGGATGTTTG GCACATCTTT AACGCAGAAA TTTGATAAAG GTGGCTTCAA CGAATTCTCC
TTCCTGGTCG CGAATAATTC TATCGCCAGT AACTTTGGCC GTTATGCTGG CGCAAGTCCA
TTTACCACCT TTAATGGTCG TTATTATGGT GATCACACCG GCGGAACAGC GGTACGTCTG
ACTTCGCAGG GCGAAGCCTA TATTGGCGAT CATTTCATTG TAGCTAACGC GATTGTTTAC
TCCTTCGGTA ACGATATATA TAGCTACGAA ACAGGCGCCC ACTCTGATTT CGAATCTATT
CGTGCGGTTG TTCGCCCGGC CTATATTTGG GACCAATATA ACCAGACAGG TGTTGAACTG
GGCTATTTCA CCCAGCAAAA CAAAGATGCG AATAGTAATA AATTTAATGA GTCTGGTTAT
AAAACCACGC TCTTCCATAC CTTTAAAGTC AATACCAGTA TGTTGACCTC GCGTCCGGAA
ATTCGTTTCT ACGCCACGTA TATCAAAGCC CTGGAAAACG AACTGGATGG CTTCACCTTT
GAAGACAATA AAGACGACCA GTTTGCTGTC GGTGCCCAGG CTGAAATCTG GTGGTAA
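
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how a GC content figure like the one in the record could be recomputed. The helper names are hypothetical, and only the first row of the sequence is used here as sample input; the full sequence would be pasted in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above (six blocks of 10 bases).
first_row = "ATGTTTAGAC GAAATCTTAT TACCTCTGCC ATCTTATTAA TGGCACCGTT AGCCTTTTCT"
seq = clean_sequence(first_row)

print(len(seq))                     # 60
print(f"{gc_fraction(seq):.0%}")    # GC percentage of this row only
```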
 
Protein sequence
MFRRNLITSA ILLMAPLAFS AQSLAESLTV EQRLELLEKA LRETQSELKK YKDEEKKKYT 
PATVNRSVST NDQGYAANPF PTSSAAKPDA VLVKNEEKNA SETGSIYSSM TLKDFSKFVK
DEIGFSYNGY YRSGWGTASH GSPKSWAIGS LGRFGNEYSG WFDLQLKQRV YNENGKRVDA
VVMMDGNVGQ QYSTGWFGDN AGGENYMQFS DMYVTTKGFL PFAPEADFWV GKHGAPKIEI
QMLDWKTQRT DAAAGVGLEN WKVGPGKIDI ALVREDVDDY DRSLQNKQQI NTNTIDLRYK
DIPLWDKATL MVSGRYVTAN ESASEKDNQD NNGYYDWKDT WMFGTSLTQK FDKGGFNEFS
FLVANNSIAS NFGRYAGASP FTTFNGRYYG DHTGGTAVRL TSQGEAYIGD HFIVANAIVY
SFGNDIYSYE TGAHSDFESI RAVVRPAYIW DQYNQTGVEL GYFTQQNKDA NSNKFNESGY
KTTLFHTFKV NTSMLTSRPE IRFYATYIKA LENELDGFTF EDNKDDQFAV GAQAEIWW
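
The protein sequence follows from the gene sequence by codon translation. The sketch below shows one way to do this in plain Python. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so block-formatted input can be pasted directly.
    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTTTAGAC GAAATCTTAT TACCTCTGCC"))  # MFRRNLITSA
```

The output matches the first block of the protein sequence, and translating the last 15 bases of the gene (`GAAATCTGGTGGTAA`) gives `EIWW`, which matches the end of the protein.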