Gene ECH74115_2303 details


Gene Information

Locus tag: ECH74115_2303
Symbol:
ID: 6967335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2171411
End bp: 2172667
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 54%
IMG OID: 643386180
Product: putative voltage-gated ClC-type chloride channel ClcB
Protein accession: YP_002270664
Protein GI: 209397921
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0038] Chloride channel protein EriC
TIGRFAM ID:
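
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 1257 bp) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2171411, 2172667)
print(length, protein_length(length))  # prints "1257 418"
```

Both results agree with the Gene Length and Protein Length fields of the record.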


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.0264865
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCGCC GTCTGCTTAT CGCAACAGTC GTCGGTATTC TCGCGGCCTT TGCCGTTGCC 
GGGTTTCGTC ATGCGATGCT GCTACTGGAG TGGTTGTTCC TCAATAATGA CTCCGGCAGT
CTGGTCAATG CAGCGACAAA CCTTTCCCCC TGGCGACGGC TGCTAACTCC GGCGCTCGGC
GGACTGGCGG CGGGTTTGTT GTTGATGGGC TGGCAGAAAT TTACCCAGCA ACGCCCTCAT
GCGCCGACCG ATTACATGGA AGCGTTGCAA ACCGATGGTC AGTTTGATTA CGCAGCAAGC
CTGGTTAAAT CGCTTGCCTC TCTGCTGGTA GTAACCAGCG GCAGTGCAAT TGGTCGCGAA
GGTGCGATGA TTCTTTTAGC TGCCCTTGCC GCCTCCTGTT TTGCCCAACG TTTTACGCCA
CGCCAGGAGT GGAAATTATG GATCGCCTGT GGGGCCGCGG CGGGAATGGC TGCGGCCTAT
CGTGCCCCGC TTGCTGGCAG TTTATTTATA GCCGAAGTGC TGTTTGGCAC TATGATGTTG
GCCTCTCTCG GCCCGGTGAT TATTTCCGCC GTCGTGGCAT TGCTGGTTAG CAATCTGATT
AATCATAGCG ACGCGTTACT CTACAGCGTA CAACTCTCAG TGACGGTTCA GGCTCGTGAC
TATGCGCTGA TTATCAGTAC AGGTGTGCTG GCAGGTCTGT GCGGACCACT GTTGTTAACG
TTAATGAACG CCTGTCATCG TGGATTTGTG AGTCTCAAAC TTGCGCCGCC CTGGCAACTG
GCACTAGGCG GGTTGATCGT GGGTCTGCTT TCCCTGTTCA CACCTGCAGT GTGGGGCAAC
GGCTATAGCA CCGTACAATC CTTTTTAACC GCCCCGCCAC TGTTAATGAT CATTGCCGGG
ATCTTCCTTT GTAAACTGTG TGCCGTGCTG GCGAGTAGCG GTTCCGGCGC ACCCGGTGGT
GTCTTTACAC CGACGCTATT TATCGGTCTT GCCATTGGCA TGTTGTATGG TCGTAGTCTG
GGATTATGGT TCCCTGATGG CGAAGAAATT ACGCTTTTAC TCGGATTGAC CGGGATGGCG
ACACTGTTGG CGGCAACCAC GCACGCGCCG ATTATGTCGA CGTTGATGAT ATGTGAAATG
ACCGGGGAGT ATCAGCTACT CCCCGGTTTA TTGATTGCCT GCGTAATTGC GTCGGTAATC
TCGCGGACGT TACACCGTGA CTCTATCTAC CGCCAGCACA CTGCGCAGCA TAGCTAA
 
Protein sequence
MFRRLLIATV VGILAAFAVA GFRHAMLLLE WLFLNNDSGS LVNAATNLSP WRRLLTPALG 
GLAAGLLLMG WQKFTQQRPH APTDYMEALQ TDGQFDYAAS LVKSLASLLV VTSGSAIGRE
GAMILLAALA ASCFAQRFTP RQEWKLWIAC GAAAGMAAAY RAPLAGSLFI AEVLFGTMML
ASLGPVIISA VVALLVSNLI NHSDALLYSV QLSVTVQARD YALIISTGVL AGLCGPLLLT
LMNACHRGFV SLKLAPPWQL ALGGLIVGLL SLFTPAVWGN GYSTVQSFLT APPLLMIIAG
IFLCKLCAVL ASSGSGAPGG VFTPTLFIGL AIGMLYGRSL GLWFPDGEEI TLLLGLTGMA
TLLAATTHAP IMSTLMICEM TGEYQLLPGL LIACVIASVI SRTLHRDSIY RQHTAQHS