Gene EcSMS35_1607 details


Gene Information

Locus tag: EcSMS35_1607
Symbol:
ID: 6143300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1597258
End bp: 1598514
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 54%
IMG OID: 641616485
Product: putative voltage-gated ClC-type chloride channel ClcB
Protein accession: YP_001743663
Protein GI: 170682758
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0038] Chloride channel protein EriC
TIGRFAM ID:
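
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic, which the sketch below checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1597258, 1598514)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the computed lengths agree with the Gene Length (1257 bp) and Protein Length (418 aa) fields.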


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCGCC GTCTGCTTAT CGCAACAGTC GTCGGTATTC TTGCGGCCTT TGCCGTTGCC 
GGGTTTCGTC ATGCAATGCT GCTACTGGAG TGGTTGTTCC TCAATAATGA CTCCGGCAGT
CTGGTCAATG CAGCGACAAA TCTTTCCCCA TGGCGACGGT TGCTAACTCC GGCGCTCGGC
GGACTGGCGG CGGGTTTGTT GCTGATGGGC TGGCAGAAAT TTACCCAGCA ACGCCCTCAT
GCGCCGACCG ATTACATGGA AGCGTTGCAA ACCGATGGTC AGTTCGATTA CGCAGCAAGC
CTGGTTAAAT CGCTTGCCTC TCTGCTGGTA GTAACCAGCG GCAGTGCAAT TGGTCGCGAA
GGTGCGATGA TTCTTTTAGC TGCCCTTGCC GCCTCCTGTT TTGCCCAACG TTTTACGCCA
CGCCAGGAGT GGAAATTATG GATCGCCTGT GGAGCTGCGG CGGGAATGGC TGCGGCCTAT
CGAGCCCCGC TTGCTGGCAG TTTATTTATA GCCGAAGTGC TGTTTGGCAC TATGATGTTG
GCCTCTCTCG GCCCGGTGAT TATTTCCGCC GTCGTGGCGT TGCTGGTTAG CAATCTGATT
AATCATAGCG ACGCGTTACT CTACAGCGTA CAACTCTCAG TGACGGTTCA GGCTCGTGAC
TATGCGCTGA TTATCAGTAC AGGTGTGCTG GCAGGTCTGT GCGGACCACT GTTGTTAACG
TTAATGAACG CCTGTCATCG TGGATTTGTA AGTCTCAAAC TTGCGCCGCC CTGGCAACTG
GCACTGGGCG GGTTGATCGT GGGTCTGCTT TCCCTGTTCA CACCTGCAGT GTGGGGCAAC
GGCTATAGCA CCGTACAATC CTTTTTAACC GCCCCGCCAC TGTTAATGAT CATTGCCGGG
ATCTTCCTCT GTAAACTGTT TGCCGTGCTG GCGAGTAGTG GTTCTGGCGC GCCCGGTGGT
GTCTTTACAC CGACGCTATT TATCGGTCTT GCCATTGGCA TGTTGTATGG TCGTAGCCTG
GGATTATGGT TCCCTGATGG CGAAGAAATT ACGCTTTTAC TCGGATTGAC CGGGATGGCG
ACACTATTGG CGGCAACCAC GCACGCGCCG ATTATGTCGA CGTTGATGAT ATGTGAAATG
ACCGGGGAGT ATCAGCTACT CCCCGGTTTA TTGATTGCCT GCGTAATTGC GTCGGTAATT
TCGCGGACGT TACACCGTGA CTCTATCTAC CGCCAGCACA CTGCGAAGCA TAGCTAA
 
Protein sequence
MFRRLLIATV VGILAAFAVA GFRHAMLLLE WLFLNNDSGS LVNAATNLSP WRRLLTPALG 
GLAAGLLLMG WQKFTQQRPH APTDYMEALQ TDGQFDYAAS LVKSLASLLV VTSGSAIGRE
GAMILLAALA ASCFAQRFTP RQEWKLWIAC GAAAGMAAAY RAPLAGSLFI AEVLFGTMML
ASLGPVIISA VVALLVSNLI NHSDALLYSV QLSVTVQARD YALIISTGVL AGLCGPLLLT
LMNACHRGFV SLKLAPPWQL ALGGLIVGLL SLFTPAVWGN GYSTVQSFLT APPLLMIIAG
IFLCKLFAVL ASSGSGAPGG VFTPTLFIGL AIGMLYGRSL GLWFPDGEEI TLLLGLTGMA
TLLAATTHAP IMSTLMICEM TGEYQLLPGL LIACVIASVI SRTLHRDSIY RQHTAKHS
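
As a rough illustration of how the protein sequence follows from the gene sequence, here is a minimal sketch that translates a space-grouped DNA string and computes its GC percentage. It is an assumption-laden helper, not the database's own method: it uses the standard codon-to-amino-acid assignments (which translation table 11 shares for elongation), ignores the table's alternative start codons (not needed here, since the gene starts with ATG), and all function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; translation table 11
# uses the same codon-to-amino-acid mapping ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTTCCGCC GTCTGCTTAT CGCAACAGTC"))
```

Applied to the first 30 bases, this yields the first ten residues of the protein sequence (MFRRLLIATV). Running `gc_percent` on the full gene sequence gives a value that can be compared with the GC content field.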