Gene ECH74115_3621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3621 
Symbol 
ID6968730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3341880 
End bp3343136 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content54% 
IMG OID643387416 
Producthypothetical protein 
Protein accessionYP_002271875 
Protein GI209397098 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0038] Chloride channel protein EriC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.504533 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCATC CGCGAGCCAG AACCATGTTG TTATTATCGC TCCCCGCCGT GGCAATTGGG 
ATTGCGTCCA GTCTTATTCT GATTGTGGTG ATGAAAATCG CCTCGGTATT ACAGAATTTG
CTCTGGCAAC GACTGCCGGG AACTCTGGGG ATAGCCCAGG ATTCACCCCT CTGGATCATC
GGTGTATTAA CGCTAACGGG TATTGCGGTG GGGTTGGTTA TCCGTTTCAG CCAGGGTCAT
GCCGGACCAG ACCCCGCCTG TGAACCGCTG ATCGGCGCAC CGGTTCCGCC CTCTGCGCTA
CCAGGACTTA TCGTAGCATT AATTCTCGGT CTTGCTGGCG GCGTCAGCCT GGGGCCGGAA
CATCCGATCA TGACCGTCAA TATCGCCCTT GCGGTTGCCA TTGGCGCTCG TCTGTTACCG
CGCGTCAACC GAATGGAGTG GACTATTTTA GCCTCTGCCG GAACCATCGG CGCGCTGTTT
GGTACACCTG TTGCGGCGGC GTTGATATTT TCGCAAACCT TAAATGGCAG TAGTGAAGTT
CCGCTATGGG ATCGTCTTTT TGCGCCGTTA ATGGCGGCAG CAGCTGGTGC ACTTACTACC
GGATTATTTT TCCATCCCCA TTTTTCACTG CCCATTGCGC ATTACGGACA GATGGAAATG
ACCGATATTC TCAGCGGTGC AATTGTCGCG GCGATTGCCA TCGCAGCAGG GATGGTCGCC
GTATGGTGCT TACCACGGTT GCACGCGATG ATGCATCAAA TGAAAAATCC GGTGCTCGTG
CTGGGTATTG GCGGATTTAT TCTCGGTATT CTGGGGGTTA TTGGTGGACC AGTTTCGCTG
TTTAAAGGGC TGGATGAGAT GCAGCAGATG GTGGCAAATC AGGCTTTCAG CACCAGCGAT
TACTTTTTGC TGGCGGTAAT TAAACTTGCC GCCCTGGTCG TTGCTGCCGC CAGTGGCTTT
CGCGGTGGGC GAATCTTCCC GGCAGTGTTT GTCGGCGTGG CATTAGGGTT GATGCTGCAT
GAGCACGTTC CCGCCGTACC AGCGGCAATA ACCGTTTCCT GCGCTATTCT CGGCATCGTG
CTGGTGGTAA CACGCGATGG CTGGTTAAGT CTTTTTATGG CGGCAGTCGT TGTACCCAAT
ACCACATTGC TACCGCTGCT CTGTATCGTC ATGCTTCCGG CATGGCTGTT ATTAGCAGGT
AAGCCGATGA TGATGGTCAA TCGTCCGAAG CAACAGCCAC CCCACGATAA CGTTTAG
 
Protein sequence
MLHPRARTML LLSLPAVAIG IASSLILIVV MKIASVLQNL LWQRLPGTLG IAQDSPLWII 
GVLTLTGIAV GLVIRFSQGH AGPDPACEPL IGAPVPPSAL PGLIVALILG LAGGVSLGPE
HPIMTVNIAL AVAIGARLLP RVNRMEWTIL ASAGTIGALF GTPVAAALIF SQTLNGSSEV
PLWDRLFAPL MAAAAGALTT GLFFHPHFSL PIAHYGQMEM TDILSGAIVA AIAIAAGMVA
VWCLPRLHAM MHQMKNPVLV LGIGGFILGI LGVIGGPVSL FKGLDEMQQM VANQAFSTSD
YFLLAVIKLA ALVVAAASGF RGGRIFPAVF VGVALGLMLH EHVPAVPAAI TVSCAILGIV
LVVTRDGWLS LFMAAVVVPN TTLLPLLCIV MLPAWLLLAG KPMMMVNRPK QQPPHDNV