Gene ECH74115_4902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4902 
Symbol 
ID6967675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4541721 
End bp4543280 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content49% 
IMG OID643388588 
Producthypothetical protein 
Protein accessionYP_002273016 
Protein GI209397670 
COG category 
COG ID 
TIGRFAM ID[TIGR03369] cellulose biosynthesis protein BcsE 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCCTG TATTCTCTAT CGGTATCTCA TCATTATGGG ATGAGCTGCG ACATATGCCA 
GCAGGCGGCG TCTGGTGGTT TAACGTCGAT CGCCATGAAG ATGCTATCAG TCTGGCGAAT
CAAACAATTG CATCCCAGGC TGAAACCGCA CACGTCGCGG TCATTAGCAT GGACAGCGAT
CCGGCGAAAA TCTTTCAATT AGATGATTCT CAAGGGCCGG AAAAAATAAA ATTATTTTCA
ATGCTAAATC ATGAAAAAGG TCTATACTAT TTGGCCCGTG ATTTGCAGTG TTCTATTGAT
CCCCATAATT ACCTTTTTAT TCTTGTTTGC GCAAATAACG CATGGCAAAA CATTCCTGCC
GAGCGGCTGC GCTCATGGTT GGATAAAATG AATAAATGGA GCAGGTTAAA CCATTGTTCG
CTTTTGGTAA TTAATCCCGG AAATAATAAC GATAAACAAT TTTCATTGTT GCTTGAGGAA
TACCGTTCAC TTTTTGGTCT TGCCAGTTTG CGTTTTCAGG GCGACCAACA TTTGCTGGAT
ATTGCCTTCT GGTGCAACGA AAAAGGGGTC AGCGCCCGTC AGCAGCTTAG CGTTCAGCAA
CAAAATGGTA TCTGGACATT AGTTCAAAGC GAAGAGGCGG AGATCCAACC ACGCAGCGAC
GAAAAACGCA TTCTGAGTAA TGTTGCTGTA CTGGAAGGTG CGCCGCCGCT ATCGGAACAC
TGGCAACTGT TCAACAATAA CGAAGTCCTG TTCAATGAAG CCCGTACCGC TCAGGCGGCG
ACGGTGGTCT TTTCTTTACA GCAAAATGCG CAAATCGAGC CACTGGCCCG CAGCATTCAT
ACCCTGCGTC GCCAGCGCGG TAGTGCGATG AAAATCCTCG TGCGGGAAAA TACCGCTAGC
CTGCGCGCCA CCGATGAACG TTTGTTATTG GCCTGCGGTG CAAATATGGT TATTCCGTGG
AATGCGCCAC TCTCCCGTTG TCTGACGATG ATCGAAAGCG TGCAAGGGCA GAAGTTTAGT
CGCTATGTGC CGGAAGATAT CACTACCTTG CTGTCAATGA CCCAGCCGCT CAAACTGCGT
GGTTTCCAGA AGTGGGATGT GTTCTGTAAT GCCGTCAACA ACATGATGAA TAACCCTCTA
TTACCTGCCC ACGGTAAAGG CGTTCTGGTT GCCCTACGTC CGGTACCGGG TATCCGCGTT
GAACAAGCCC TGACGCTGTG TCGCCCTAAC CGTACCGGCG ATATCATGAC CATTGGCGGT
AATCGGCTGG TGCTGTTTCT CTCATTCTGT CGGATTAACG ATCTGGATAC CGCGTTGAAT
CATATTTTCC CATTGCCTAC TGGCGACATT TTCTCAAACC GTATGGTCTG GTTTGAAGAT
GATCAAATCA GTGCCGAGCT GGTGCAGATG CGCCTGCTTG CCCCAGAACA ATGGGGCATG
CCGCTGCCTT TAACGCAAAG TTCTAAACCG GTCATCAATG CCGAGCACGA TGGTCGCCAC
TGGCGACGAA TACCAGAACC CATGCGACTG TTAGATGATG CTGTGGAGCG CTCATCATGA
 
Protein sequence
MDPVFSIGIS SLWDELRHMP AGGVWWFNVD RHEDAISLAN QTIASQAETA HVAVISMDSD 
PAKIFQLDDS QGPEKIKLFS MLNHEKGLYY LARDLQCSID PHNYLFILVC ANNAWQNIPA
ERLRSWLDKM NKWSRLNHCS LLVINPGNNN DKQFSLLLEE YRSLFGLASL RFQGDQHLLD
IAFWCNEKGV SARQQLSVQQ QNGIWTLVQS EEAEIQPRSD EKRILSNVAV LEGAPPLSEH
WQLFNNNEVL FNEARTAQAA TVVFSLQQNA QIEPLARSIH TLRRQRGSAM KILVRENTAS
LRATDERLLL ACGANMVIPW NAPLSRCLTM IESVQGQKFS RYVPEDITTL LSMTQPLKLR
GFQKWDVFCN AVNNMMNNPL LPAHGKGVLV ALRPVPGIRV EQALTLCRPN RTGDIMTIGG
NRLVLFLSFC RINDLDTALN HIFPLPTGDI FSNRMVWFED DQISAELVQM RLLAPEQWGM
PLPLTQSSKP VINAEHDGRH WRRIPEPMRL LDDAVERSS