Gene ECH74115_1900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1900 
Symbol 
ID6972192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1792780 
End bp1794363 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content47% 
IMG OID643385833 
Producthypothetical protein 
Protein accessionYP_002270322 
Protein GI209399841 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00000762932 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGGGA AGTTTCGCTG CATTTTGCTG TTGATAGTTG GGCTTTTTTT CTCTTCGTTA 
AGTTATGCGA AAAACACGGA GATGCCTTCT TATGAAGAAG GGATCTCGCT CTTTGATGTT
GAAGCCACTC TGCAACCGGA TGGGGTGCTC GACATCAAAG AAAATATTCA TTTTCAGGCG
CGAAATCAGC AGATTAAGCA CGGATTTTAT CGTGATTTAC CACGACTATG GATGCAGCCT
GATGGGGACG CTGCACTGCT GAACTATCAT ATTGTTGGCG TCACCCGTGA TGGTATTCCT
GAACCCTGGC ATCTTGACTG GCATATCGGG TTAATGAGTA TTGTCGTGGG CGATAAACAA
CGTTTCTTGC CTCAAGGCGA CTATCATTAT CAAATTCATT ATCAGGTTAA AAATGCTTTC
CTGCGTGAGG GAGATTCAGA TCTGTTAATC TGGAACGTGA CTGGTAACCA CTGGCCGTTT
GAAATCTATA AGACCCGATT TTCACTCAAG TTCCCTGATA TCGCGGGTAA TCCATTTAGC
GAAATCGATC TCTTTACTGG AGAAGAGGGC GACACATATC GAAATGGCCG CATCCTTGAG
GACGGAAGAA TTGAATCCAG CGATCCGTTT TATCGTGAAG ATTTCACGGT ACTCTACCGT
TGGCCTCACT CGTTACTTAG CAATGCCCCG GCTCCACAAA CGACGAATAT TTTCAGCCAT
CTTCTTTTAC CCTCCACGTC ATCGTTGTTA ATTTGGTTTC CGTGTCTATT CCTGGTTTGT
GGATGGTTAT ATCTCTGGAA GCGCAGGCCG CAATTTACGT CGGTAGATGT TTTACGGATG
CAGAAATTTT ATCTGCCGCG TAAAAAGTCT TCGTTTTACC GGCCTGATAC TTTTTTGCAA
TGGGGTGGAC TGGCAATATT GGCGGTCATT CTTTACGGTA ACCTGAGTCC TGTAGGCTGG
GCAGGAATGA GTCTGGTGGG CGATATGTTT ATTATGATCT GCTGGCTTCT TCCTTTTTTA
TTTTGTTCCC TTGAGCTTTT GTTTGCCCGC GATGATGACA AGCCTTGCGT TAATCGTGTA
ATCATCACTT TGTTTTTACC GCTGATTTGT TCAGGCGTGG CCTTTTATTC TCTCTATATC
AATGTCGGAG ATGTATTCTT TTACTGGTAT ATGCCAGCGG GTTATTTTAG CGCTGTTTTC
CTGACCGGTT ATCTCACTGG CATGGGGTAT ATTTTTCTGC CAAAGTTTAC CCAAACTGGG
CAGCAACGTT ATGCCCACGG TGAAGCTATC GTTAACTATC TCGCGCGTAA AGAGGCAGCA
ACACACAGTG GGCGTCGGCG GAAAGGGGAA ACACGGAAAC TGGATTACGC GTTGCTAGGT
TGGGCTGTCT CAGCAAACCT TGGAAGAGAA TGGGTAGCAC GTATCACCCC ATCACTCACA
GCGGCTGTTC GCGCCCCGGA AATTGCCCGT AGTGGCGTTT TGTTCTCATT ACAGATGCAC
CTGAGTCTGG GGGCCAATAC CAGTTTATTG GGGCGAAGTT ATTCCGGTGG TGCTGCGGGT
GGCGGAGGCG GTGGTGGCTG GTAA
 
Protein sequence
MAGKFRCILL LIVGLFFSSL SYAKNTEMPS YEEGISLFDV EATLQPDGVL DIKENIHFQA 
RNQQIKHGFY RDLPRLWMQP DGDAALLNYH IVGVTRDGIP EPWHLDWHIG LMSIVVGDKQ
RFLPQGDYHY QIHYQVKNAF LREGDSDLLI WNVTGNHWPF EIYKTRFSLK FPDIAGNPFS
EIDLFTGEEG DTYRNGRILE DGRIESSDPF YREDFTVLYR WPHSLLSNAP APQTTNIFSH
LLLPSTSSLL IWFPCLFLVC GWLYLWKRRP QFTSVDVLRM QKFYLPRKKS SFYRPDTFLQ
WGGLAILAVI LYGNLSPVGW AGMSLVGDMF IMICWLLPFL FCSLELLFAR DDDKPCVNRV
IITLFLPLIC SGVAFYSLYI NVGDVFFYWY MPAGYFSAVF LTGYLTGMGY IFLPKFTQTG
QQRYAHGEAI VNYLARKEAA THSGRRRKGE TRKLDYALLG WAVSANLGRE WVARITPSLT
AAVRAPEIAR SGVLFSLQMH LSLGANTSLL GRSYSGGAAG GGGGGGW