Gene ECH74115_1844 details

Gene Information

Locus tag: ECH74115_1844
Symbol:
ID: 6966856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1749753
End bp: 1750802
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 54%
IMG OID: 643385780
Product: hypothetical protein
Protein accession: YP_002270270
Protein GI: 209399600
COG category:
COG ID:
TIGRFAM ID:
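
The coordinates and lengths reported in this record agree with each other, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a single trailing stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one untranslated trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1749753, 1750802)
print(length)                  # 1050
print(protein_length(length))  # 349
```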


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000141733
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0000000024539
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGGGTAT TACTTCGACC TGTTCTGGTA CCGGAACTCG GTCTGGTTAT CGTTAAGCCA 
GGCCGTGAAT CAATGTCAGC ATTCCATAAC GGCAGAATAC TGGTGGAGCC GGAACCAAAA
AGCATGCGAG CTCTGCCGTC CGGGGTTGTA CCTGCCGTTC ACCAGCCGCT GGCGGAAGAT
AAATCACTAC TGCCATTTTT CAGCGATGAG CGGGTGATCC GTGCTGCGGG TGGCGCTGGT
GCACTGTCTG ACTGGTTATT ACGTCACGTG AAATCCTGCC AGTGGCTACA CGGTGATTAT
CATCACAGCG AAACCGTCAT TCACCGTTAC GGTACCGGCG CGATGGTGTT GTGCTGGCAC
TGCGACAACC AGCTGCGGGA GCAGACATCT GATTCACTGG ATCAACTTGC TCAACAGAAT
CTGGCCGCCT GGATGATTGA CATCATCCGT CACGCAATGA ATGGCGCACA GGAGCGTGAA
TTATCTCTGG CTGAATTATC CTGGTGGGCG GCCTGCAATC AGGTGGTGGA TGCACTACCT
GAGGCAGTAG CGCGTCGTTC TCTGGGATTA CCGGCGGAAA AAATCCGCTC CGTATACCGT
GAAAGCGACA TCATACCGGG AGAACAGACC GCCACCAGCA TACTGAAGCA GCGCACAAAA
AATATTGCGC TACCGCCTCA CACCCACCAG CAACAGAACC CACCACAGGA AAAGACGGTG
GTCAGCATTG CCGTTGATCC GGAGTCTCCG GAATCCTTCA TGAAACGACC TAAACGTCGC
CGCTGGGTAA ATGAGAAATA CACACGCTGG GTAAAGACAC AGCCGTGTGC GTGTTGTGGT
AAGCCAGCCG ACGATCCGCA TCACCTGATT GGTCATGGTC AGGGCGGAAT GGGGACAAAA
TCTCACGATA TTTTCACGCT ACCGCTGTGT CGGGAGCATC ACAACGAGCT TCATGCGGAT
CCGCTGGCGT TCGAAGAAAA GCATGGTTCT CAGGTTGATT TAATTTTTCG TTTTCTTGAT
CACGCCTTTG CAACCGGCGT GCTCGGGTAA
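
The record reports a GC content of 54% for this gene. The page does not say how that figure is derived, so the sketch below assumes the usual definition: the share of G and C bases among all bases in the sequence. The function name `gc_percent` is hypothetical. Whitespace is stripped first because the sequence above is printed in space-separated blocks of ten bases.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Assumes GC content means (G + C) / total bases; whitespace is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First ten-base block of the gene sequence above: 5 G, 1 C, 3 T, 1 A.
print(gc_percent("GTGCGGGTAT"))  # 60.0
```

Passing the full 1050 bp sequence to the same function gives a value that can be compared against the reported 54%.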
 
Protein sequence
MRVLLRPVLV PELGLVIVKP GRESMSAFHN GRILVEPEPK SMRALPSGVV PAVHQPLAED 
KSLLPFFSDE RVIRAAGGAG ALSDWLLRHV KSCQWLHGDY HHSETVIHRY GTGAMVLCWH
CDNQLREQTS DSLDQLAQQN LAAWMIDIIR HAMNGAQERE LSLAELSWWA ACNQVVDALP
EAVARRSLGL PAEKIRSVYR ESDIIPGEQT ATSILKQRTK NIALPPHTHQ QQNPPQEKTV
VSIAVDPESP ESFMKRPKRR RWVNEKYTRW VKTQPCACCG KPADDPHHLI GHGQGGMGTK
SHDIFTLPLC REHHNELHAD PLAFEEKHGS QVDLIFRFLD HAFATGVLG
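
The gene starts with GTG yet the protein starts with M, which is consistent with translation table 11 (bacterial), where GTG can act as an initiator codon. The sketch below shows how the protein sequence could be derived from the gene sequence. It makes two assumptions that are stated here rather than taken from the page: the first codon is always the start codon and is read as methionine, and a trailing stop codon is dropped. The name `translate_cds` is hypothetical. The codon table is built from the standard NCBI base ordering (TCAG), which table 11 shares with the standard code for all internal codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, ignoring whitespace in the input.

    Assumptions: the first codon is the initiator and is read as M,
    and a single trailing stop codon is removed from the result.
    """
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)]
    if residues:
        residues[0] = "M"      # initiator codon (GTG here) is read as Met
    if residues and residues[-1] == "*":
        residues.pop()         # drop the terminal stop codon
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGGGTAT TACTTCGACC TGTTCTGGTA"))  # MRVLLRPVLV
```

The output for the first 30 bases matches the first ten residues of the protein sequence listed above.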