Gene ECH74115_2268 details


Gene Information

Locus tag: ECH74115_2268
Symbol:
ID: 6971247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2146985
End bp: 2148034
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 54%
IMG OID: 643386150
Product: hypothetical protein
Protein accession: YP_002270634
Protein GI: 209400833
COG category:
COG ID:
TIGRFAM ID:
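
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and exist only for this illustration.

```python
# Values copied from the record above.
START_BP = 2146985
END_BP = 2148034


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(START_BP, END_BP)
    print(length, "bp")
    print(protein_length_aa(length), "aa")
```

Under these assumptions the sketch gives 1050 bp and 349 aa, which agrees with the Gene Length and Protein Length fields.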


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.582934
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.000000000247351
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGGGTAT TACTTCGACC TGTTCTGGTA CCGGAACTCG GTCTGGTTAT CGTTAAGCCA 
GGCCGTGAAT CAATGTCAGC ATTCCATAAC GGCAGAATAC TGGTGGAGCC GGAACCAAAA
AGCATGCGAG CTCTGCCGTC CGGGGTTGTA CCTGCCGTTC ACCAGCCGCT GGCGGAAGAT
AAATCACTAC TGCCATTTTT CAGCGATGAG CGGGTGATCC GTGCTGCGGG TGGCGCTGGT
GCACTGTCTG ACTGGTTATT ACGTCACGTG AAATCCTGCC AGTGGCTACA CGGTGATTAT
CATCACAGCG AAACCGTCAT TCACCGTTAC GGTACCGGCG CGATGGTGTT GTGCTGGCAC
TGCGACAACC AGCTGCGGGA GCAGACATCT GATTCACTGG ATCAACTTGC TCAACAGAAT
CTGGCCGCCT GGATGATTGA CATCATCCGT CACGCAATGA ATGGCGCACA GGAGCGTGAA
TTATCTCTGG CTGAATTATC CTGGTGGGCG GTCCGCAATC AGGTGGCGGA CGCGCTACCG
GAAGCGGTAT TACGTCGTTC GCTGGGGTTG CGTGCGGAAA AAATCCGCTC CGTATACCGT
GAAAGCGACA TCATACCGGG AGAACAGACC GCCACCAGCA TACTGAAGCA GCGCACAAAA
AATATTGCGC TACCGCCTCA CACCCACCAG CAACAGAACC CACCACAGGA AAAGACGGTG
GTCAGCATTG CCGTTGATCC GGAGTCTCCG GAATCCTTCA TGAAACGACC TAAACGTCGC
CGCTGGGTAA ATGAGAAATA CACACGCTGG GTAAAGACAC AGCCGTGTGC GTGTTGTGGT
AAGCCAGCGG ACGATCCTCA TCATCTGATT GGTCATGGTC AGGGCGGAAT GGGAACAAAA
TCCCACGATA TTTTCACGCT ACCGCTGTGT CGGGAGCATC ACAACGAGCT TCATGCGGAT
CCGCTGGCGT TCGAAGAAAA GCATGGTTCC CAGGTTGATT TAATTTTTCG TTTTCTTGAT
CACGCCTTTG CAACCGGCGT GCTCGGGTAA
 
Protein sequence
MRVLLRPVLV PELGLVIVKP GRESMSAFHN GRILVEPEPK SMRALPSGVV PAVHQPLAED 
KSLLPFFSDE RVIRAAGGAG ALSDWLLRHV KSCQWLHGDY HHSETVIHRY GTGAMVLCWH
CDNQLREQTS DSLDQLAQQN LAAWMIDIIR HAMNGAQERE LSLAELSWWA VRNQVADALP
EAVLRRSLGL RAEKIRSVYR ESDIIPGEQT ATSILKQRTK NIALPPHTHQ QQNPPQEKTV
VSIAVDPESP ESFMKRPKRR RWVNEKYTRW VKTQPCACCG KPADDPHHLI GHGQGGMGTK
SHDIFTLPLC REHHNELHAD PLAFEEKHGS QVDLIFRFLD HAFATGVLG
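
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments and that an alternative start codon such as GTG, when it is the first codon, is read as methionine. The names `translate_cds`, `gc_percent`, and `START_CODONS` are hypothetical helpers for this illustration, not part of any database tool.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino = CODON_TABLE[codon]
        if amino == "*":
            break
        if pos == 0 and codon in START_CODONS:
            amino = "M"  # an initiator codon is read as methionine
        protein.append(amino)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence from the record.
    first_60 = ("GTGCGGGTAT TACTTCGACC TGTTCTGGTA "
                "CCGGAACTCG GTCTGGTTAT CGTTAAGCCA")
    print(translate_cds(first_60))
```

Run on the first 60 bases of the gene sequence, the sketch prints MRVLLRPVLVPELGLVIVKP, which matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a figure to compare against the GC content field.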