Gene ECH74115_3155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3155 
Symbol 
ID6969352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2921597 
End bp2922646 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content54% 
IMG OID643386978 
Producthypothetical protein 
Protein accessionYP_002271445 
Protein GI209397571 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000300414 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000000000395416 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGGGTAT TACTTCGACC TGTTCTGGTA CCGGAACTCG GTCTGGTTAT CGTTAAGCCA 
GGCCGTGAAT CAATGTCAGC ATTCCATAAC GGCAGAATAC TGGTGGAGCC GGAACCAAAA
AGCATGCGAG CTCTGCCGTC CGGGGTTGTA CCTGCCGTTC ACCAGCCGCT GGCGGAAGAT
AAATCACTAC TGCCATTTTT CAGCGATGAG CGGGTGATCC GTGCTGCGGG TGGCGCTGGT
GCACTGTCTG ACTGGTTATT ACGTCACGTG AAATCCTGCC AGTGGCTACA CGGTGATTAT
CATCACAGCG AAACCGTCAT TCACCGTTAC GGTACCGGCG CGATGGTGTT GTGCTGGCAC
TGCGACAACC AGCTGCGGGA GCAGACATCT GATTCACTGG ATCAACTTGC TCAACAGAAT
CTGGCCGCCT GGATGATTGA CATCATCCGT CACGCAATGA ATGGCGCACA GGAGCGTGAA
TTATCTCTGG CTGAATTATC CTGGTGGGCG GCCTGCAATC AGGTGGTGGA TGCACTACCT
GAGGCAGTAG CGCGTCGTTC TCTGGGATTA CCGGCGGAAA AAATCCGCTC CGTATACCGT
GAAAGCGACA TCATACCGGG AGAACAGACC GCCACCAGCA TACTGAAGCA GCGCACAAAA
AATATTGCGC TACCGCCTCA CACCCACCAG CAACAGAACC CACCACAGGA AAAGACGGTG
GTCAGCATTG CCGTTGATCC GGAGTCTCCG GAATCCTTCA TGAAACGACC TAAACGTCGC
CGCTGGGTAA ATGAGAAATA CACACGCTGG GTAAAGACAC AGCCGTGTGC GTGTTGTGGT
AAGCCAGCCG ACGATCCGCA TCACCTGATT GGTCATGGTC AGGGCGGAAT GGGGACAAAA
TCTCACGATA TTTTCACGCT ACCGCTGTGT CGGGAGCATC ACAACGAGCT TCATGCGGAT
CCGCTGGCGT TCGAAGAAAA GCATGGTTCT CAGGTTGATT TAATTTTTCG TTTTCTTGAT
CACGCCTTTG CAACCGGCGT GCTCGGGTAA
 
Protein sequence
MRVLLRPVLV PELGLVIVKP GRESMSAFHN GRILVEPEPK SMRALPSGVV PAVHQPLAED 
KSLLPFFSDE RVIRAAGGAG ALSDWLLRHV KSCQWLHGDY HHSETVIHRY GTGAMVLCWH
CDNQLREQTS DSLDQLAQQN LAAWMIDIIR HAMNGAQERE LSLAELSWWA ACNQVVDALP
EAVARRSLGL PAEKIRSVYR ESDIIPGEQT ATSILKQRTK NIALPPHTHQ QQNPPQEKTV
VSIAVDPESP ESFMKRPKRR RWVNEKYTRW VKTQPCACCG KPADDPHHLI GHGQGGMGTK
SHDIFTLPLC REHHNELHAD PLAFEEKHGS QVDLIFRFLD HAFATGVLG