Gene ECH74115_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2199 
Symbol 
ID6968130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2100044 
End bp2101093 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content54% 
IMG OID643386090 
Producthypothetical protein 
Protein accessionYP_002270577 
Protein GI209397705 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value6.39622e-09 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value1.28788e-10 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGGGTAT TACTTCGACC TGTTCTAGTT CCGGAACTCG GGCTGGTGGT CCTTAAGCCC 
GGTCGTGAAT CATTGCCAGT TTTTCATCGC GGCAGGGTGC TGGTGGAGCC GGAACCGAAA
AACATGCGGG CGCTGCCATC TGGAGCGGTT CCTGCTGTTC GCCAGCCGCT GGCGGAAGAT
AAATCACTGC TGCCATTTTT CAGCGATGAG CGGGTGATTC GTGCAGCTGG CGGCGCTGGT
GCACTGTCTG ACTGGTTATT ACGTCACGTG AAATCCTGCC AGTGGCCACA CGGCGATTAT
CATCACAGCG AAACCGTTAT TCACAGTTAC GGTGCTGGCG CAATGGTGTT GTGCTGGCAC
TGCGACAACC AGCTGCGCGA CCAGACCTCC GAATCACTTG AGCAACTTAC TCAACAAAAT
CTGACAGCCT GGATGATTGA CGTCATACGC CATGTAATGA ATGGCACGCA GGAGCGGGAA
TTATCGCTGG CTGAATTATC CTGGTGGGCA GTCTGCAATC AGGTGGTGGA CGCATTACCT
GAGGCAGTAT CGCGTCGCTC TCTGGGATTA CCGGCGGAAA AAATCCGCTC CGTATACCGT
GAAAGCGACA TCATACCGGG AGAACAGACC GCCACCAGCA TACTGAAGCA GCGCACAAAA
AATATTGCGC TACCGCCTCA CACCCACCAG CAACAGAACC CACCACAGGA AAAGACGGTG
GTCAGCATTG CCGTTGATCC GGAGTCTCCG GAATCCTTCA TGAAACGACC TAAACGTCGC
CGCTGGGTAA ATGAGAAATA CACACGCTGG GTAAAGACAC AGCCGTGTGC GTGTTGTGGT
AAGCCAGCGG ACGATCCTCA TCATCTGATT GGTCATGGTC AGGGCGGAAT GGGAACAAAA
TCCCACGATA TTTTCACGCT ACCGCTGTGT CGGGAGCATC ACAACGAGCT TCATGCGGAT
CCGCTGGCGT TCGAAGAAAA GCATGGTTCC CAGGTTGATT TAATTTTTCG TTTTCTTGAT
CACGCCTTTG CAACCGGCGT GCTCGGGTAA
 
Protein sequence
MRVLLRPVLV PELGLVVLKP GRESLPVFHR GRVLVEPEPK NMRALPSGAV PAVRQPLAED 
KSLLPFFSDE RVIRAAGGAG ALSDWLLRHV KSCQWPHGDY HHSETVIHSY GAGAMVLCWH
CDNQLRDQTS ESLEQLTQQN LTAWMIDVIR HVMNGTQERE LSLAELSWWA VCNQVVDALP
EAVSRRSLGL PAEKIRSVYR ESDIIPGEQT ATSILKQRTK NIALPPHTHQ QQNPPQEKTV
VSIAVDPESP ESFMKRPKRR RWVNEKYTRW VKTQPCACCG KPADDPHHLI GHGQGGMGTK
SHDIFTLPLC REHHNELHAD PLAFEEKHGS QVDLIFRFLD HAFATGVLG