Gene ECH74115_1924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1924 
Symbol 
ID6966873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1816694 
End bp1817821 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content51% 
IMG OID643385855 
Producthypothetical protein 
Protein accessionYP_002270344 
Protein GI209400889 
COG category[S] Function unknown 
COG ID[COG4950] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0400808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00000000100313 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACAAC GCCACATCAC CGGCAAAAGC CACTGGTATC ATGAAACGCA ATCCAGTACT 
ACGGAGTATG ACGTTCTGCC TCTGGTCCCG GAAGCCGCAA AGGTCAGCGA TCCCTTTCTA
CTCGACGTGA TCCTTGAAAA AGAAACGCTG GCCCCCTTCC TTTCATGGCT GGACCCTGCG
CGTGTTCTTG CAGTGGAGTT GTTCCCTGAC CAGCTTACCG TGACCCGTTC ACAGACCTTC
ACCGCTTATG AACGCTTGTC GACGGCCCTG ACGGTTGCTC AGGTTTGCGG CGTCCAGCGG
TTATGTAACT ACTATTCGGC GCGACTTACG CCGCTCCCCG GGCCTGATTC CACCAGGGAA
AGTAATCATC GGTTGGCACA AATCACGCAA TATGCCCGCC AACTGGCTAG CTCGCCTTCT
ATTATCGACA ACCGATCGCG CCAGCATCTG AATGACGTCG GTCTTACTGC CTGGGACTGT
GTGATCATTA ACCAAATCAT TGGTTTTATT GGCTTTCAGG CGCGGACAAT TGCGACATTT
CAGGCTTATC TCGGGCATCC GGTACGCTGG TTACCCGGGC TGGAGATACA AAACTACGCC
GACGCGTCAC TGTTTGCTGA TGAATCATTA CGCTGGCGAA GCAGCTATGA GGTGGAAAAA
CTACCTGAAG AACACACAAA AAGTTCAACT GCAGAACTTT GCCAACTGGC CGAAATACTC
TCTCTCCACC CTATTTCACT TTCCCTTCTC GAAAGGTTGT TAAACAGCAC ACGGGTTAAT
ACACAGCCGG ATAATCAGCT TGCGGCGTTG TTATGCGCGC GGATAAATGG CAGTCCTGCT
TGTTTTGCCG CCTGTATGGA TTCATCAAAT GAATATAAAA AAATCAGCCC CCTTCTGCGC
AAGGGCGAAA ATGAAATTAA CCAATGGGCT GACCGTCATT CTGTTGAGCG CGCTACCGTT
CAGGCGATAC AATGGCTGAC CCGAGCACCC GATCGCTTTA GCGCCGCCCA GTTCAGCCCT
TTACTCGAAC ACGAAAAATC ATCAACGCAG ATTATTAATC TGCTGGTATG GAGCGGGCTG
TGTGGCTGGA TAAATCGCTT AAAAATCGCG TTGGGTGAGA CATATTAA
 
Protein sequence
MEQRHITGKS HWYHETQSST TEYDVLPLVP EAAKVSDPFL LDVILEKETL APFLSWLDPA 
RVLAVELFPD QLTVTRSQTF TAYERLSTAL TVAQVCGVQR LCNYYSARLT PLPGPDSTRE
SNHRLAQITQ YARQLASSPS IIDNRSRQHL NDVGLTAWDC VIINQIIGFI GFQARTIATF
QAYLGHPVRW LPGLEIQNYA DASLFADESL RWRSSYEVEK LPEEHTKSST AELCQLAEIL
SLHPISLSLL ERLLNSTRVN TQPDNQLAAL LCARINGSPA CFAACMDSSN EYKKISPLLR
KGENEINQWA DRHSVERATV QAIQWLTRAP DRFSAAQFSP LLEHEKSSTQ IINLLVWSGL
CGWINRLKIA LGETY