Gene ECH74115_2381 details


Gene Information

Locus tag: ECH74115_2381
Symbol:
ID: 6970786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2253137
End bp: 2254741
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 47%
IMG OID: 643386254
Product: hypothetical protein
Protein accession: YP_002270736
Protein GI: 209400503
COG category: [S] Function unknown
COG ID: [COG4529] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
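
The lengths listed in this record can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function and variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Gene Information fields of this record.
start_bp = 2253137
end_bp = 2254741


def feature_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


gene_length = feature_length(start_bp, end_bp)

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # matches the Gene Length field
print(protein_length, "aa")   # matches the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1605 bp) and Protein Length (534 aa) fields.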


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000071971
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.00180659
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGAAAAAAA TTGCAATTGT GGGTGCCGGG CCTACGGGGA TCTACACCTT ATTCTCGCTT 
CTACAGCAAC AAACTCTACT TTCTATTTCT ATCTTCGAGC AGGCTGACGA GGCCGGTGTC
GGGATGCCAT ACAGTGATGA GGAAAACTCA AAAATGATGC TGGCAAATAT TGCCAGTATT
GAAATACCGC CGATTTATTG TACGTATCTC GAATGGCTAC AAAAGCAAGA AGCCAGTCAT
CTCCAGCGTT ATGGCGTTAA AAAAGAAACC TTGCACGATC GTCAGTTTTT ACCGCGAATT
CTGCTGGGCG AATATTTCCG CGATCAATTT TTACGATTAG TAGACCAGGC ACGAAAGCAA
AAATTTGCAG TGGCTGTTTA TGAATCATGC CAGGTTACCG ATCTGCAAAT TACAAATGCT
GGCGTCATGC TCGCTACAAA TCAGGATTTA CCCAGCGAGA CGTTTGATTT AGCGGTGATC
GCCACGGGTC ACGTCTGGCC TGATGAAGAA GAAGCAACCC GAACGTATTT TCCAAGCCCG
TGGTCAGGCT TGATGGAAGC AAAGGTCGAT GCGTGTAACG TGGGTATTAT GGGAACATCC
TTGAGCGGAC TGGATGCGGC AATGGCAGTG GCTATTCAGC ATGGTTCGTT CATTGAAGAT
GATAAACAAC ACGTCGTTTT TCACCGCGAT AACGCAAGTG AAAAGCTAAA TATTACGTTA
ATGTCGCGCA CGGGTATTTT ACCCGAAGCC GATTTCTATT GCCCTATTCC CTACGAGCCC
TTACACATCG TCACTGATCA GGCATTAAAT GCTGAGATTC AAAAAGGCGA ATATGGCCTT
TTGGATCGGG TATTTAGATT GATAGTAGAG GAAATCAAGT TTGCTGATCC AGACTGGAGT
CAACGCATAG CCTTAGAGAG CCTGAATGTC GATTCCTTTG CTCAAGCCTG GTTTGCCGAG
CGCAAACAAC GCGACCAATT TGACTGGGCA GAAAAAAATC TCCAGGAAGT CGAACGCAAT
AAACGAGAAA AACATACTGT TCCCTGGCGT TATGTCATTC TGCGCCTGCA TGAAGCCGTA
CAGGAAATTG TTCCACATCT GAATGAACAC GACCATAAAC GGTTCAGTAA AGGCCTTGCC
CGGGTTTTCA TCGATAATTA TGCGGCAATC CCTTCAGAGT CTATTCGTCG CCTACTTGCC
TTACGTGAAG CGGGAATCAT TCATATTCTC GCTCTCGGTG AAGACTACAA AATGGAAATT
AACGAGTCGC GCACCGTCCT GAAAACGGAA GACAACAGCT ACTCGTTTGA CGTTTTTATT
GATGCCCGCG GGCAGCGTCC GCTTAAAGTG AAAGATATTC CTTTCCCTGG ACTACGCGAA
CAATTACAGA AAACAGGGGA TGAAATCCCT GATGTTGGTG AAGATTATAC GTTACAGCAA
CCCGAAGATA TTCGTGGGCG CGTAGCGTTC GGCGCGTTGC CCTGGTTGAT GCACGACCAG
CCTTTCGTTC AGGGACTTAC GGCATGTGCA GAAATTGGTG AGGCGATGGC TCGGGCGGTC
GTAAAGCCTG CATCCCGTGC TCGTCGGCGT CTTTCGTTTG ATTAA
 
Protein sequence
MKKIAIVGAG PTGIYTLFSL LQQQTLLSIS IFEQADEAGV GMPYSDEENS KMMLANIASI 
EIPPIYCTYL EWLQKQEASH LQRYGVKKET LHDRQFLPRI LLGEYFRDQF LRLVDQARKQ
KFAVAVYESC QVTDLQITNA GVMLATNQDL PSETFDLAVI ATGHVWPDEE EATRTYFPSP
WSGLMEAKVD ACNVGIMGTS LSGLDAAMAV AIQHGSFIED DKQHVVFHRD NASEKLNITL
MSRTGILPEA DFYCPIPYEP LHIVTDQALN AEIQKGEYGL LDRVFRLIVE EIKFADPDWS
QRIALESLNV DSFAQAWFAE RKQRDQFDWA EKNLQEVERN KREKHTVPWR YVILRLHEAV
QEIVPHLNEH DHKRFSKGLA RVFIDNYAAI PSESIRRLLA LREAGIIHIL ALGEDYKMEI
NESRTVLKTE DNSYSFDVFI DARGQRPLKV KDIPFPGLRE QLQKTGDEIP DVGEDYTLQQ
PEDIRGRVAF GALPWLMHDQ PFVQGLTACA EIGEAMARAV VKPASRARRR LSFD
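
The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below is a minimal, hand-rolled translation of the first and last rows of the gene sequence. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in which start codons it permits). All names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAAAAAAA TTGCAATTGT GGGTGCCGGG CCTACGGGGA TCTACACCTT ATTCTCGCTT"
# The 26 preceding rows hold 1560 bases (a multiple of 3), so this row starts in frame.
last_row = "GTAAAGCCTG CATCCCGTGC TCGTCGGCGT CTTTCGTTTG ATTAA"

print(translate(first_row))  # MKKIAIVGAGPTGIYTLFSL
print(translate(last_row))   # VKPASRARRRLSFD*
```

Both results agree with the start and end of the protein sequence listed above, with the trailing `*` corresponding to the TAA stop codon.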