Gene ECH74115_5622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5622 
Symbol 
ID6972247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5256784 
End bp5259012 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content51% 
IMG OID643389256 
Producthypothetical protein 
Protein accessionYP_002273653 
Protein GI209400238 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.889676 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACACAC AGACCCTGTA TGAGTTAAGT CAGGAGGCTG AACGGCTGTT ACAGCTTTCT 
CGCCAACAGT TGCAGTTACT GGAAAAAATG CCTCTCTCTG TACCCGGAGA CGATGCGCCA
CAACTGGCTT TACCCTGGAG TCAGCCTAAT ATCGCCGAGC GTCACGCGAT GCTGAATAAT
GAGTTGCGTA AAATTTCCCG ACTGGAAATG GTGCTGGCGA TTGTCGGTAC CATGAAAGCA
GGGAAATCAA CCACCATTAA TGCCATTGTT GGTACGGAGG TACTGCCTAA TCGCAATCGC
CCAATGACTG CGCTGCCGAC GCTTATTCGC CATACGCCCG GCCAAAAGGA ACCGGTACTG
CATTTTTCAC ATGTCGCGCC AATCGATTGT TTAATTCAAA AATTACAACA GCGCCTGCGT
GATTGCGATA TTAAGCATCT GACCGATGTG CTGGAAATAG ATAAAGATAT GCGTGCGCTT
ATGCAGCGGA TCGAAAATGG CGTCGCTTTC GAAAAATATT ATCTGGGTGC CCAGCCTATT
TTTCATTGTC TGAAAAGTTT GAATGATTTG GTGCGACTGG CGAAGGCGCT GGACGTCGAT
TTTCCTTTTT CTGCTTACGC CGCCATTGAG CATATTCCCG TGATTGAAGT GGAGTTTGTC
CATCTGGCGG GGCTGGAAAG TTATCCCGGT CAATTGACGT TACTGGATAC CCCCGGGCCA
AATGAAGCCG GGCAACCGCA TCTGCAAAAA ATGCTTAACC AGCAGCTGGC ACGCGCCTCG
GCGGTACTGG CGGTGCTGGA TTATACGCAA CTGAAATCGA TCTCCGATGA AGAGGTCCGT
GAGGCGATTT TGGCGGTGGG GCAATCGGTG CCGCTGTATG TGCTGGTCAA TAAGTTCGAT
CAACAGGATC GTAACAGTGA CGACGCCGAC CAGGTGCGGG CACTGATTTC CGGGACGCTG
ATGAAAGGCT GTATTACGCC ACAGCAGATA TTTCCGGTGT CGTCGATGTG GGGCTACCTG
GCAAATCGGG CGCGCCATGA GTTAGCCAAC AACGGTAAGT TACCACCGCC AGAGCAACAA
CGCTGGGTGG AAGATTTTGC CCATGCCGCG CTCGGCAGGC GCTGGCGTCA TGCCGATCTG
GCGGACCTCG AACATATTCG TCATGCTGCC GATCAGTTGT GGGAAGATTC GCTGTTCGCC
CAGCCAATTC AGATGTTGCT TCATGCCGCT TACGCTAACG CCTCGTTGTA TGCTCTGCGA
TCTGCCGCGC ATAAACTGTT GAATTACGCG CAGCAGGCGC GGGAATACCT GGATTTTCGT
GCGCACGGGT TAAACGTCGC TTGTGAACAA TTGCGGCAAA ATATCCACCA GATCGAAGAA
AGTTTGCAGC TATTGCAACT CAATCAGGCG CAGGTGAGCG GCGAGATTAA ACATGAAATC
GAGCTGGCCC TGACCTCCGC CAACCACTTT CTGCGTCAAC AGCAAGATGC GCTGAAGGTG
CAGTTAGCCG CCTTGTTTCA GGATGATTCG GAGCCGTTAA GCGAGATTCG TACCCGCTGT
GAGACACTGT TACAGACGGC GCAGAACACC ATCAGTCGCG ACTTTACGCT GCGTTTTGCC
GAGCTTGAAT CCACCCTTTG CCGGGTGTTA ACCGATGTTA TTCGCCCCAT TGAGCAACAA
GTCAAAATGG AATTGAGCGA GTCAGGGTTT CGTCCTGGGT TTCATTTTCC TGTTTTTCAC
GGCGTAGTTC CCCACTTCAA CACTCGCCAG CTGTTCAGTG AAGTCATTTC GCGCCAGGAA
GCAACGGACG AGCAGAGCAC GCGATTAGGC GTTGTGCGTG AGACTTTTTC GCGCTGGTTG
AATCAGCCTG ACTGGGGACG GGGAAATGAG AAATCCCCGA CAGAAACGGT TGATTACAGT
GTGTTGCAAC GAGCATTAAG CGCAGAAGTC GATCTTTATT GCCAACAAAT GGCTAAAGTT
CTGGCAGAGC AGGTCGATGA ATCTGTTACG GCAGGCATGA ATACTTTTTT CGCTGAGTTC
GCTTCATGTT TGACGGAATT ACAGACGCGT TTACGCGAAA GTCTGGCTCT GCGTCAACAA
AATGAATCGG TGGTCAGGCT GATGCAACAG CAATTGCAGC AGACTGTGAT GACTCACGGC
TGGATTTACA CCGACGCTCA GCTGTTACGC GATGATATTC AAACACTTTT CACGGCAGAA
CGATATTGA
 
Protein sequence
MYTQTLYELS QEAERLLQLS RQQLQLLEKM PLSVPGDDAP QLALPWSQPN IAERHAMLNN 
ELRKISRLEM VLAIVGTMKA GKSTTINAIV GTEVLPNRNR PMTALPTLIR HTPGQKEPVL
HFSHVAPIDC LIQKLQQRLR DCDIKHLTDV LEIDKDMRAL MQRIENGVAF EKYYLGAQPI
FHCLKSLNDL VRLAKALDVD FPFSAYAAIE HIPVIEVEFV HLAGLESYPG QLTLLDTPGP
NEAGQPHLQK MLNQQLARAS AVLAVLDYTQ LKSISDEEVR EAILAVGQSV PLYVLVNKFD
QQDRNSDDAD QVRALISGTL MKGCITPQQI FPVSSMWGYL ANRARHELAN NGKLPPPEQQ
RWVEDFAHAA LGRRWRHADL ADLEHIRHAA DQLWEDSLFA QPIQMLLHAA YANASLYALR
SAAHKLLNYA QQAREYLDFR AHGLNVACEQ LRQNIHQIEE SLQLLQLNQA QVSGEIKHEI
ELALTSANHF LRQQQDALKV QLAALFQDDS EPLSEIRTRC ETLLQTAQNT ISRDFTLRFA
ELESTLCRVL TDVIRPIEQQ VKMELSESGF RPGFHFPVFH GVVPHFNTRQ LFSEVISRQE
ATDEQSTRLG VVRETFSRWL NQPDWGRGNE KSPTETVDYS VLQRALSAEV DLYCQQMAKV
LAEQVDESVT AGMNTFFAEF ASCLTELQTR LRESLALRQQ NESVVRLMQQ QLQQTVMTHG
WIYTDAQLLR DDIQTLFTAE RY