Gene ECH74115_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1966 
Symbol 
ID6967381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1858946 
End bp1860343 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content55% 
IMG OID643385892 
Producthypothetical protein 
Protein accessionYP_002270381 
Protein GI209397259 
COG category[R] General function prediction only 
COG ID[COG3106] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.575033 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAC TTAAAAATGA ACTTAATGCG CTGGTGAATC GGGGTGTCGA CAGACATCTG 
CGCCTCGCCG TAACCGGACT TAGCCGCAGC GGCAAAACGG CGTTTATCAC TGCGATGGTC
AATCAGTTGC TCAATATTCA TGCCGGAGCA CGTTTGCCGC TATTAAGTGC GGTGCGTGAA
GAGCGTCTGC TGGGCGTAAA ACGCATTCCT CAGCGTGACT TTGGCATTCC GCGTTTTACC
TACGACGAAG GGTTGGCGCA GCTGTATGGC GATCCTCCCG CCTGGCCGAC GCCAACGCGC
GGCGTCAGCG AAATTCGCCT GGCACTACGC TATAAATCGA ACGATTCGCT GCTGCGCCAC
TTTAAGGATA CCTCCACGCT GTATCTGGAG ATTGTGGATT ACCCTGGCGA ATGGTTGCTC
GACCTGCCGA TGCTGGCGCA GGACTATTTA AGCTGGTCGC GCCAGATGAC GGGCTTACTC
AATGGTCAGC GCGGCGAATG GTCGGCGAAA TGGCGAATGA TGTGCGAAGG GCTGGACCCG
CTAGCGCCTG CCGACGAAAA CCGGCTGGCA GACATTGCCG CCGCGTGGAC CGATTATCTC
CACCACTGTA AACAGCAGGG GCTGCACTTT ATTCAGCCTG GGCGCTTTGT CTTGCCGGGG
GATATGGCAG GTGCGCCCGC GCTGCAATTC TTCCCGTGGC CGGATGTCGA TGCCTGGGGC
GAGTCCAAAC TGGCGCAGGC CGATAAGCAC ACCAATGCCG GAATGCTGCG CGAGCGGTTT
AATTATTACT GCGAGAAGGT GGTGAAGGGG TTCTATAAGA ATCATTTTCT GCGCTTTGAC
CGCCAGATTG TGCTGGTGGA TTGCCTGCAA CCTCTCAACA GTGGGCCACA GGCATTTAAT
GATATGCGTC TGGCGCTGAC GCAGCTGATG CAAAGTTTCC ACTATGGGCA GCGTACCCTG
TTCAGGCGTT TGTTTTCGCC GGTTATCGAT AAGCTATTGT TTGCTGCCAC TAAAGCGGAC
CATGTGACCA TCGATCAGCA CGCCAATATG GTTTCATTGC TGCAACAACT AATTCAGGAT
GCCTGGCAAA ATGCGGCGTT CGAAGGGATC AGCATGGATT GCCTGGGGCT GGCGTCAGTT
CAGGCGACCA CCAGTGGCAT TATTGATGTT AACGGTGAGA AAATCCCGGC GCTGCGCGGT
AATCGACTTA GCGATGGCGC ACCGCTCACT GTTTATCCTG GCGAAGTTCC CGCACGTTTG
CCTGGTCAGG CGTTCTGGGA TAAGCAAGGG TTCCAGTTTG AAGCGTTTCG CCCGCAGGTG
ATGGATGTCG ACAAACCGCT GCCGCATATT CGTCTTGATG CCGCGCTGGA ATTTTTAATA
GGAGATAAAT TGCGATGA
 
Protein sequence
MKRLKNELNA LVNRGVDRHL RLAVTGLSRS GKTAFITAMV NQLLNIHAGA RLPLLSAVRE 
ERLLGVKRIP QRDFGIPRFT YDEGLAQLYG DPPAWPTPTR GVSEIRLALR YKSNDSLLRH
FKDTSTLYLE IVDYPGEWLL DLPMLAQDYL SWSRQMTGLL NGQRGEWSAK WRMMCEGLDP
LAPADENRLA DIAAAWTDYL HHCKQQGLHF IQPGRFVLPG DMAGAPALQF FPWPDVDAWG
ESKLAQADKH TNAGMLRERF NYYCEKVVKG FYKNHFLRFD RQIVLVDCLQ PLNSGPQAFN
DMRLALTQLM QSFHYGQRTL FRRLFSPVID KLLFAATKAD HVTIDQHANM VSLLQQLIQD
AWQNAAFEGI SMDCLGLASV QATTSGIIDV NGEKIPALRG NRLSDGAPLT VYPGEVPARL
PGQAFWDKQG FQFEAFRPQV MDVDKPLPHI RLDAALEFLI GDKLR