Gene ECH74115_4663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4663 
Symbol 
ID6969646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4306188 
End bp4307210 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content54% 
IMG OID643388367 
Productputative hydrolase 
Protein accessionYP_002272795 
Protein GI209400340 
COG category[R] General function prediction only 
COG ID[COG0429] Predicted hydrolase of the alpha/beta-hydrolase fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0421817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.05896 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGA TAACGACGAC CGATGCCAAT GAATTCAGCA GCAGTGCTGA ATTCACCCCT 
ATGCGCGGCT TTAGCAATTG TCATCTGCAA ACCATGCTGC CGCGTCTGTT TCGTCGCAAG
GTGAAATTCA CCCCGTACTG GCAGCGACTG GAGTTGCCCG ACGGCGATTT TGTCGATCTC
GCGTGGAGTG AAGCCCCTGC ACAGGCGCAA CATAAACCTC GTCTGGTGGT GTTTCACGGG
CTGGAAGGCA GTCTCAATAG CCCTTACGCC CACGGTCTGG TTGAGGCGGC GCAAAAGCGC
GGCTGGCTGG GCGTGGTGAT GCATTTTCGC GGATGCAGCG GTGAACCAAA CCGTATGCAC
CGCATTTACC ATTCGGGCGA AACCGAAGAC GCCAGTTGGT TTTTACGCTG GCTGCAACGC
GAGTTTGGCC ATGCGCCAAC GGCTGCCGTC GGCTATTCGC TCGGCGGTAA TATGCTGGCC
TGTTTGCTGG CAAAAGAAGG CAATGATCTC CCGGTTGATG CGGCGGTTAT TGTCTCTGCG
CCGTTTATGC TGGAAGCCTG TAGCTATCAT ATGGAAAAGG GCTTTTCCCG CGTTTATCAG
CGTTACTTGC TGAACCTGTT AAAAGCCAAT GCCGCGCGCA AGCTGGCAGC CTACCCCGGA
ACACTGCCGA TTAATCTCGC GCAGTTAAAA TCGGTACGTC GCATCCGTGA ATTTGACGAT
CTCATCACCG CCAGAATTCA CGGCTACGCT GACGCTATCG ACTATTATCG TCAGTGTAGC
GCCATGCCGA TGCTGAACCG GATCGCCAAA CCGACGCTGA TTATTCACGC CAAAGACGAT
CCGTTTATGG ATCATCAGGT GATCCCGAAA CCGGAAAGTC TCCCCCCGCA GGTGGAGTAT
CAACTGACTG AACATGGCGG TCATGTTGGC TTTATTGGCG GTACATTACT TCATCCGCAA
ATGTGGCTGG AGTCACGCAT TCCTGACTGG TTAACAACGT ATCTGGAGGC GAAATCATGT
TGA
 
Protein sequence
MAQITTTDAN EFSSSAEFTP MRGFSNCHLQ TMLPRLFRRK VKFTPYWQRL ELPDGDFVDL 
AWSEAPAQAQ HKPRLVVFHG LEGSLNSPYA HGLVEAAQKR GWLGVVMHFR GCSGEPNRMH
RIYHSGETED ASWFLRWLQR EFGHAPTAAV GYSLGGNMLA CLLAKEGNDL PVDAAVIVSA
PFMLEACSYH MEKGFSRVYQ RYLLNLLKAN AARKLAAYPG TLPINLAQLK SVRRIREFDD
LITARIHGYA DAIDYYRQCS AMPMLNRIAK PTLIIHAKDD PFMDHQVIPK PESLPPQVEY
QLTEHGGHVG FIGGTLLHPQ MWLESRIPDW LTTYLEAKSC