Gene ECH74115_0869 details


Gene Information

Locus tag: ECH74115_0869
Symbol: ybhA
ID: 6969720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 884847
End bp: 885767
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 11
GC content: 51%
IMG OID: 643384894
Product: phosphotransferase
Protein accession: YP_002269394
Protein GI: 209397698
COG category: [R] General function prediction only
COG ID: [COG0561] Predicted hydrolases of the HAD superfamily
TIGRFAM ID: [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily
            [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
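
The listed gene and protein lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed Gene Length, but not stated in the record) and that the CDS ends with a single stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS that ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(884847, 885767)
    print(length)                     # 921
    print(protein_length_aa(length))  # 306
```

Both results agree with the Gene Length (921 bp) and Protein Length (306 aa) fields above.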


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCGGTGA ATGCTGAATT AAAACGCAGA ATTTTGTCCG ATAGTGGTAT CGGTGTACCA 
TTCGCGCAGA AGAAAATTCT TAACCTTAAT CTGGAATACG CCATGACCAC ACGCGTGATT
GCTCTCGACT TAGACGGCAC CTTATTGACC CCGAAAAAGA CCCTGCTACC TTCATCGATA
GAAGCCCTGG CCCGCGCTCG CGAAGCAGGT TATCAATTAA TCATCGTCAC AGGTCGCCAT
CACGTCGCTA TTCATCCTTT TTATCAGGCG CTGGCGCTGG ATACACCTGC TATTTGCTGT
AATGGCACCT ATTTGTATGA TTATCATGCA AAAACCGTGC TGGAAGCGGA CCCAATGCCC
GTTAATAAAG CCCTACAACT CATTGAGATG CTGAATGAAC ACCACATTCA CGGTCTGATG
TATGTCGATG ACGAGATGGT TTATGAGCAC CCGACCGGGC ATGTCATTCG CACGTCTAAC
TGGGCGCAAA CCCTGCCGCC GGAACAGCGT CCGACTTTCA CACAAGTCGC TTCTCTGGCT
GAAACGGCGC AACAAGTTAA CGCCGTATGG AAGTTCGCCC TCACGCACGA TGACCTGCCG
CAATTGCAGC ATTTTGGTAA GCATGTCGAA CATGAACTGG GACTGGAGTG TGAATGGTCC
TGGCACGATC AGGTTGATAT TGCACGCGGC GGCAACAGCA AAGGTAAACG TTTGACGAAA
TGGGTTGAGG CGCAAGGCTG GTCGATGGAA AACGTCGTGG CGTTCGGCGA TAACTTTAAT
GATATCAGTA TGCTGGAGGC CGCTGGTACA GGCGTGGCGA TGGGCAACGC CGATGACGCG
GTAAAAGCGC GCGCCAACAT TGTGATTGGT GATAACACCA CCGACAGCAT TGCCCAGTTC
ATTTATAGCC ACCTGATTTA A
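
The gene sequence is laid out in blocks of 10 bases separated by spaces, so it needs to be cleaned before any computation such as the GC content field. The sketch below shows one way to do that; the helper names are hypothetical, and the example input is only the first line of the sequence, so its result is not expected to match the 51% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases among the A, C, G and T bases of the sequence."""
    seq = clean_sequence(raw)
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for base in counted if base in "GC")
    return gc / len(counted)


if __name__ == "__main__":
    first_line = "TTGGCGGTGA ATGCTGAATT AAAACGCAGA ATTTTGTCCG ATAGTGGTAT CGGTGTACCA"
    print(len(clean_sequence(first_line)))        # number of bases on this line
    print(f"{gc_content(first_line):.1%}")        # GC fraction of this line only
```

Passing the full sequence, with all lines joined, gives the value to compare against the GC content field.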
 
Protein sequence
MAVNAELKRR ILSDSGIGVP FAQKKILNLN LEYAMTTRVI ALDLDGTLLT PKKTLLPSSI 
EALARAREAG YQLIIVTGRH HVAIHPFYQA LALDTPAICC NGTYLYDYHA KTVLEADPMP
VNKALQLIEM LNEHHIHGLM YVDDEMVYEH PTGHVIRTSN WAQTLPPEQR PTFTQVASLA
ETAQQVNAVW KFALTHDDLP QLQHFGKHVE HELGLECEWS WHDQVDIARG GNSKGKRLTK
WVEAQGWSME NVVAFGDNFN DISMLEAAGT GVAMGNADDA VKARANIVIG DNTTDSIAQF
IYSHLI
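
The gene begins with TTG while the protein begins with M, which is what translation table 11 implies when TTG serves as the initiation codon. The sketch below illustrates this under stated assumptions: the codon table is built from the standard NCBI amino-acid string (table 11 assigns the same amino acids as the standard code), the set of initiation codons is the one listed for table 11, and `translate_cds` is a hypothetical helper, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons for translation table 11 (assumed here; all are read as Met at position 1).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiation codon as M and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("TTGGCGGTGA ATGCTGAATT AAAACGCAGA"))  # MAVNAELKRR
```

The output matches the first ten residues of the protein sequence listed above.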