Gene ECH74115_1960 details

Gene Information

Locus tag: ECH74115_1960
Symbol:
ID: 6972158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1851661
End bp: 1852716
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 55%
IMG OID: 643385886
Product: gfo/idh/mocA family protein
Protein accession: YP_002270375
Protein GI: 209397813
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
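
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1851661, 1852716)
print(length_bp)                  # 1056
print(protein_length(length_bp))  # 351
```

Both results agree with the Gene Length and Protein Length fields of this record.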


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.430175
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.910454
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAAGTG CAATGACAAG CTCTCCACTG CGGGTCGCGA TAATAGGCGC AGGCCAGGTG 
GCGGATAAGG TTCATGCTTC GTACTACTGC ACCCGCAACG ATCTGGAACT GGTGGCTGTC
TGTGACAGCC GCCTTTCCCA GGCGCAGGCG CTGGCAGAAA AATACGGGAA TGCATCCGTG
TGGGACGATC CGCAGGCCAT GCTGCTGGCG GTGAAACCTG ATGTGGTTAG CGTCTGCTCA
CCTAACCGTT TTCATTACGA ACATACCCTG ATGGCGCTGG AAGCGGGCTG CCATGTGATG
TGCGAAAAAC CGCCCGCCAT GACGCCAGAA CAGGCGCGGG AAATGTGCGA TACCGCGCGC
AAACAGGGCA AGGTGCTGGC CTACGACTTT CACCATCGTT TTGCGCTCGA TACGCAACAG
CTGCGTGAAC AGGTGACCAA CGGCGTTTTG GGAGAGATTT ACGTTACCAC CGCCCGCGCC
CTGCGTCGCT GCGGCGTTCC CGGCTGGGGT GTCTTTACCA ATAAAGAACT GCAGGGTGGT
GGCCCGCTGA TCGACATCGG CATTCATATG CTGGATGCTG CGATGTATGT GCTGGGCTAT
CCTGCGGTGA AAAGCGTGAA TGCGCATAGC TTTCAAAAGA TCGGCACGCA AAAGAGCTGT
GGTCAATTTG GCGAGTGGGA TCCGGCAACT TACAGCGTCG AAGATTCGCT GTTTGGCACC
ATTGAATTTC ATAACGGCGG CATTCTGTGG CTGGAAACGT CATTTGCGCT CAACATCCGC
GAACAGTCGA TTATGAACGT CAGCTTTTGT GGTGATAAAG CCGGTGCGAC GCTGTTTCCA
GCACATATCT ACACCGATAA CAACGGTGAA TTAATGACGC TGATGCAACG GGAAATGGCA
GACGACAACC GCCATTTGGG CAGCATGGAA GCCTTTATCA ATCACGTACA GGGCAAGCCC
GTGATGATAG CCGACGCCGA GCAGGGGTAC ATCATCCAGC AACTGGTGGC GGCGTTATAT
CAATCCGCAG AAACAGGGAC GCGTGTGGAA TTATGA
 
Protein sequence
MKSAMTSSPL RVAIIGAGQV ADKVHASYYC TRNDLELVAV CDSRLSQAQA LAEKYGNASV 
WDDPQAMLLA VKPDVVSVCS PNRFHYEHTL MALEAGCHVM CEKPPAMTPE QAREMCDTAR
KQGKVLAYDF HHRFALDTQQ LREQVTNGVL GEIYVTTARA LRRCGVPGWG VFTNKELQGG
GPLIDIGIHM LDAAMYVLGY PAVKSVNAHS FQKIGTQKSC GQFGEWDPAT YSVEDSLFGT
IEFHNGGILW LETSFALNIR EQSIMNVSFC GDKAGATLFP AHIYTDNNGE LMTLMQREMA
DDNRHLGSME AFINHVQGKP VMIADAEQGY IIQQLVAALY QSAETGTRVE L
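
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can serve as a start codon. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. It assumes the standard codon assignments and the table 11 start-codon set as listed by NCBI; the helper names (`translate_cds`, `gc_percent`) are hypothetical, and this is not the pipeline that produced the record.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order (standard assignments, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons of translation table 11 (assumption: as listed by NCBI).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS; a recognized start codon is read as M, a stop ends it."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


first_30_nt = "GTGAAAAGTG CAATGACAAG CTCTCCACTG"
print(translate_cds(first_30_nt))  # MKSAMTSSPL
```

The output matches the first ten residues of the protein sequence above. Applying `gc_percent` to the full gene sequence would give a value to compare against the GC content field of the record.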