Gene ECH74115_3444 details


Gene Information

Locus tag: ECH74115_3444
Symbol:
ID: 6970258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3189240
End bp: 3190133
Gene Length: 894 bp
Protein Length: 297 aa
Translation table: 11
GC content: 55%
IMG OID: 643387250
Product: NAD dependent epimerase/dehydratase family protein
Protein accession: YP_002271713
Protein GI: 209400621
COG category: [R] General function prediction only
COG ID: [COG1090] Predicted nucleoside-diphosphate sugar epimerase
TIGRFAM ID: [TIGR01777] conserved hypothetical protein TIGR01777
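
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive (an assumption that reproduces the listed 894 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3189240, 3190133)
print(length)                     # 894
print(protein_length_aa(length))  # 297
```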


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0131327
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATAG TGATCACCGG AGGGACGGGA TTAATTGGTC GCCATTTGAT TCCACGTTTG 
CTGGAGCTGG GCCATCAAAT CACGGTAATG ACGCGTAACC CGCAGAAAGC CAGTTCCGTT
CTCGGCCCTC GGGTGACACT ATGGCAAGGG CTTGCCGATC AAAGCAACCT CAACGGCATT
GATGCGGTAA TCAACCTGGC CGGAGAACCG ATTGCTGATA AACGTTGGAC TCACGAGCAA
AAAGAGCGTC TCTGCCAAAG CCGCTGGAAT ATCACGCAAA AACTGGTCGA TTTGATTAAT
GCCAGCGACA CGCCACCGTC GGTACTCATT TCCGGCTCGG CAACGGGCTA TTATGGCGAC
TTAGGTGAAG TGGTGGTTAC CGAAGAGGAA CCGCCGCATA ACAAATTTAC CCATAAACTC
TGCGCCCGCT GGGAAGAAAT TGCCTGTCGG GCGCAAAGTG ACAAAACGCG AGTGTGCCTG
CTGCGTACCG GCGTAGTGCT GGCACCGGAT GGCGGTATTC TCGGTAAAAT GCTGCCGCCG
TTTCGTCTTG GCCTGGGCGG GCCGATTGGT TCCGGTCGGC AGTATCTGGC CTGGATTCAT
ATCGATGATA TGGTCAACGG CATTCTCTGG CTGCTGGATA ACGAGCTGCG CGGGCCATTT
AATATGGTTT CGCCCTACCC GATACGCAAT GAACAATTTG CCCATGCGCT CGGTCATGCG
CTGCATCGCC CGGCTATTTT GCGCGTCCCC GCAACCGCCA TTCGGCTGTT AATGGGCGAA
TCTTCAGTAC TGGTATTAGG TGGACAACGC GCGCTGCCTA AACGGCTGGA AGAAGCGGGT
TTTGCGTTTC GCTGGTACGA TTTAGAAGAG GCGCTGGCGG ATGTCGTTCG CTGA
 
Protein sequence
MNIVITGGTG LIGRHLIPRL LELGHQITVM TRNPQKASSV LGPRVTLWQG LADQSNLNGI 
DAVINLAGEP IADKRWTHEQ KERLCQSRWN ITQKLVDLIN ASDTPPSVLI SGSATGYYGD
LGEVVVTEEE PPHNKFTHKL CARWEEIACR AQSDKTRVCL LRTGVVLAPD GGILGKMLPP
FRLGLGGPIG SGRQYLAWIH IDDMVNGILW LLDNELRGPF NMVSPYPIRN EQFAHALGHA
LHRPAILRVP ATAIRLLMGE SSVLVLGGQR ALPKRLEEAG FAFRWYDLEE ALADVVR
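
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It builds the codon table by hand rather than using a bioinformatics library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard assignments are used here. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first 60 bases of the gene sequence are used in the example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAACATAG TGATCACCGG AGGGACGGGA TTAATTGGTC GCCATTTGAT TCCACGTTTG"
print(translate(first_line))  # MNIVITGGTGLIGRHLIPRL
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison with the protein sequence and the GC content field listed in the record.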