Gene ECH74115_4453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4453 
Symbol 
ID6969717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4126599 
End bp4127753 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content56% 
IMG OID643388173 
Productputative sugar isomerase, AgaS family 
Protein accessionYP_002272610 
Protein GI209400641 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2222] Predicted phosphosugar isomerases 
TIGRFAM ID[TIGR02815] putative sugar isomerase, AgaS family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.24237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAAA ATTACACCCC TGCTGCCGCC GCAACCGGTA CATGGACTGA AGAAGAGATC 
CGCCATCAGC CTCGCGCATG GATCCGTTCG CTCACCAACA TCGACGCGCT ACATTCCGCG
CTTAATAACT TCCTTGAACC GTTACTGCGC AAAGAGAATC TGCGGATCAT CCTGACCGGA
GCCGGAACCT CGGCATTTAT CGGTGACATC ATCGCGCCGT GGCTCGCCAG CCATACCGGT
AAAAACTTCA GCGCCGTACC GACCACCGAT CTGGTCACTA ATCCGATGGA CTACCTGAAC
CCAGCCCATC CGCTGCTGTT GATCTCCTTC GGTCGATCCG GCAACAGCCC GGAAAGCGTC
GCAGCCGTGG AACTGGCAAA TCAATTTGTA CCGGAATGCT ATCACCTGCC GATCACCTGC
AACGAAGCGG GCGCTCTTTA CCAAAACGCG ATCAACAGCG ATAACGCGTT TGCCGTGCTG
ATGCCCGCAG AAACGCACGA TCGCGGCTTT GCGATGACCA GCAGCATTAC CACCATGATG
GCCAGCTGCC TCGCGGTTTT CGCACCTGAG ACGATCAACA GCCAAACCTT CCGCGACGTG
GCGGATCGTT GCCAGGCGAT CCTGACCTCA CTGGGCGATT TCAGCGAAGG TGTGTTTGGT
TACGCACCGT GGAAACGGAT CGTTTATCTC GGCAGCGGTG GCTTACAGGG CGCAGCACGC
GAGTCGGCGC TGAAAGTGCT GGAACTGACG GCGGGTAAAC TGGCGGCCTT TTATGATTCT
CCAACCGGAT TCCGTCATGG ACCAAAATCG CTGGTCGATA ACGAAACACT GGTGGTGGTA
TTTGTCTCCA GCCACCCTTA CACCCGTCAG TATGATCTTG ATCTGCTGGC TGAACTTCAC
CGTGACAACC AGGCAATGCG TGTAATCGCC ATCGCCGCGG AAAGCAGCGA CATCGTCGCT
GCCGGTCCAC ATATCATCCT GCCACCGTCA CGTCACTTTA TCGACGTTGA GCAGGCATTT
TGCTTCCTGA TGTACGCCCA GACGTTTGCA CTGATGCAGT CGCTGCACAT GGGCAATACG
CCGGATACCC CATCAGCCAG TGGCACCGTT AACCGCGTGG TGCAAGGCGT AATCATTCAT
CCGTGGCAGG CATAA
 
Protein sequence
MPENYTPAAA ATGTWTEEEI RHQPRAWIRS LTNIDALHSA LNNFLEPLLR KENLRIILTG 
AGTSAFIGDI IAPWLASHTG KNFSAVPTTD LVTNPMDYLN PAHPLLLISF GRSGNSPESV
AAVELANQFV PECYHLPITC NEAGALYQNA INSDNAFAVL MPAETHDRGF AMTSSITTMM
ASCLAVFAPE TINSQTFRDV ADRCQAILTS LGDFSEGVFG YAPWKRIVYL GSGGLQGAAR
ESALKVLELT AGKLAAFYDS PTGFRHGPKS LVDNETLVVV FVSSHPYTRQ YDLDLLAELH
RDNQAMRVIA IAAESSDIVA AGPHIILPPS RHFIDVEQAF CFLMYAQTFA LMQSLHMGNT
PDTPSASGTV NRVVQGVIIH PWQA