Gene EcSMS35_3435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3435 
Symbol 
ID6145313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3514117 
End bp3515271 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content57% 
IMG OID641618264 
ProductAgaS family sugar isomerase 
Protein accessionYP_001745413 
Protein GI170682634 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2222] Predicted phosphosugar isomerases 
TIGRFAM ID[TIGR02815] putative sugar isomerase, AgaS family 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAAA ATTACACCCC TGCTGCTGCC GCAACCGGTA CATGGACTGA AGAAGAGATC 
CGCCATCAGC CTCGCGCATG GATCCGTTCA CTCACCAACA TCGACGCGCT ACGTTCCGCG
CTCAATAACT TCCTTGAACC GTTACTGCGC AAAGAGAATC TGCGGGTAAT CCTGACCGGA
GCCGGAACCT CGGCATTTAT CGGTGACATC ATCGCGCCGT GGCTCGCCAG CCATACCGGT
AAAAACTTCA GCGCCGTACC GACCACCGAT CTGGTCACTA ATCCGATGGA CTACCTGAAT
CCAGCTCATC CGCTGCTGTT GATCTCCTTC GGTCGATCCG GCAACAGCCC GGAAAGCGTC
GCCGCCGTGG AACTGGCAAA TCAATTTGTA CCAGAATGCT ATCACCTGCC GATCACCTGC
AACGAAGCGG GCGCTCTTTA CCAAAACGCG ATCAACAGCG ACAACGCGTT TGCCCTGCTG
ATGCCCGCAG AAACGCACGA TCGCGGCTTC GCGATGACCA GCAGCATTAC CACCATGATG
GCCAGCTGCC TCGCGGTTTT CGCACCTGAG ACGATCAACA GCCAGAGCTT CCGCGATGTG
GCGGATCGTT GCCAGGCGAT CCTGACCTCA CTGGGCGATT TCAGCGAAGG TGTGTTTGGT
TACGCACCGT GGAAACGGAT CGTTTATCTC GGCAGCGGTG GCTTACAGGG CGCAGCACGC
GAGTCGGCGC TGAAAGTGCT GGAACTGACG GCGGGTAAAC TGGCGGCCTT TTATGATTCC
CCGACCGGAT TCCGTCATGG CCCGAAATCG CTGGTCGATA ACGAAACGCT GGTGGTGGTG
TTTGTCTCCA GCCACCCTTA CACCCGTCAG TATGATCTTG ATCTGCTGGC AGAACTCCGC
CGTGACAACC AGGCATTGCG CGTAATCGCC ATCGCCGCGG AAAGCAACGA CGTTATTACC
GCCGGTCCAC ATATCATCCT GCCGCCGTCC CGTCACTTTA TCGACGTTGA GCAGGCATTT
TGCTTCCTGA TGTACGCCCA GACGTTTGCA CTGATGCAGT CGCTGCACAT GGGCAATACG
CCGGATACCC CATCAGCCAG TGGCACCGTT AACCGCGTGG TGCAAGGCGT AATCATTCAT
CCGTGGCAGG CATAA
 
Protein sequence
MPENYTPAAA ATGTWTEEEI RHQPRAWIRS LTNIDALRSA LNNFLEPLLR KENLRVILTG 
AGTSAFIGDI IAPWLASHTG KNFSAVPTTD LVTNPMDYLN PAHPLLLISF GRSGNSPESV
AAVELANQFV PECYHLPITC NEAGALYQNA INSDNAFALL MPAETHDRGF AMTSSITTMM
ASCLAVFAPE TINSQSFRDV ADRCQAILTS LGDFSEGVFG YAPWKRIVYL GSGGLQGAAR
ESALKVLELT AGKLAAFYDS PTGFRHGPKS LVDNETLVVV FVSSHPYTRQ YDLDLLAELR
RDNQALRVIA IAAESNDVIT AGPHIILPPS RHFIDVEQAF CFLMYAQTFA LMQSLHMGNT
PDTPSASGTV NRVVQGVIIH PWQA