Gene EcSMS35_4849 details


Gene Information

Locus tag: EcSMS35_4849
Symbol: uxuA
ID: 6147201
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4956259
End bp: 4957443
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 54%
IMG OID: 641619653
Product: mannonate dehydratase
Protein accession: YP_001746760
Protein GI: 170683309
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1312] D-mannonate dehydratase
TIGRFAM ID: [TIGR00695] mannonate dehydratase
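
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4956259, 4957443)
print(length, protein_length(length))  # prints "1185 394"
```

Under these assumptions the result agrees with the listed gene length (1185 bp) and protein length (394 aa).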


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.908596
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAGA CCTGGCGCTG GTACGGCCCA AACGATCCGG TTTCTTTAGC TGATGTCCGT 
CAGGCGGGCG CAACTGGCGT GGTTACCGCG CTGCACCATA TCCCGAACGG CGAAGTATGG
TCCGTAGAAG AGATCCTCAA ACGCAAGGCG ATCGTTGAAG ACGCAGGCCT GGTGTGGTCT
GTCGTTGAAA GCGTACCAAT TCACGAAGAT ATCAAAACCC ACACTGGCAA CTATGAGCAG
TGGATTGCTA ACTATCAGCA GACCCTGCGC AACCTGGCGC AGTGCGGCAT TCGCACCGTG
TGCTACAACT TCATGCCGGT GCTCGACTGG ACCCGTACTG ACCTCGAATA CGTGCTGCCA
GACGGCTCCA AAGCTCTGCG CTTCGACCAG ATCGAATTCG CTGCATTCGA AATGCATATC
CTGAAGCGTC CAGGCGCGGA AGCGGATTAC ACCGAAGAAG AAATTGCTCA GGCCGCTGAA
CGCTTCGCCA CTATGAGCGA CGAAGACAAA GCGCGTCTGA CCCGTAACAT CATTGCCGGT
CTGCCAGGTG CGGAAGAAGG GTATACCCTC GACCAGTTCC GTAAGCACCT GGAGCTGTAC
AAAGATATCG ACAAAGCCAA ACTGCGCGAA AACTTTGCTG TCTTCCTGAA AGCGATTATT
CCAGTTGCTG AAGAAGTTGG CGTGCGTATG GCGGTTCACC CGGACGATCC GCCGCGCCCA
ATCCTCGGCC TGCCGCGCAT TGTTTCTACC ATTGAAGATA TGCAGTGGAT GGTTGATACC
GTAAACAGCA TGGCGAACGG TTTCACCATG TGCACCGGTT CCTACGGCGT GCGTGCTGAC
AACGATCTGG TTGATATGAT CAAGCAGTTT GGTCCGCGTA TTTACTTCAC CCATCTGCGC
TCCACCATGC GTGAAGATAA CCCGAAAACC TTCCACGAAG CGGCGCACCT GAACGGTGAC
GTTGATATGT ACGAAGTGGT GAAAGCGATT GTTGAAGAAG AACACCGTCG TAAAGCGGAA
GGCAAAGAAG ACCTGATCCC GATGCGTCCG GACCACGGTC ATCAGATGCT GGACGACCTG
AAGAAGAAAA CCAACCCAGG TTACTCCGCA ATTGGTCGTC TGAAAGGCCT GGCCGAAGTT
CGCGGTGTCG AACTGGCGAT CCAGCGCGCT TTCTTTAGCC GTTAA
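
A GC content figure like the one reported for this gene can be recomputed from a sequence printed in blocks of ten bases. This is a minimal sketch, assuming the value is simply the percentage of G and C bases over the full sequence length; `gc_percent` is a hypothetical helper name, and how the database rounds its value is not stated in this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence above: 12 of 20 bases are G or C.
print(gc_percent("ATGGAACAGA CCTGGCGCTG"))  # prints 60.0
```

Passing the complete gene sequence as one string (line breaks and spaces included) gives the figure for the whole gene.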
 
Protein sequence
MEQTWRWYGP NDPVSLADVR QAGATGVVTA LHHIPNGEVW SVEEILKRKA IVEDAGLVWS 
VVESVPIHED IKTHTGNYEQ WIANYQQTLR NLAQCGIRTV CYNFMPVLDW TRTDLEYVLP
DGSKALRFDQ IEFAAFEMHI LKRPGAEADY TEEEIAQAAE RFATMSDEDK ARLTRNIIAG
LPGAEEGYTL DQFRKHLELY KDIDKAKLRE NFAVFLKAII PVAEEVGVRM AVHPDDPPRP
ILGLPRIVST IEDMQWMVDT VNSMANGFTM CTGSYGVRAD NDLVDMIKQF GPRIYFTHLR
STMREDNPKT FHEAAHLNGD VDMYEVVKAI VEEEHRRKAE GKEDLIPMRP DHGHQMLDDL
KKKTNPGYSA IGRLKGLAEV RGVELAIQRA FFSR
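
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an assumption-laden illustration rather than the database's own method: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGAACAGA CCTGGCGCTG GTACGGCCCA"))  # prints "MEQTWRWYGP"
```

The same function applied to the last five codons of the gene, `TTCTTTAGCCGTTAA`, returns `FFSR`, matching the end of the protein sequence.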