Gene EcolC_3742 details

Gene Information

Locus tag: EcolC_3742
Symbol:
ID: 6068116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4093533
End bp: 4094702
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 55%
IMG OID: 641603158
Product: mannonate dehydratase
Protein accession: YP_001726678
Protein GI: 170021724
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1312] D-mannonate dehydratase
TIGRFAM ID: [TIGR00695] mannonate dehydratase
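
The coordinates and lengths in this record agree with one another, which a short calculation can confirm. The sketch below assumes that the start and end positions are inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4093533, 4094702)
print(length, protein_length(length))  # 1170 389
```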


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.179437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACAGA CCTGGCGCTG GTACGGCCCA AACGATCCGG TTTCTTTAGC TGATGTCCGT 
CAGGCGGGCG CAACTGGCGT GGTTACCGCG CTGCACCATA TCCCGAACGG CGAAGTATGG
TCCGTAGAAG AGATCCTCAA ACGCAAGGCG ATCGTTGAAG ACGCAGGCCT GGTGTGGTCT
GTCGTAGAAA GCGTACCAAT TCACGAAGAT ATCAAAACCC ACACTGGCAA CTATGAGCAG
TGGATCGCTA ACTATCAGCA GACTCTGCGC AACCTGGCGC AGTGCGGCAT TCGCACCGTG
TGCTACAACT TCATGCCGGT GCTCGACTGG ACCCGTACTG ACCTCGAATA CGTGCTGCCA
GACGGCTCCA AAGCTCTGCG CTTCGACCAG ATCGAATTCG CTGCATTCGA AATGCATATC
CTGAAACGTC CAGGCGCGGA AGCGGATTAC ACCGAAGAAG AAATTGCTCA GGCTGCTGTT
CGCTTCGCCA CTATGAGCGA CGAAGACAAA GCGCGTCTGA CGCGTAACAT CATTGCCGGT
CTGCCGGGCG CGGAAGAAGG GTATACCCTC GACCAGTTCC GTAAGCACCT GGAGCTGTAC
AAAGATATCG ATAAAGCGAA GCTGCGCGAA AACTTTGCCG TCTTCCTGAA AGCGATTATT
CCAGTTGCTG AAGAAGTTGG CGTGCGTATG GCGGTTCACC CGGACGATCC GCCGCGCCCG
ATCCTTGGCC TGCCGCGCAT TGTTTCCACC ATTGAAGATA TGCAGTGGAT GGTTGATACC
GTAAACAGCA TGGCGAACGG TTTCACCATG TGCACCGGTT CCTACGGCGT GCGTGCTGAC
AACGATCTGG TTGATATGAT CAAGCAGTTC GGTCCGCGTA TTTACTTCAC CCATCTGCGC
TCCACCATGC GTGAAGATAA CCCGAAAACC TTCCACGAAG CGGCGCACCT GAACGGTGAC
GTTGATATGT ACGAAGTGGT GAAAGCGATT GTTGAAGAAG AACACCGTCG TAAAGCGGAA
GGCAAAGAAG ACCTGATCCC GATGCGTCCG GACCACGGTC ATCAGATGCT GGACGACCTG
AAGAAGAAAA CCAACCCAGG TTACTCCGCA ATTGGTCGTC TGAAAGGCCT GGCCGAAGTT
CGCGGTACTG GGACTGGCTC AGGCGCGTGA
 
Protein sequence
MEQTWRWYGP NDPVSLADVR QAGATGVVTA LHHIPNGEVW SVEEILKRKA IVEDAGLVWS 
VVESVPIHED IKTHTGNYEQ WIANYQQTLR NLAQCGIRTV CYNFMPVLDW TRTDLEYVLP
DGSKALRFDQ IEFAAFEMHI LKRPGAEADY TEEEIAQAAV RFATMSDEDK ARLTRNIIAG
LPGAEEGYTL DQFRKHLELY KDIDKAKLRE NFAVFLKAII PVAEEVGVRM AVHPDDPPRP
ILGLPRIVST IEDMQWMVDT VNSMANGFTM CTGSYGVRAD NDLVDMIKQF GPRIYFTHLR
STMREDNPKT FHEAAHLNGD VDMYEVVKAI VEEEHRRKAE GKEDLIPMRP DHGHQMLDDL
KKKTNPGYSA IGRLKGLAEV RGTGTGSGA
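
The gene and protein sequences above can be cross-checked against each other, and the GC content recomputed, with a small script. This is a minimal sketch that assumes the sequence is pasted as plain text with whitespace between the 10-base blocks. It uses the standard codon assignments, which translation table 11 shares for internal codons; table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGAACAGA CCTGGCGCTG GTACGGCCCA"))  # MEQTWRWYGP
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, or passing it to `gc_fraction` and comparing with the listed GC content, gives a quick consistency check on the record.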