Gene EcolC_1589 details


Gene Information

Locus tag: EcolC_1589
Symbol:
ID: 6066811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1766151
End bp: 1767116
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 56%
IMG OID: 641601005
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001724575
Protein GI: 170019621
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
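
The gene and protein lengths in the table follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 1766151
END_BP = 1767116


def gene_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 966 321
```

Both results agree with the Gene Length (966 bp) and Protein Length (321 aa) entries.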


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAAC AACGCATTTT TATTGCTGGC CATCGCGGAA TGGTCGGTTC CGCCATCAGG 
CGACAGCTCG AACAGCGCGG TGATGTGGAG CTGGTATTAC GCACCCGCGA TGAACTGAAC
CTGCTGGACA GCCGCGCCGT GCATGATTTC TTTGCCAGCG AAAGCATTGA CCAGGTCTAT
CTGGCGGCGG CGAAAGTGGG CGGCATTGTT GCCAACAACA CCTATCCGGC GGATTTCATC
TACCAGAACA TGATGATTGA GAGCAACATC ATTCACGCCG CGCATCAGAA CGACGTGAAC
AAACTGCTGT TTCTCGGATC GTCCTGCATC TACCCGAAAC TGGCAAAACA GCCGATGGCA
GAAAGCGAGT TGTTGCAGGG CACGCTGGAG CCGACTAACG AGCCTTATGC TATTGCCAAA
ATCGCCGGGA TCAAACTGTG CGAATCATAC AATCGCCAGT ATGGACGCGA TTACCGCTCA
GTCATGCCGA CCAACCTGTA TGGCCCGCAT GACAACTTCC ACCCGAGTAA TTCGCATGTG
ATCCCTGCAT TGCTGCGTCG CTTCCACGAG GCGACGGCAC AGAATGCACC GGACGTGGTG
GTGTGGGGTA GCGGTACACC GATGCGTGAA TTCCTGCACG TCGATGATAT GGCGGCGGCG
AGCATTCATG TCATGGAGCT GGCGCATGAA GTCTGGCTGG AGAACACCCA GCCGATGCTG
TCGCACATTA ACGTCGGCAC GGGCGTTGAC TGCACTATCC GCGAGCTGGC GCAAACCATC
GCCAAAGTGG TGGGTTACAA AGGTCGGGTG GTTTTTGATG CCAGCAAGCC GGATGGTACG
CCGCGCAAAC TGCTGGATGT GACGCGCCTG CATCAGCTTG GCTGGTATCA CGAAATCTCA
CTGGAAGCGG GGCTTGCCAG CACTTACCAG TGGTTCCTTG AGAATCAAGA CCGCTTTCGG
GGGTAA
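
The GC content entry can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the blocked layout used here (groups of ten bases separated by spaces and line breaks) and contains only A, C, G and T. The function name `gc_fraction` is hypothetical.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAGTAAAC AACGCATTTT TATTGCTGGC CATCGCGGAA TGGTCGGTTC CGCCATCAGG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 966 bp sequence instead of the first line gives a value that can be compared with the reported GC content of 56%.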
 
Protein sequence
MSKQRIFIAG HRGMVGSAIR RQLEQRGDVE LVLRTRDELN LLDSRAVHDF FASESIDQVY 
LAAAKVGGIV ANNTYPADFI YQNMMIESNI IHAAHQNDVN KLLFLGSSCI YPKLAKQPMA
ESELLQGTLE PTNEPYAIAK IAGIKLCESY NRQYGRDYRS VMPTNLYGPH DNFHPSNSHV
IPALLRRFHE ATAQNAPDVV VWGSGTPMRE FLHVDDMAAA SIHVMELAHE VWLENTQPML
SHINVGTGVD CTIRELAQTI AKVVGYKGRV VFDASKPDGT PRKLLDVTRL HQLGWYHEIS
LEAGLASTYQ WFLENQDRFR G
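
The protein sequence can be derived from the gene sequence with translation table 11. The sketch below is a hand-rolled illustration, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it assumes the input contains only A, C, G and T. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAGTAAAC AACGCATTTT TATTGCTGGC CATCGCGGAA TGGTCGGTTC CGCCATCAGG"
print(translate(first_line))  # MSKQRIFIAGHRGMVGSAIR
```

The output matches the first twenty residues of the protein sequence. The final six bases of the gene, GGGTAA, encode the C-terminal glycine followed by a stop codon.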