Gene EcolC_0609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0609 
Symbol 
ID6066450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp652604 
End bp654091 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content54% 
IMG OID641600015 
ProductAltronate dehydratase 
Protein accessionYP_001723612 
Protein GI170018658 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2721] Altronate dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATACA TCAAGATCCA TGCGCTGGAT AACGTCGCGG TCGCTTTAGC GGATTTGGCT 
GAAGGCACAG AAGTCAGTGT CGATAACCAA ACTGTTAGGC TGCGCCAGGA TGTTGCTCGT
GGACATAAAT TTGCGTTAAC GAATATCGCA AAAGGGGCCA ACGTCATCAA ATATGGCCTG
CCGATTGGTT ATGCATTGGC GGATATTGCG GCGGGGGAAC ACGTTCACGC CCACAATACG
CGCACGAATC TGAGCGATCT GGATCAGTAT CGCTATCAAC CTGATTTTCA GGATCTGCCT
GCGCAAGCGG CAGATCGTGA AGTGCAGATC TATCGTCGCG CTAACGGCGA TGTCGGGGTG
CGTAATGAGC TGTGGATCCT GCCAACCGTG GGCTGTGTCA ACGGCATCGC GCGGCAGATC
CAAAATCGTT TCCTGAAAGA AACCAACAAC GCCGAAGGTA CCGACGGCGT GTTCCTCTTC
AGCCACACCT ACGGCTGCTC ACAGCTGGGC GACGATCACA TTAATACCCG CACCATGCTG
CAAAACATGG TGCGCCACCC GAACGCAGGC GCAGTGCTGG TGATTGGTCT GGGCTGTGAA
AACAACCAGG TTGCCGCATT CCGTGAAACG CTGGGCGATA TCGATCCTGA ACGCGTTCAT
TTCATGATCT GCCAACAGCA GGATGATGAG ATCGAAGCCG GAATCGAGCA TTTGCATCAG
CTGTATAACG TGATGCGCAA CGATAAACGC GAGCCAGGCA AACTCAGCGA ACTGAAGTTT
GGTCTGGAGT GCGGTGGTTC TGACGGTCTT TCTGGTATTA CTGCTAACCC GATGCTGGGG
CGTTTCTCTG ACTACGTGAT TGCTAACGGC GGTACTACCG TACTGACCGA AGTGCCGGAG
ATGTTTGGCG CAGAGCAGTT GCTGATGGAC CATTGCCGCG ACGAAGCAAC GTTTGAAAAA
CTGGTCACCA TGGTCAACGA CTTCAAACAG TACTTTATTG CCCATGATCA GCCGATCTAT
GAAAACCCAT CGCCGGGGAA CAAAGCGGGC GGTATCACCA CGCTGGAAGA CAAATCACTT
GGCTGTACCC AGAAAGCGGG TTCCAGCGTC GTGGTTGACG TGCTGCGTTA CGGCGAGCGT
CTGAAAACGC CAGGGCTGAA CTTGTTAAGT GCGCCGGGTA ACGATGCCGT AGCGACCAGC
GCCCTGGCGG GTGCGGGCTG CCATATGGTG CTGTTCAGTA CTGGTCGTGG CACGCCGTAT
GGTGGATTTG TGCCGACGGT GAAAATCGCC ACCAACAGTG AACTGGCGGC GAAGAAAAAA
CACTGGATCG ACTTTGACGC GGGTCAGTTG ATCCACGGTA AAGCGATGCC GCAGTTGCTG
GAAGAATTTA TCGACACCAT CGTTGAGTTT GCCAACGGTA AGCAAACCTG TAACGAGCGT
AACGACTTCC GCGAACTGGC GATCTTCAAA AGCGGCGTAA CGCTATAA
 
Protein sequence
MQYIKIHALD NVAVALADLA EGTEVSVDNQ TVRLRQDVAR GHKFALTNIA KGANVIKYGL 
PIGYALADIA AGEHVHAHNT RTNLSDLDQY RYQPDFQDLP AQAADREVQI YRRANGDVGV
RNELWILPTV GCVNGIARQI QNRFLKETNN AEGTDGVFLF SHTYGCSQLG DDHINTRTML
QNMVRHPNAG AVLVIGLGCE NNQVAAFRET LGDIDPERVH FMICQQQDDE IEAGIEHLHQ
LYNVMRNDKR EPGKLSELKF GLECGGSDGL SGITANPMLG RFSDYVIANG GTTVLTEVPE
MFGAEQLLMD HCRDEATFEK LVTMVNDFKQ YFIAHDQPIY ENPSPGNKAG GITTLEDKSL
GCTQKAGSSV VVDVLRYGER LKTPGLNLLS APGNDAVATS ALAGAGCHMV LFSTGRGTPY
GGFVPTVKIA TNSELAAKKK HWIDFDAGQL IHGKAMPQLL EEFIDTIVEF ANGKQTCNER
NDFRELAIFK SGVTL