Gene EcolC_0572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0572 
Symbol 
ID6066119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp613812 
End bp615383 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content53% 
IMG OID641599979 
Productgalactarate dehydratase 
Protein accessionYP_001723576 
Protein GI170018622 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2721] Altronate dehydratase 
TIGRFAM ID[TIGR03248] galactarate dehydratase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAACA TCGAAATCAG ACAAGAAACG CCAACTGCGT TTTATATAAA AGTTCACGAC 
ACAGATAATG TGGCAATTAT TGTTAATGAT AATGGCCTGA AAGCAGGAAC GCGTTTTCCG
GATGGGCTGG AATTAATTGA ACATATTCCC CAGGGGCATA AAGTCGCATT GCTGGACATT
CCGGCTAATG GTGAAATTAT TCGTTATGGC GAAGTGATTG GTTACGCCGT GCGTGCAATC
CCACGCGGAA GCTGGATCGA CGAATCAATG GTTGTACTAC CGGAAGCGCC GCCGTTACAC
ACGCTGCCAC TGGCAACCAA AGTCCCGGAA CCCTTACCGC CGCTGGAAGG ATACACCTTT
GAGGGCTATC GCAATGCCGA TGGCAGCGTG GGCACCAAAA ACCTGCTCGG CATCACCACC
AGCGTCCACT GTGTGGCAGG CGTGGTGGAC TACGTAGTAA AAATCATTGA ACGCGATCTG
CTACCGAAAT ACCCGAACGT CGATGGCGTG GTGGGGCTGA ATCATTTGTA CGGTTGTGGC
GTGGCGATTA ACGCACCGGC GGCAGTTGTG CCTATTCGTA CCATTCACAA TATTTCGCTG
AACCCTAACT TTGGCGGCGA AGTAATGGTG ATTGGCCTGG GTTGTGAAAA GTTGCAGCCT
GAGCGCCTGC TGACTGGAAC GGATGATGTG CAAGCTATTC CAGTAGAAAG CGCCAGCATT
GTCAGTTTGC AGGATGAAAA GCATGTCGGT TTTCAGTCCA TGGTCGAGGA TATTTTGCAG
GTCGCCGAAC GCCATCTACA AAAACTGAAT CAACGTCAGC GAGAAACCTG TCCGGCTTCA
GAACTGGTTG TCGGCATGCA GTGCGGTGGC AGCGATGCCT TTTCTGGCGT AACGGCAAAC
CCGGCGGTTG GGTATGCGTC TGATCTACTG GTGCGCTGCG GCGCAACGGT GATGTTCTCA
GAAGTCACGG AAGTGCGTGA CGCGATCCAT CTGCTGACAC CACGCGCAGT GAACGAAGAG
GTCGGCAAAC GGCTGCTGGA GGAGATGGAG TGGTACGATA ACTATCTCAA TATAGGAAAA
ACCGACCGCA GCGCCAACCC TTCGCCGGGC AACAAGAAAG GCGGTCTGGC AAACGTGGTA
GAGAAGGCAC TCGGCTCCAT TGCTAAATCG GGTAAAAGCG CAATTGTTGA AGTGCTGTCG
CCCGGTCAAC GCCCGACTAA ACGCGGATTA ATTTACGCCG CGACGCCAGC CAGCGATTTT
GTCTGTGGCA CGCAACAGGT GGCTTCGGGT ATCACAGTGC AAGTGTTTAC GACCGGCCGT
GGTACGCCGT ACGGCCTGAT GGCGGTACCC GTCATTAAAA TGGCGACCCG CACCGAGCTG
GCGAACCGCT GGTTTGATTT AATGGATATT AACGCGGGCA CTATCGCCAC CGGCGAAGAA
ACCATTGAAG AGGTGGGCTG GAAGTTGTTC CACTTTATTC TCGACGTCGC CAGCGGGAAG
AAGAAAACCT TCTCGGATCA ATGGGGGCTG CATAACCTGC TGGCGGTGTT TAACCCGGCA
CCGGTGACCT GA
 
Protein sequence
MANIEIRQET PTAFYIKVHD TDNVAIIVND NGLKAGTRFP DGLELIEHIP QGHKVALLDI 
PANGEIIRYG EVIGYAVRAI PRGSWIDESM VVLPEAPPLH TLPLATKVPE PLPPLEGYTF
EGYRNADGSV GTKNLLGITT SVHCVAGVVD YVVKIIERDL LPKYPNVDGV VGLNHLYGCG
VAINAPAAVV PIRTIHNISL NPNFGGEVMV IGLGCEKLQP ERLLTGTDDV QAIPVESASI
VSLQDEKHVG FQSMVEDILQ VAERHLQKLN QRQRETCPAS ELVVGMQCGG SDAFSGVTAN
PAVGYASDLL VRCGATVMFS EVTEVRDAIH LLTPRAVNEE VGKRLLEEME WYDNYLNIGK
TDRSANPSPG NKKGGLANVV EKALGSIAKS GKSAIVEVLS PGQRPTKRGL IYAATPASDF
VCGTQQVASG ITVQVFTTGR GTPYGLMAVP VIKMATRTEL ANRWFDLMDI NAGTIATGEE
TIEEVGWKLF HFILDVASGK KKTFSDQWGL HNLLAVFNPA PVT