Gene ECH74115_4442 details


Gene Information

Locus tag: ECH74115_4442
Symbol: garD
ID: 6967518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4117600
End bp: 4119171
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 53%
IMG OID: 643388162
Product: galactarate dehydratase
Protein accession: YP_002272599
Protein GI: 209395943
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2721] Altronate dehydratase
TIGRFAM ID: [TIGR03248] galactarate dehydratase
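
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon, neither of which the record states explicitly.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(4117600, 4119171)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under these assumptions the coordinates give 1572 bp and 523 aa, which agrees with the Gene Length and Protein Length fields.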


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.648393
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAACA TCGAAATCAG ACAAGAAACG CCAACTGCGT TTTATATAAA AGTTCACGAC 
ACAGATAATG TGGCAATTAT TGTTAATGAT AATGGCCTGA AAGCAGGAAC GCGTTTTCCG
GATGGGCTGG AATTAATTGA ACATATTCCC CAGGGGCATA AAGTCGCATT GCTCGACATT
CCGGCTAATG GTGAAATTAT TCGTTATGGC GAAGTGATTG GTTACGCCGT GCGTGCAATC
CCACGCGGAA GCTGGATCGA CGAATCAATG GTTGTACTGC CGAAAGCGCC GCCGTTACAC
ACGCTGCCAC TGGCAACCAA AGTCCCGGAA CCCTTACCGC CGCTGGAAGG ATACACCTTT
GAGGGCTATC GCAATGCCGA TGGCAGCGTG GGCACCAAAA ACCTGCTCGG CATCACCACC
AGCGTCCACT GTGTGGCAGG CGTGGTGGAC TACGTAGTAA AAATCATTGA ACGCGATCTG
CTACCGAAAT ACCCGAACGT CGATGGCGTG GTGGGGCTGA ATCATTTGTA CGGTTGTGGC
GTGGCGATTA ACGCACCGGC GGCAGTTGTA CCTATCCGTA CCATTCACAA TATTTCGCTG
AATCCTAACT TTGGCGGCGA AGTAATGGTG ATTGGCCTGG GTTGTGAAAA GTTGCAGCCT
GAGCGCCTGC TGACTGGAAC GGATGATGTG CAAGCTATTC CAGTAGAAAG CGCCAGCATT
GTCAGTTTGC AGGATGAAAA GCATGTCGGT TTTCAGTCCA TGGTCGAGGA TATTTTGCAG
GTCGCCGAAC GCCATCTACA AAAACTGAAT CAACGTCAGC GAGAAACCTG TCCGGCTTCA
GAACTGGTTG TCGGCATGCA GTGCGGTGGC AGCGATGCCT TTTCTGGCGT AACGGCAAAC
CCGGCGGTTG GGTATGCGTC TGATCTACTG GTGCGCTGCG GCGCAACGGT GATGTTCTCA
GAAGTCACGG AAGTGCGTGA CGCGATCCAT CTGCTGACAC CACGCGCAGT GAACGAAGAG
GTCGGCAAAC GGCTGCTGGA GGAGATGGAG TGGTACGATA ACTATCTCAA TATGGGAAAA
ACCGACCGCA GCGCCAACCC TTCGCCGGGC AACAAGAAAG GCGGTCTGGC AAATGTAGTG
GAGAAAGCGC TCGGCTCCAT TGCTAAATCG GGTAAAAGTG CAATTGTTGA AGTGCTGTCG
CCCGGTCAAC GCCCGACTAA ACGCGGATTA ATTTACGCCG CGACGCCAGC CAGCGATTTT
GTCTGTGGCA CGCAACAGGT GGCTTCGGGT ATCACCGTGC AAGTGTTTAC GACCGGTCGT
GGTACGCCGT ACGGCCTGAT GGCGGTACCC GTCATTAAAA TGGCGACCCG CACCGAGCTG
GCGAACCGCT GGTTTGATTT AATGGATATT AACGCGGGCA CTATCGCCAC CGGCGAAGAA
ACCATTGAAG AGGTGGGCTG GAAGTTGTTC CACTTTATTC TCGACGTCGC CAGCGGGAAG
AAGAAAACCT TCTCGGATCA ATGGGGACTG CATAACCAGC TGGCGGTGTT TAACCCAGCA
CCGGTGACCT GA
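
The record reports a GC content of 53% but does not say how the figure was computed. As a minimal sketch, assuming it is simply the fraction of G and C bases over the whole gene sequence, the value could be recomputed as follows. The function names are hypothetical, and the whitespace handling reflects the blocks of 10 bases used in the listing above.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks between 10-base blocks; upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# The first two 10-base blocks of the gene sequence above, as a small example.
fragment = "ATGGCCAACA TCGAAATCAG"
print(clean_sequence(fragment), gc_percent(fragment))
```

Passing the full gene sequence to `gc_percent` would give a value to compare with the reported 53%, subject to however the source rounds it.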
 
Protein sequence
MANIEIRQET PTAFYIKVHD TDNVAIIVND NGLKAGTRFP DGLELIEHIP QGHKVALLDI 
PANGEIIRYG EVIGYAVRAI PRGSWIDESM VVLPKAPPLH TLPLATKVPE PLPPLEGYTF
EGYRNADGSV GTKNLLGITT SVHCVAGVVD YVVKIIERDL LPKYPNVDGV VGLNHLYGCG
VAINAPAAVV PIRTIHNISL NPNFGGEVMV IGLGCEKLQP ERLLTGTDDV QAIPVESASI
VSLQDEKHVG FQSMVEDILQ VAERHLQKLN QRQRETCPAS ELVVGMQCGG SDAFSGVTAN
PAVGYASDLL VRCGATVMFS EVTEVRDAIH LLTPRAVNEE VGKRLLEEME WYDNYLNMGK
TDRSANPSPG NKKGGLANVV EKALGSIAKS GKSAIVEVLS PGQRPTKRGL IYAATPASDF
VCGTQQVASG ITVQVFTTGR GTPYGLMAVP VIKMATRTEL ANRWFDLMDI NAGTIATGEE
TIEEVGWKLF HFILDVASGK KKTFSDQWGL HNQLAVFNPA PVT
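
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs in its set of permitted start codons; since this gene starts with ATG, a plain codon lookup is enough here. The names below are hypothetical, and this is not necessarily how the source database derived the protein.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    seq = "".join(raw.split()).upper()  # drop spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGGCCAACA TCGAAATCAG ACAAGAAACG"))
print(translate("CCGGTGACCT GA"))
```

The first call corresponds to the opening block of the protein sequence (MANIEIRQET), and the second to its final three residues (PVT) followed by the TGA stop codon.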