Gene ECH74115_3270 details

Gene Information

Locus tag: ECH74115_3270
Symbol: gtdA
ID: 6971767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3003416
End bp: 3004444
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 56%
IMG OID: 643387083
Product: gentisate 1,2-dioxygenase
Protein accession: YP_002271547
Protein GI: 209399809
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3435] Gentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR02272] gentisate 1,2-dioxygenase
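
The start, end, gene length and protein length fields can be cross-checked against each other with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both assumptions are consistent with the values reported in this record, but the record itself does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3003416
end_bp = 3004444
reported_gene_length_bp = 1029
reported_protein_length_aa = 342

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
computed_protein_length_aa = computed_gene_length_bp // 3 - 1

print(computed_gene_length_bp, computed_gene_length_bp == reported_gene_length_bp)
print(computed_protein_length_aa, computed_protein_length_aa == reported_protein_length_aa)
```

Under these assumptions both computed values agree with the reported ones (1029 bp and 342 aa).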


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.395363
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGATA ACAATCAAAA TAGCCGCGAG CAATTTTATC AGCATATTTC CGGGCAAAAC 
CTGACCCCGC TGTGGGAGTC ACTGCACCAT CTGGTGCCGA AAACGCCCAA CGCTAACTGT
GCGCCAGCGT ACTGGAATTA TCAGGAGATC CGCCCGCTGC TACTGGAGAG CGGCGGCCTG
ATTGGTGCGA AAGAGGCCGT GCGCCGCGTG CTGGTGCTGG AAAACCCGGC GCTGCGCGGG
CAGTCTTCCA TTACCGCGAC TTTATATGCC GGTTTGCAAC TGATCATGCC GGGCGAAGTG
GCACCGAGTC ATCGTCATAA TCAGTCGGCG CTGCGTTTTA TTGTTGAAGG CAAAGGGGCA
TTTACCGCCG TTGATGGCGA ACGCACGCCA ATGAATGAAG GCGATTTTAT CCTGACCCCG
CAGTGGCGCT GGCACGATCA CGGTAACCCT GGCGACGAAC CGGTTATCTG GCTCGATGGG
CTGGATCTGC CATTAGTGAA TATTCTGGGC TGCGGTTTTG CCGAAGATTA TCCGGAAGAG
CAACAACCGG TAACGCGTAA AGAGGGAGAT TATCTGCCGC GTTACGCTGC CAATATGTTG
CCGCTGCGGC ATCAGACGGG GAACTCCTCG CCGATCTTTA ACTATCGTTA TGACCGCAGC
CGCGAAGTGC TGCACGATTT AACCCGACTG GGCGATGCCG ACGAGTGGGA TGGCTACAAA
ATGCGCTACG TCAACCCGGT CACCGGCGGC TATCCGATGC CGTCGATGGG CGCTTTCCTG
CAACTGCTGC CGAAAGGGTT TGCCTCACGC GTTGCGCGTA CCACTGACAG CACCATCTAC
CATGTGGTGG AAGGTAGCGG GCAGGTCATC ATCGGCAATG AAACCTTCAG CTTTAGTGCA
AAAGATATCT TCGTGGTGCC GACCTGGCAT GGCGTGTCGT TCCAGACCAC GCAAGATTCC
GTTTTATTTA GTTTTTCGGA CAGACCGGTA CAAGAAGCCC TGGGGCTGTT CCGCGAAGCG
CGTTATTAA
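
A GC content figure such as the one reported in this record is the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted in the blocked layout used here (groups of ten bases separated by spaces and line breaks). The function name `gc_content` is hypothetical, and the example runs only on the first ten-base block, not on the whole gene.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    clean = "".join(dna.split()).upper()  # drop spaces and line breaks
    if not clean:
        raise ValueError("empty sequence")
    gc = sum(1 for base in clean if base in "GC")
    return 100.0 * gc / len(clean)


# First ten bases of the gene sequence shown in this record.
first_block = "ATGACTGATA"
print(gc_content(first_block))  # 3 of 10 bases are G or C -> 30.0
```

To reproduce the whole-gene figure, pass the complete gene sequence as one string; whitespace inside it is ignored.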
 
Protein sequence
MTDNNQNSRE QFYQHISGQN LTPLWESLHH LVPKTPNANC APAYWNYQEI RPLLLESGGL 
IGAKEAVRRV LVLENPALRG QSSITATLYA GLQLIMPGEV APSHRHNQSA LRFIVEGKGA
FTAVDGERTP MNEGDFILTP QWRWHDHGNP GDEPVIWLDG LDLPLVNILG CGFAEDYPEE
QQPVTRKEGD YLPRYAANML PLRHQTGNSS PIFNYRYDRS REVLHDLTRL GDADEWDGYK
MRYVNPVTGG YPMPSMGAFL QLLPKGFASR VARTTDSTIY HVVEGSGQVI IGNETFSFSA
KDIFVVPTWH GVSFQTTQDS VLFSFSDRPV QEALGLFREA RY
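
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that translation is given below, using only the Python standard library. The codon table is built by hand from the usual NCBI layout (bases ordered T, C, A, G); table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, which this sketch does not model. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are used as examples.

```python
from itertools import product

# Codon assignments in NCBI order: first base varies slowest, third fastest,
# with bases ordered T, C, A, G. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace; stops are kept as '*'."""
    clean = "".join(dna.split()).upper()
    usable = len(clean) - len(clean) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[clean[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence in this record (60 bases).
first_line = "ATGACTGATA ACAATCAAAA TAGCCGCGAG CAATTTTATC AGCATATTTC CGGGCAAAAC"
print(translate(first_line))   # MTDNNQNSREQFYQHISGQN

# Last line of the gene sequence: two codons followed by the stop codon.
print(translate("CGTTATTAA"))  # RY*
```

The two outputs match the first twenty residues and the final two residues of the protein sequence above, with the trailing `*` marking the stop codon that is not part of the protein.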