Gene EcE24377A_2433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2433 
SymbolgtdA 
ID5590259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2415446 
End bp2416474 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content57% 
IMG OID640926094 
Productgentisate 1,2-dioxygenase 
Protein accessionYP_001463489 
Protein GI157155116 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3435] Gentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR02272] gentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATA ACAATCAAAA TAGCCGCGAA CAATTTTATC AGCACATTTC CGGGCAAAAC 
CTGACCCCGC TGTGGGAGTC ACTACACCAT CTGGTGCCGA AAACGCCCAA CGCTAACTGT
GCGCCAGCGT ACTGGAATTA TCAGGAGATC CGCCCGCTGC TGCTGGAGAG CGGCAGCCTG
ATTGGCGCGA AAGAGGCGGT GCGCCGCGTG CTGGTGCTGG AAAACCCGGC GCTGCGCGGG
CAGTCTTCCA TCACCGCGAC TTTATATGCC GGTTTGCAAC TGATCATGCC GGGCGAAGTG
GCACCGAGTC ATCGCCATAA CCAGTCGGCG CTGCGTTTTA TTGTTGAAGG CAAAGGGGCA
TTTACCGCCG TTGACGGCGA ACGCACGCCA ATGAATGAGG GCGATTTTAT CCTCACTCCG
CAGTGGCGCT GGCACGATCA CGGTAACCCT GGCGACGAAC CAGTTATCTG GCTCGATGGG
CTGGATCTGC CATTAGTGAA TACTCTGGGC TGCGGTTTTG CCGAAGATTA CCCGGAAGAG
CAACAACCGG TAACGCGTAA AGAGGGCGAT TATCTGCCGC GTTACGCTGC CAATATGTTG
CCGCTGCGCC ATCAGACCGG GAACTCCTCA CCCATCTTTA ACTACCGTTA TGACCGCAGC
CGCGAAGTGC TGCACGATTT AACCCGCCTG GGCGATGCCG ATGAGTGGGA TGGCTACAAA
ATGCGCTACG TCAACCCGGT CACCGGTGGC TATCCGATGC CGTCGATGGG CGCTTTCCTG
CAACTGCTGC CGAAAGGGTT CGCCTCCCGC GTGGCGCGTA CCACTGACAG CACCATCTAT
CACGTGGTGG AAGGTGGCGG GCAGGTCACT ATTGGCAATG AAACCTTCAG CTTTAGTGCA
AAAGATATCT TCGTGGTGCC GACCTGGCAC GGCGTGTCGT TCCAGACCAC GCAAGATACC
GTGTTATTCA GTTTTTCGGA CAGACCGGTA CAAGAAGCCC TGGGACTGTT CCGCGAAGCG
CGTTATTAA
 
Protein sequence
MTDNNQNSRE QFYQHISGQN LTPLWESLHH LVPKTPNANC APAYWNYQEI RPLLLESGSL 
IGAKEAVRRV LVLENPALRG QSSITATLYA GLQLIMPGEV APSHRHNQSA LRFIVEGKGA
FTAVDGERTP MNEGDFILTP QWRWHDHGNP GDEPVIWLDG LDLPLVNTLG CGFAEDYPEE
QQPVTRKEGD YLPRYAANML PLRHQTGNSS PIFNYRYDRS REVLHDLTRL GDADEWDGYK
MRYVNPVTGG YPMPSMGAFL QLLPKGFASR VARTTDSTIY HVVEGGGQVT IGNETFSFSA
KDIFVVPTWH GVSFQTTQDT VLFSFSDRPV QEALGLFREA RY