Gene Dd703_3074 details

Gene Information

Locus tag: Dd703_3074
Symbol: entE
ID: 8089910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dickeya dadantii Ech703
Kingdom: Bacteria
Replicon accession: NC_012880
Strand:
Start bp: 3610224
End bp: 3611834
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 53%
IMG OID: 644837155
Product: enterobactin synthase subunit E
Protein accession: YP_002988667
Protein GI: 242240486
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1021] Peptide arylation enzymes
TIGRFAM ID: [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase
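
The start, end, gene length, and protein length fields above are arithmetically related, and the short Python sketch below checks that they agree. It assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed 1611 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3610224, 3611834)
print(length, protein_length_aa(length))  # prints: 1611 536
```

Both values match the Gene Length and Protein Length fields of this record.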


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTATCG CGTATCACCG CTGGCCTAAC GCGTTGGCAG CCCGCTATCG GGCTAAAGGT 
TACTGGATAG ATCTGCCACT GACAGACGTG ATAACACGGC ATGTGGATAC CGAACGGGTC
GCACTGACCG ATAATGGCAA AAACTACAGT TATGCGGAAC TCCATCGCTT GTCTAATCAG
TTGGCTGCCG CACTTGCGCA ACGCGGACTT CGGCGTGGTG ACACGGCGCT GATCCAATTA
GGCAATATCG CCGAATTTTA TATTGTCTTC TTCGCCCTGT TGAAAATCGG CGTAGCGCCA
GTCAATGCGC TATTTAGCCA CCAGCGGCAC GAAATGGATG CCTACGCCCT GCAGATTCAC
CCCCGCGCGC TGATTGCCGA TCGCCAACAT AAACTATTTC ACGATAATAC CTATCTGCAG
CAGTTACGCC ACGCACACCC AAACCTGAAC ATTGTGGCCT TTCACCATCA GGATACGGGC
GATGAGTCAC TGTCCATGCT GATACAACAA GCAGACACTG GCTTTGTTCC TTCGCCATCA
GCGGCAGATG AGGTCGCGTT TTTCCAGCTT TCCGGCGGCA GCACCGGGAC ACCCAAGCTC
ATTCCACGCA CTCACAACGA TTACTATTAC AGTATTCGTG GCAGTGTCGA CATCTGTCGC
TTCACCCCGC AGACACGCTA CCTATGCGCA CTGCCCGCAG CACATAACTA TCCCCTCAGT
TCCCCAGGAT CACTCGGCGT ATTCTATGCC AATGGCACGG TGATACTGGC ACCCGATCCG
AGTGCGACCA CCTGCTTCGC ATTGATTGAA CGGTATCAGG TGACAGTCGC GGCACTGGTT
CCTCCCGCCG CCAGTCTATG GCTACAAGCG GTCAAGGATA GCGGCAGTCA GGCGCTGCGT
TCGCTTGAAC TTCTGCAGGT GGGCGGCGCG CGGCTGAGTC CGCGTGTGGC ATCAGAAATC
CCTTCCCAAC TAGGCTGCCA ACTACAGCAA GTGTTCGGCA TGGCCGAAGG GCTGGTGAAT
TATACTCGTC TCGACGATCC CCAGGATCGT ATTTTCACAA CTCAGGGGCG TCCCATTTCA
CCCGACGACG ACGTGTGGGT GGCTGATGAC CATGGCGCGC CTCTGCCTGC CAACCGGATC
GGGCGGTTAA TGACGCGAGG TCCTTACACC ATTCGCGGCT ATTACAACAG CCCACAGTAC
AATGCTACCG TTTTCGATGC CGACGGCTTC TACTGTTCAG GCGATCTGGT GGCCATTGAT
GAGGACGGTT ATATCACTGT TCATGGGCGG GAAAAGGATC AGATCAATCG TGGTGGTGAA
AAGATTGCCG CAGAAGAGAT CGAGAATTTA CTACTAATGC ATCCCTCAGT GACAGAAGCG
GCGCTGGTGG CGATAGAGGA CGAACTGATA GGAGAAAAAA GTTACGCCTT CATCATGGCG
AATGAGCCGC TAAAAAACGT CGAAATACGC CGTTTTTTAC GCAGCCATGG CGTAGCGGAT
TACAAATTAC CCGATCACGT TGAAATTCTA TCTGCTTTGC CGCTAACACC GGTCGGCAAA
ATAGATAAAA AACAGTTGCG TCAATTATTG GTACTGAAAA CCATTTTATA G
 
Protein sequence
MTIAYHRWPN ALAARYRAKG YWIDLPLTDV ITRHVDTERV ALTDNGKNYS YAELHRLSNQ 
LAAALAQRGL RRGDTALIQL GNIAEFYIVF FALLKIGVAP VNALFSHQRH EMDAYALQIH
PRALIADRQH KLFHDNTYLQ QLRHAHPNLN IVAFHHQDTG DESLSMLIQQ ADTGFVPSPS
AADEVAFFQL SGGSTGTPKL IPRTHNDYYY SIRGSVDICR FTPQTRYLCA LPAAHNYPLS
SPGSLGVFYA NGTVILAPDP SATTCFALIE RYQVTVAALV PPAASLWLQA VKDSGSQALR
SLELLQVGGA RLSPRVASEI PSQLGCQLQQ VFGMAEGLVN YTRLDDPQDR IFTTQGRPIS
PDDDVWVADD HGAPLPANRI GRLMTRGPYT IRGYYNSPQY NATVFDADGF YCSGDLVAID
EDGYITVHGR EKDQINRGGE KIAAEEIENL LLMHPSVTEA ALVAIEDELI GEKSYAFIMA
NEPLKNVEIR RFLRSHGVAD YKLPDHVEIL SALPLTPVGK IDKKQLRQLL VLKTIL