Gene EcSMS35_0614 details


Gene Information

Locus tag: EcSMS35_0614
Symbol: entE
ID: 6143426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 626994
End bp: 628604
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 57%
IMG OID: 641615506
Product: enterobactin synthase subunit E
Protein accession: YP_001742712
Protein GI: 170681611
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1021] Peptide arylation enzymes
TIGRFAM ID: [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase
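
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a short calculation. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (the record does not state either convention explicitly). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(626994, 628604)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Under these assumptions the computed values match the listed Gene Length (1611 bp) and Protein Length (536 aa).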


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATTC CATTCACCCG CTGGCCGGAA GAGTTTGCCC GTCGCTATCG GGAAAAAGGC 
TACTGGCAGG ATTTGCCGCT GACCGACATT CTGACCCGTC ATGCCGCAAG CGACAGCATC
GCGGTTATCG ACGGCGAGCG GCAGTTGAGT TATCGGGAGC TGAATCAGGC GGCGGATAAC
CTCGCGTGTA GTTTGCGCCG TCAGGGCATT AAACCTGGTG AAACCGCGCT GGTACAGCTG
GGTAACGTCG CTGAACTTTA CATTACCTTT TTCGCGCTGC TGAAACTGGG CGTTGCGCCG
GTGCTGGCAC TGTTCAGCCA TCAGCGTAGT GAACTGAACG CCTACGCCAG CCAGATTGAA
CCGGCGTTAC TGATTGCCGA TCGCCAGCAT GCGCTGTTTA GCGGAGATGA TTTCCTCAAC
ACATTTGTTG CAGAGCATTC TTCCATTCGC GTGGTGCAGC TGCTCAACGA CAGCGGTGAG
CATAACTTGC AGGATGCGAT TAACCATCCG GCTGAGGATT TTACTGCCAC GCCATCTCCT
GCTGATGAAG TGGCCTATTT CCAGCTTTCC GGCGGCACCA CCGGCACTCC GAAACTGATC
CCGCGCACTC ATAACGACTA CTACTACAGC GTGCGTCGTA GCGTCGAGAT TTGTCAGTTC
ACACAACAGA CGCGCTACCT GTGCGCGATC CCGGCGGCTC ATAACTACGC CATGAGTTCA
CCGGGATCGC TGGGCGTCTT TCTCGCTGGC GGCACTGTCG TTCTGGCTGC CGACCCCAGC
GCCACGCTTT GCTTCCCATT GATTGAAAAA CATCAGGTGA ACGTCACCGC GCTGGTGCCG
CCAGCAGTCA GCCTGTGGTT GCAGGCACTG GCTGAAGGCG AAAGCCGGGC GCAGCTTGCC
TCGCTGAAAC TGTTACAGGT CGGCGGCGCA CGTCTTTCTG CCACGCTTGC GGCGCGTATT
CCCGCTGAGA TTGGCTGCCA GTTGCAGCAG GTGTTTGGCA TGGCGGAAGG GCTGGTGAAC
TACACCCGTC TTGATGATAG TGCGGAGAAA ATTATCCATA CCCAGGGTTA CCCAATGTGC
CCGGACGATG AAGTATGGGT TGCCGATGCC GAAGGAAATC CACTGCCGCA AGGGGAAGTT
GGACGCCTGA TGACGCGCGG GCCGTACACC TTCCGTGGCT ATTACAAAAG CCCGCAGCAC
AATGCCAGCG CCTTTGATGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCTATTGAT
CCAGAGGGTT ACATCACCGT GCAGGGGCGC GAGAAAGATC AGATCAACCG TGGCGGCGAG
AAGATCGCTG CCGAAGAGAT CGAAAACCTG CTGCTGCGCC ACCCGGCGGT GATCTACGCC
GCACTGGTGA GCATGGAAGA TGAGCTGATG GGCGAAAAAA GCTGTGCTTA TCTGGTGGTA
AAAGAGCCGC TGCGCGCGGT GCAGGTGCGT CGTTTCCTGC GTGAACAGGG TATTGCCGAA
TTTAAATTAC CGGATCGCGT GGAGTGTGTG GATTCACTTC CGCTGACGGC GGTCGGGAAA
GTCGATAAAA AACAATTACG TCAGTGGCTG GCGTCACGCA CATCAGCCTG A
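
The gene sequence above is printed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before computing a length or a GC fraction. The following is a minimal sketch of that cleanup, applied only to the first line of the sequence as a small example. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a small example.
first_line = "ATGAGCATTC CATTCACCCG CTGGCCGGAA GAGTTTGCCC GTCGCTATCG GGAAAAAGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_fraction(seq))
```

Passing the full gene sequence through the same two functions gives a value that can be compared against the GC content listed in the Gene Information section.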
 
Protein sequence
MSIPFTRWPE EFARRYREKG YWQDLPLTDI LTRHAASDSI AVIDGERQLS YRELNQAADN 
LACSLRRQGI KPGETALVQL GNVAELYITF FALLKLGVAP VLALFSHQRS ELNAYASQIE
PALLIADRQH ALFSGDDFLN TFVAEHSSIR VVQLLNDSGE HNLQDAINHP AEDFTATPSP
ADEVAYFQLS GGTTGTPKLI PRTHNDYYYS VRRSVEICQF TQQTRYLCAI PAAHNYAMSS
PGSLGVFLAG GTVVLAADPS ATLCFPLIEK HQVNVTALVP PAVSLWLQAL AEGESRAQLA
SLKLLQVGGA RLSATLAARI PAEIGCQLQQ VFGMAEGLVN YTRLDDSAEK IIHTQGYPMC
PDDEVWVADA EGNPLPQGEV GRLMTRGPYT FRGYYKSPQH NASAFDANGF YCSGDLISID
PEGYITVQGR EKDQINRGGE KIAAEEIENL LLRHPAVIYA ALVSMEDELM GEKSCAYLVV
KEPLRAVQVR RFLREQGIAE FKLPDRVECV DSLPLTAVGK VDKKQLRQWL ASRTSA