Gene SbBS512_E0496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0496 
SymbolentE 
ID6272510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp479723 
End bp481333 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content56% 
IMG OID641724714 
Productenterobactin synthase subunit E 
Protein accessionYP_001879261 
Protein GI187731959 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTC CATTCACCCG CTGGCCGGAA GAGTTTGCCC GTCGCTATCG GGAAAAAGGC 
TACTGGCAGG ATTTGCCGCT GACCGACATT CTGACTCGCC ACGCTGCGAG TGACAGCATC
GCGGTTATCG ACGGCGAGCT ACAGTTGAGT TACCGGGAGC TGAATCAGGC GGCGGATAAC
CTCGCGTGTA CTTTACGCCG TCAGGGCATT AAACCTGGTG AAACCGCGCT GGTACAACTG
GGTAACGTCG CTGAATTGTA TATTACCTTT TTCGCGCTGC TGAAACTGGG CGTTGCGCCG
GTGCTGGCGT TGTTCAGCCA TCAGCGTAGT GAACTGAACG CCTATGCCAG CCAGATTGAA
CCCGCATTGC TGATTGCCGA TCGCCAACAT GCGTTGTTTA GCGGGGATGA TTTCCTCAAT
ACTTTCGTCA CAGAACATTC CTCCATTCGC GTGGTGCAAC TGCACAACGA CAGCGGTGAG
CATAACTTGC AGGATGCGAT TAACCATCCG GCAGACGGTT TTACTGCCAC GCCATCACCT
GCTGATGAAG TGGCCTATTT CCAGCTTTCC GGCGGCACCA CCGGTACACC GAAGCTGATC
CCGCGCACTC ATAACGACTA CTACTACAGC GTGCGTCGTA GCGTCGAGAT TTGTCAGTTC
ACACAACAGA CGCGCTACCT GTGCGCGATC CCGGCGGCTC ATAACTACGC CATGAGTTCG
CCAGGATCGC TGGGCGTCTT TCTTGCCGGA GGAACGGTTG TTCTGGCGGC CGATCCCAGC
GCTACGCTTT GCTTCCCATT GATTGAAAAA CATCAGGTGA ACGTCACCGC GCTGGTGCCG
CCGGCAGTCA GCCTGTGGTT GCAGGCGCTG GCTGAAGGCG AAAGCCGGGC GCAGCTTGCC
TCGCTGAAAC TGTTACAGGT CGGCGGCGCA CGTCTTTCTG CCACCCTTGC GGCGCGTATT
CCCGCTGAGA TTGGCTGCCA GTTGCAGCAG GTGTTTGGCA TGGCGGAAGG GCTGGTGAAC
TACACCCGTC TTGATGATAG CGCGGAGAAA ATTATCCATA CCCAGGGTTA CCCAATGTGT
CCGGACGATG AAGTGTGGGT TGCCGATGCC GAAGGAAATC CACTGCCGCA AGGGGAAGTC
GGACGCCTGA TGACGCGCGG GCCGTACACC TTCCGCGGTT ATTACAAAAG TCCGCAGCAC
AATGCCAGCG CCTTTGATGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCTATTGAT
CCAGAGGGTT ACATCACCGT GCAGGGGCGC GAGAAAGATC AGATCAACCG TGGCGGCGAG
AAGATCGCTG CCGAAGAGAT CGAAAACCTG CTGCTGCGCC ATCCGGCGGT GATCTATGCC
GCACTGGTGA GCATGGAAGA TGAGCTGATG GGCGAAAAAA GCTGTGCTTA TCTGGTGGTA
AAAGAGCCGC TGCGCGCGGT GCAGGTGCGT CGTTTCCTGC GTGAACAGGG TATTGCCGAA
TTTAAATTAC CGGATCGCGT GGAGTGTGTA GATTCACTTC CGCTGACGGC GGTCGGGAAA
GTCGATAAAA AACAATTAAG TCAGTGGCTG GCGTCACGCG CATCAGCCTG A
 
Protein sequence
MSIPFTRWPE EFARRYREKG YWQDLPLTDI LTRHAASDSI AVIDGELQLS YRELNQAADN 
LACTLRRQGI KPGETALVQL GNVAELYITF FALLKLGVAP VLALFSHQRS ELNAYASQIE
PALLIADRQH ALFSGDDFLN TFVTEHSSIR VVQLHNDSGE HNLQDAINHP ADGFTATPSP
ADEVAYFQLS GGTTGTPKLI PRTHNDYYYS VRRSVEICQF TQQTRYLCAI PAAHNYAMSS
PGSLGVFLAG GTVVLAADPS ATLCFPLIEK HQVNVTALVP PAVSLWLQAL AEGESRAQLA
SLKLLQVGGA RLSATLAARI PAEIGCQLQQ VFGMAEGLVN YTRLDDSAEK IIHTQGYPMC
PDDEVWVADA EGNPLPQGEV GRLMTRGPYT FRGYYKSPQH NASAFDANGF YCSGDLISID
PEGYITVQGR EKDQINRGGE KIAAEEIENL LLRHPAVIYA ALVSMEDELM GEKSCAYLVV
KEPLRAVQVR RFLREQGIAE FKLPDRVECV DSLPLTAVGK VDKKQLSQWL ASRASA