Gene Spro_3420 details

Gene Information

Locus tag: Spro_3420
Symbol: entE
ID: 5604240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 3784139
End bp: 3785767
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 59%
IMG OID: 640938973
Product: enterobactin synthase subunit E
Protein accession: YP_001479646
Protein GI: 157371657
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1021] Peptide arylation enzymes
TIGRFAM ID: [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase
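
The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(3784139, 3785767)
print(length, protein_length(length))  # prints: 1629 542
```

Under those assumptions the result agrees with the Gene Length (1629 bp) and Protein Length (542 aa) fields.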


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00195409
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATTG CTTTTACCCC CTGGCCTGAG GCGTTAGCCC ATCGCTATCG CGAACGCGGC 
TATTGGACCG ACCGGCCCCT GACCGACATC ATCACTCGCC AGGCCAAAAA TGACGCTATC
GCGCTGATCG ACCCGGAGCG CAGCCTGAGT TACCGCCGGC TTAATCAGCT CTCCGATCGG
TTGGCTGCCG CCCTGCAGCG CCGCGGCATT CAGAGCGGAG ATACTGCTCT GGTGCAGCTG
GGCAACGTGG TCGAGTTCTA TGTAACCTTC TTCGCGTTGT TGAAAATTGG CGTGGTACCG
GTGAACGCGT TGTTCAGCCA TCAACGCAAC GAGCTGAACG CCTACGCCGT ACAGATCAAA
CCAGCATTGC TGATTGCCGA CCGTCAGCAT GGCCTGTTTG GCAACGACGA ATTTCTGACG
GCGTTTCGCG CCGAGCATCC TTCATTGCGC GTGGTGGCAT TGCGCAATCA GGACGGGGGC
GAGCAGTCGC TGGCGGCCTG GCTGGAAGAG GACAGCAGTG GTTTTGTTGC GACGCCAAGC
CCGGCCGACC AGGTGGCGTT TTTCCAGCTT TCCGGTGGCA GCACCGGAAC GCCAAAGCTG
ATCCCGCGGA CTCACAACGA TTACTACTAC AGCATTCGCC GCAGCGTGGA AATTTGCCAT
TTCGACGTCG ATACCCGCTA CCTGTGCGCG CTGCCGGTGG CCCATAACTA TCCGATGAGT
TCTCCGGGCG TCCTGGGCGT GTTTTATGGC GCCGGGCTGG TGGTGTTTGC CAGCGACCCG
GATGCCGGAC AATGTTTCCG TTTGATCGAG CAGCATCAGA TTAACGTGAC GGCGCTGGTG
CCACCGGCGG TGACGCTGTG GCTGCAGGCG ATTGAAGAGT GGGGCGGTTG CCAGCAACTG
ACCAGCCTTA AGCTGTTGCA GGTGGGCGGC GCCAAGCTGG GCGAAACCCT GGCGGCACGT
ATTCCGGCCG AGATCGGCTG CCAGTTGCAG CAGGTGTTCG GCATGGCGGA AGGCCTGGTG
AACTACACCC GTCTGGACGA TGACGATCAG CATATTCTGA CCACTCAGGG CTGCCCGATG
TCGCCGGATG ACGAGCTGTG GGTGGCCGAT GAGGACGGCA ACCCGCTGCC GGTGGGGGAA
ACCGGGCGTC TGATGACCCG TGGCCCTTAT ACCTTCCGCG GTTATTACCA AAGTCCGGAG
CACAACGCCG CCGCCTTTGA TAAGGACGGT TTCTACTGTT CCGGCGATTT GATCAGCCTG
ACCGAAGATG GCTATGTGAA AGTTGAAGGG CGACAAAAAG ATCAGATTAA CCGTGGTGGG
GAAAAGATCG CTGCGGAAGA AATCGAAAAC CTGTTATTGC GCCATCCGGA AGTGATCAAT
GCGGCGTTGG TGTCGATGCC GGACGAACTG ATGGGTGAAA AAAGCTGCGC CTATATCATC
GCGACCAGCG CGTTGAAGCC GGTAGTGTTG CGCCGCCATT TACGCGGCGA GGGCGTGGCC
GAATTTAAAT TACCCGACCG TTTTATACAG GTTGATACGC TGCCGCTCAC CCCGGTTGGC
AAAGTGGATA AAAAACTGTT GCGCCAACGC CTGGAGGCGC AACAACTGAC TCTGGTCCAG
GGAGAATAA
 
Protein sequence
MTIAFTPWPE ALAHRYRERG YWTDRPLTDI ITRQAKNDAI ALIDPERSLS YRRLNQLSDR 
LAAALQRRGI QSGDTALVQL GNVVEFYVTF FALLKIGVVP VNALFSHQRN ELNAYAVQIK
PALLIADRQH GLFGNDEFLT AFRAEHPSLR VVALRNQDGG EQSLAAWLEE DSSGFVATPS
PADQVAFFQL SGGSTGTPKL IPRTHNDYYY SIRRSVEICH FDVDTRYLCA LPVAHNYPMS
SPGVLGVFYG AGLVVFASDP DAGQCFRLIE QHQINVTALV PPAVTLWLQA IEEWGGCQQL
TSLKLLQVGG AKLGETLAAR IPAEIGCQLQ QVFGMAEGLV NYTRLDDDDQ HILTTQGCPM
SPDDELWVAD EDGNPLPVGE TGRLMTRGPY TFRGYYQSPE HNAAAFDKDG FYCSGDLISL
TEDGYVKVEG RQKDQINRGG EKIAAEEIEN LLLRHPEVIN AALVSMPDEL MGEKSCAYII
ATSALKPVVL RRHLRGEGVA EFKLPDRFIQ VDTLPLTPVG KVDKKLLRQR LEAQQLTLVQ
GE
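
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before use. The sketch below shows one way to do that and to compute GC content from the joined string. The helper names are hypothetical, and the GC figure on this page may have been computed or rounded differently.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a short demonstration.
first_line = clean_sequence(
    "ATGACCATTG CTTTTACCCC CTGGCCTGAG GCGTTAGCCC ATCGCTATCG CGAACGCGGC"
)
print(len(first_line), first_line[:3])  # prints: 60 ATG
print(round(gc_percent(first_line), 1))
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field, and the rounded GC percentage can be compared with the GC content field.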