Gene ECH74115_0749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0749 
Symbollnt 
ID6969582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp772779 
End bp774317 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content56% 
IMG OID643384779 
Productapolipoprotein N-acyltransferase 
Protein accessionYP_002269292 
Protein GI209398690 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0815] Apolipoprotein N-acyltransferase 
TIGRFAM ID[TIGR00546] apolipoprotein N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.209793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTTG CCTCATTAAT TGAACGCCAG CGCATTCGCC TGCTGCTGGC GTTATTATTC 
GGTGCCTGCG GAACGCTGGC CTTCTCTCCT TACGACGTCT GGCCTGCGGC GATTATTTCG
CTGATGGGGC TTCAGGCGTT GACCTTTAAC CGCCGTCCAC TCCAGTCTGC CGCTATTGGC
TTTTGCTGGG GATTTGGCCT CTTTGGCAGC GGTATTAACT GGGTCTATGT CAGCATCGCG
ACCTTTGGCG GAATGCCTGG CCCGGTTAAC ATCTTCCTGG TGGTACTGCT GGCGGCGTAT
TTGTCGCTGT ATACCGGACT GTTTGCTGGC GTGCTGTCGC GCCTGTGGCC GAAAACCACC
TGGCTGCGCG TGGCGATTGC CGCCCCTGCT CTCTGGCAAG TGACCGAGTT TCTGCGCGGT
TGGGTACTGA CAGGCTTCCC GTGGTTACAG TTCGGCTATA GCCAGATTGA TGGCCCGTTA
AAAGGGCTGG CACCGCTAAT GGGCGTGGAA GCCATTAACT TCCTGCTTAT GATGGTTAGC
GGCCTGCTGG CACTGGCGTT AGTCAAACGC AACTGGCGTC CGCTGGTGGT GGCCGTCGTG
CTGTTTGCCC TTCCCTTCCC GCTGCGTTAC ATCCGGTGGT TTACCCCACA ACCGGAGAAA
ACCATTCAGG TTTCGATGGT TCAGGGCGAT ATTCCGCAAT CGCTGAAATG GGACGAAGGC
CAGCTTCTTA ATACGCTGAA GATTTACTAC AACGCAACGG CACCGCTGAT GGGCAAATCA
TCGTTGATTA TCTGGCCGGA GTCGGCGATA ACCGATCTGG AAATTAATCA GCAACCGTTC
CTCAAAGCAC TGGACGGTGA GTTGCGTGAT AAAGGTAGCT CGCTGGTGAC CGGGATTGTT
GACGCGCGTC TTAATAAGCA GAACCGTTAC GATACCTACA ACACCATCAT CACACTGGGT
AAAGGTGCAC CGTACAGCTA CGAATCAGCC GATCGCTATA ACAAAAACCA TCTGGTGCCG
TTTGGCGAGT TTGTCCCGCT GGAGTCGATT CTGCGTCCGT TAGCACCGTT CTTTGATCTG
CCGATGTCGT CGTTCAGCCG TGGGCCATAT ATCCAGCCGC CGCTGTCGGC AAATGGTATT
GAGCTTACTG CGGCTATTTG CTACGAGATC ATTCTCGGCG AGCAAGTGCG CGATAACTTC
CGCCCGGATA CCGACTATCT GCTGACTATC TCCAACGATG CGTGGTTTGG TAAGTCTATT
GGTCCATGGC AGCACTTCCA GATGGCGCGA ATGCGTGCGC TGGAGCTGGC GCGCCCACTG
TTGCGCAGCA CCAACAACGG CATTACGGCG GTGATTGGCC CGCAGGGTGA GATTCAGGCG
ATGATCCCGC AGTTCACCCG CGAGGTGTTA ACCACTAACG TGACGCCGAC CACCGGACTC
ACACCATACG CACGTACCGG CAACTGGCCG CTGTGGGTGC TGACGGCACT GTTTGGTTTT
GCTGCTGTGT TGATGAGTCT GCGTCAGCGA CGTAAATAA
 
Protein sequence
MAFASLIERQ RIRLLLALLF GACGTLAFSP YDVWPAAIIS LMGLQALTFN RRPLQSAAIG 
FCWGFGLFGS GINWVYVSIA TFGGMPGPVN IFLVVLLAAY LSLYTGLFAG VLSRLWPKTT
WLRVAIAAPA LWQVTEFLRG WVLTGFPWLQ FGYSQIDGPL KGLAPLMGVE AINFLLMMVS
GLLALALVKR NWRPLVVAVV LFALPFPLRY IRWFTPQPEK TIQVSMVQGD IPQSLKWDEG
QLLNTLKIYY NATAPLMGKS SLIIWPESAI TDLEINQQPF LKALDGELRD KGSSLVTGIV
DARLNKQNRY DTYNTIITLG KGAPYSYESA DRYNKNHLVP FGEFVPLESI LRPLAPFFDL
PMSSFSRGPY IQPPLSANGI ELTAAICYEI ILGEQVRDNF RPDTDYLLTI SNDAWFGKSI
GPWQHFQMAR MRALELARPL LRSTNNGITA VIGPQGEIQA MIPQFTREVL TTNVTPTTGL
TPYARTGNWP LWVLTALFGF AAVLMSLRQR RK