Gene EcSMS35_0678 details

Gene Information

Locus tag: EcSMS35_0678
Symbol: lnt
ID: 6146388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 688800
End bp: 690338
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 55%
IMG OID: 641615569
Product: apolipoprotein N-acyltransferase
Protein accession: YP_001742775
Protein GI: 170681081
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0815] Apolipoprotein N-acyltransferase
TIGRFAM ID: [TIGR00546] apolipoprotein N-acyltransferase
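
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 688800
end_bp = 690338
gene_length_bp = 1539
protein_length_aa = 512

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the
# stop codon and does not contribute an amino acid.
codons, remainder = divmod(span, 3)

print(span == gene_length_bp)           # span matches the stated gene length
print(remainder == 0)                   # length is a multiple of three
print(codons - 1 == protein_length_aa)  # codons minus stop matches protein length
```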


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTTG CCTCATTAAT TGAACGCCAG CGCATTCGCC TGCTGCTGGC GTTATTATTC 
GGTGCCTGCG GAACGCTGGC CTTCTCTCCT TACGACGTCT GGCCTGCGGC GATTATTTCG
CTGATGGGGC TTCAGGCGTT GACCTTTAAC CGCCGTCCAC TCCAGTCTGC CGCTATTGGC
TTTTGCTGGG GATTTGGCCT CTTTGGCAGC GGTATTAACT GGGTCTATGT CAGCATCGCG
ACCTTTGGCG GAATGCCTGG CCCGGTTAAC ATCTTCCTGG TGGTACTGCT GGCGGCGTAT
TTGTCGCTGT ATACCGGACT GTTTGCTGGC GTGCTGTCGC GTCTGTGGCC GAAAACCACC
TGGCTGCGCG TGGCGATTGC CGCCCCTGCT CTCTGGCAAG TGACCGAGTT TCTGCGCGGT
TGGGTACTAA CAGGCTTCCC GTGGTTACAG TTCGGCTATA GCCAGATTGA TGGCCCGTTA
AAAGGGCTGG CACCGCTAAT GGGCGTGGAA GCCATTAACT TCCTGCTTAT GATGGTTAGC
GGCCTGCTGG CACTGGCGTT AGTCAAACGC AACTGGCGTC CGCTGGTGGT GGCCGTCGTG
CTGTTTGCCC TACCCTTCCC GCTGCGTTAC ATCCAGTGGT TTACCCCGCA ACCGGAGAAA
ACCATTCAGG TTTCGATGGT TCAGGGCGAT ATTCCGCAAT CGCTGAAATG GGACGGAGAC
CAGCTACTTA ATACGCTGAA GATTTACTAC AACGCAACGG CACCGCTGAT GGGCAAATCA
TCGTTGATTA TCTGGCCGGA GTCGGCGATA ACCGATCTGG AAATTAATCA GCAACCGTTC
CTCAAAGCAC TGGACGGTGA GTTGCGCGAT AAAGGTAGCT CGCTGGTAAC CGGGATTGTC
GACGCGCGTC TCAATAAGCA GAACCGCTAC GATACCTACA ACACCATCAT CACACTGGGT
AAAGGTGCAC CGTACAGCTA CGAATCAGCC GATCGCTATA ACAAAAACCA TCTGGTGCCG
TTTGGCGAGT TTGTCCCGCT CGAGTCGATT CTGCGTCCGT TAGCACCGTT CTTTGATCTG
CCGATGTCGT CGTTCAGCCG TGGGCCATAT ATCCAGCCGC CGCTGTCGGT AAATGGTATT
GAGCTTACTG CGGCTATTTG CTACGAGATC ATTCTCGGCG AGCAAGTGCG CGATAACTTC
CGCCCGGATA CCGACTATCT GCTGACTATC TCCAACGATG CGTGGTTTGG TAAGTCTATT
GGTCCATGGC AGCACTTCCA GATGGCGCGA ATGCGTGCGT TGGAGCTGGC GCGCCCACTG
TTGCGCAGCA CCAACAACGG CATTACGGCG GTGATTGGCC CGCAGGGTGA GATTCAGGCG
ATGATCCCGC AGTTCACCCG CGAGGTGTTA ACCACTAATG TGACGCCGAC CACCGGACTC
ACACCGTATG CACGTACCGG CAACTGGCCG CTGTGGGTAC TGACGGCATT GTTTGGTTTT
GCCGCTGTGT TGATGAGTCT GCGTCAACGA CGTAAATAA
 
Protein sequence
MAFASLIERQ RIRLLLALLF GACGTLAFSP YDVWPAAIIS LMGLQALTFN RRPLQSAAIG 
FCWGFGLFGS GINWVYVSIA TFGGMPGPVN IFLVVLLAAY LSLYTGLFAG VLSRLWPKTT
WLRVAIAAPA LWQVTEFLRG WVLTGFPWLQ FGYSQIDGPL KGLAPLMGVE AINFLLMMVS
GLLALALVKR NWRPLVVAVV LFALPFPLRY IQWFTPQPEK TIQVSMVQGD IPQSLKWDGD
QLLNTLKIYY NATAPLMGKS SLIIWPESAI TDLEINQQPF LKALDGELRD KGSSLVTGIV
DARLNKQNRY DTYNTIITLG KGAPYSYESA DRYNKNHLVP FGEFVPLESI LRPLAPFFDL
PMSSFSRGPY IQPPLSVNGI ELTAAICYEI ILGEQVRDNF RPDTDYLLTI SNDAWFGKSI
GPWQHFQMAR MRALELARPL LRSTNNGITA VIGPQGEIQA MIPQFTREVL TTNVTPTTGL
TPYARTGNWP LWVLTALFGF AAVLMSLRQR RK
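
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is written in blocks separated by whitespace, as above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical and are not part of any tool used to produce this record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
first_line = "ATGGCTTTTG CCTCATTAAT TGAACGCCAG CGCATTCGCC TGCTGCTGGC GTTATTATTC"
last_line = "GCCGCTGTGT TGATGAGTCT GCGTCAACGA CGTAAATAA"

print(translate(first_line))  # MAFASLIERQRIRLLLALLF
print(translate(last_line))   # AAVLMSLRQRRK
```

Both outputs agree with the start and end of the protein sequence listed above. The last line can be translated on its own because the 25 preceding lines hold 60 bases each, so it begins on a codon boundary.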