Gene EcolC_2988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2988 
Symbollnt 
ID6065850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3265489 
End bp3267027 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content56% 
IMG OID641602405 
Productapolipoprotein N-acyltransferase 
Protein accessionYP_001725940 
Protein GI170020986 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0815] Apolipoprotein N-acyltransferase 
TIGRFAM ID[TIGR00546] apolipoprotein N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTTG CCTCATTAAT TGAACGCCAG CGCATTCGCC TGCTGCTGGC GTTATTATTC 
GGTGCCTGCG GAACGCTGGC CTTCTCTCCT TACGACGTCT GGCCTGCGGC GATTATTTCG
CTGATGGGGC TTCAGGCGTT GACCTTTAAC CGTCGCCCAC TCCAGTCTGC CGCTATTGGC
TTTTGCTGGG GATTTGGCCT CTTTGGCAGC GGTATTAACT GGGTCTATGT CAGCATCGCG
ACCTTTGGCG GGATGCCTGG CCCGGTTAAC ATCTTCCTGG TGGTGCTGCT GGCGGCGTAT
TTGTCGCTGT ATACCGGACT GTTTGCTGGC GTGCTGTCGC GTCTGTGGCC GAAAACCACC
TGGCTGCGCG TGGCGATTGC TGCCCCTGCC CTCTGGCAAG TGACCGAGTT TCTGCGCGGT
TGGGTACTGA CCGGCTTCCC GTGGTTACAG TTCGGCTATA GCCAGATTGA TGGCCCGTTA
AAAGGGCTGG CACCGATAAT GGGCGTGGAA GCCATTAACT TCCTGCTGAT GATGGTTAGT
GGCCTGCTGG CACTGGCGTT GGTCAAACGC AACTGGCGTC CGCTGGTGGT GGCCGTCGTG
CTGTTTGCCC TACCCTTCCC GCTGCGTTAC ATCCAATGGT TTACCCCGCA ACCGGAGAAA
ACCATTCAGG TTTCGATGGT TCAGGGCGAT ATCCCACAAT CGCTGAAATG GGACGAAGGC
CAGCTTCTTA ATACGCTGAA GATTTACTAC AACGCAACGG CACCGCTGAT GGGCAAATCA
TCGTTGATTA TCTGGCCGGA GTCGGCGATA ACCGATCTGG AAATTAATCA GCAACCGTTC
CTCAAAGCAC TGGACGGTGA GTTGCGTGAT AAAGGTAGCT CGCTGGTCAC CGGGATTGTC
GACGCGCGTC TCAATAAGCA GAACCGCTAC GATACCTACA ACACCATCAT CACGCTGGGT
AAAGGTGCAC CGTACAGCTA CGAATCAGCC GATCGCTATA ACAAAAACCA TCTGGTGCCG
TTTGGCGAGT TTGTCCCGCT GGAGTCGATT CTGCGTCCGT TAGCACCGTT CTTTGATCTG
CCGATGTCGT CGTTCAGCCG TGGGCCATAT ATCCAGCCGC CGCTGTCGGT AAATGGTATT
GAGCTTACTG CGGCTATTTG CTACGAGATC ATTCTCGGCG AGCAAGTGCG CGATAACTTC
CGCCCGGATA CCGACTACCT GCTGACTATC TCCAACGATG CATGGTTTGG TAAGTCTATT
GGTCCATGGC AGCACTTCCA GATGGCGCGA ATGCGTGCGC TGGAGCTGGC GCGCCCATTG
CTGCGTAGCA CCAACAACGG CATTACGGCG GTGATTGGCC CGCAGGGTGA GATTCAGGCG
ATGATCCCGC AGTTCACCCG CGAGGTGTTG ACCACTAACG TGACGCCGAC GACCGGACTC
ACACCGTACG CACGAACCGG CAACTGGCCG CTGTGGGTGC TGACGGCGCT GTTTGGTTTT
GCCGCTGTGC TGATGAGTCT GCGAGCACGT AAACGTTGA
 
Protein sequence
MAFASLIERQ RIRLLLALLF GACGTLAFSP YDVWPAAIIS LMGLQALTFN RRPLQSAAIG 
FCWGFGLFGS GINWVYVSIA TFGGMPGPVN IFLVVLLAAY LSLYTGLFAG VLSRLWPKTT
WLRVAIAAPA LWQVTEFLRG WVLTGFPWLQ FGYSQIDGPL KGLAPIMGVE AINFLLMMVS
GLLALALVKR NWRPLVVAVV LFALPFPLRY IQWFTPQPEK TIQVSMVQGD IPQSLKWDEG
QLLNTLKIYY NATAPLMGKS SLIIWPESAI TDLEINQQPF LKALDGELRD KGSSLVTGIV
DARLNKQNRY DTYNTIITLG KGAPYSYESA DRYNKNHLVP FGEFVPLESI LRPLAPFFDL
PMSSFSRGPY IQPPLSVNGI ELTAAICYEI ILGEQVRDNF RPDTDYLLTI SNDAWFGKSI
GPWQHFQMAR MRALELARPL LRSTNNGITA VIGPQGEIQA MIPQFTREVL TTNVTPTTGL
TPYARTGNWP LWVLTALFGF AAVLMSLRAR KR