Gene EcolC_1093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1093 
Symbol 
ID6065932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1190309 
End bp1192969 
Gene Length2661 bp 
Protein Length886 aa 
Translation table11 
GC content54% 
IMG OID641600509 
ProductGCN5-related N-acetyltransferase 
Protein accessionYP_001724087 
Protein GI170019133 
COG category[C] Energy production and conversion
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1042] Acyl-CoA synthetase (NDP forming)
[COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.312736 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.038329 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGC GAGGACTGGA AGCACTACTG CGACCAAAAT CGATAGCGGT AATTGGCGCG 
TCGATGAAAC CCAATCGCGC AGGTTACCTG ATGATGCGTA ACCTGCTGGC GGGAGGCTTT
AACGGACCGG TACTCCCGGT GACGCCAGCC TGGAAAGCGG TGTTGGGTGT GTTGGCCTGG
CCGGATATTG CCAGCTTGCC CTTTACACCC GACCTTGCGG TTTTATGTAC CAATGCCAGC
CGTAATCTTG CTCTTCTGGA AGAGCTCGGC GAGAAAGGCT GTAAAACCTG CATTATTCTT
TCCGCCCCAG CATCGCAACA CGAAGATCTC CGCGCCTGCG CCCTGCGCCA TAACATGCGC
CTGCTTGGAC CAAACAGTCT GGGTTTACTG GCTCCCTGGC AAGGTCTGAA TGCCAGCTTT
TCGCCTGTGC CGATTAAACG CGGCAAGCTG GCGTTTATTT CGCAATCGGC TGCCGTCTCC
AACACCATCC TCGACTGGGC GCAACAGCGT AAGATGGGCT TTTCCTACTT TATTGCGCTC
GGCGACAGCC TGGATATCGA CGTTGATGAA TTGCTTGACT ATCTGGCACG CGACAGTAAA
ACCAGCGCCA TCCTGCTCTA TCTCGAACAG TTAAGCGACG CGCGACGCTT TGTTTCGGCG
GCCCGTAGTG CCTCGCGTAA TAAACCGATT CTGGTGATTA AAAGCGGACG TAGCCCGGCG
GCACAGCGAC TGCTCAACAC GACGGCAGGA ATGGACCCGG CATGGGATGC GGCTATTCAG
CGTGCCGGTT TGTTGCGGGT ACAGGACACC CACGAGCTGT TTTCGGCGGT GGAAACCCTT
AGCCATATGC GCCCGCTACG TGGCGACCGG CTGATGATTA TCAGCAACGG TGCTGCGCCT
GCCGCGCTGG CGCTGGATGC CTTATGGTCA CGCAATGGCA AGCTGGCAAC GCTAAGCGAA
GAAACCTGCC AGAAACTGCG CGATGCACTG CCAGAACATG TGGCAATATC TAACCCGCTC
GATCTACGCG ATGACGCCAG CAGTGAGCAC TATATTAAAA CGCTGGATAT TCTGCTCCAC
AGCCAGGATT TTGACGCGCT GATGGTTATT CATTCGCCCA GCGCCGCTGC TCCCGCAACA
GAAAGCGCGC AAGTATTAAT TGAAGCGGTA AAGCATCATC CCCGCAGCAA ATATGTCTCT
TTGCTGACGA ACTGGTGCGG CGAGCACTCC TCGCAAGAGG CACGACGTTT ATTCAGCGAA
GCCGGGCTGC CGACCTACCG TACCCCGGAA GGAACCATCA CTGCTTTTAT GCATATGGTG
GAGTACCGGC GTAATCAGAA GCAACTACGC GAAACGCCGG CGTTGCCCAG CAATCTGACT
TCCAATACCG CAGAAGCGCA TCTTCTGTTG CAACAGGCGA TTGCCGAAGG GGCTACGTCG
CTCGATACCC ATGAAGTTCA GCCCATCCTG CAAGCGTATG GCATGAACAC GCTCCCTACC
TGGATTGCCA GCGATAGCAC CGAAGCGGTG CATATTGCCG AACAGATTGG TTATCCGGTG
GCGCTGAAAT TGCGTTCGCC GGATATTCCA CATAAATCGG AAGTTCAGGG CGTCATGCTT
TACCTGCGTA CAGCCAATGA AGTCCAGCAA GCGGCGAACG CTATTTTCGA TCGCGTAAAA
ATGGCCTGGC CACAGGCGCG GGTCCACGGC CTGTTGGTGC AAAGTATGGC TAACCGTGCT
GGCGCTCAGG AGTTGCGGGT TGTGGTTGAG CACGATCCGG TTTTCGGGCC GTTGATCATG
CTGGGTGAAG GCGGTGTGGA GTGGCGTCCT GAAGATCAAG CCGTCGTCGC ACTGCCGCCG
CTGAACACGA ACCTGGCCCG CTATCTGGTT ATTCAGGGGA TCAAAAGTAA AAAGATTCGT
GCGCGCAGTG CGCTACGCCC ATTGGATGTT GCAGGCTTGA GCCAGCTTCT GGTGCAGGTT
TCCAACTTGA TTGTCGATTG CCCGGAAATT CAGCGTCTGG ATATTCATCC TTTGCTGGCT
TCTGGCAGTG AATTTACCGC GCTGGATGTC ACGCTGGATA TCTCGCCGTT TGAAGGCGAT
AACGAGAGTC GGCTGGCAGT GCGCCCTTAT CCGCATCAGC TGGAAGAATG GGTAGAATTG
AAAAACGGTG AACGCTGCTT GTTCCGCCCG ATTTTGCCAG AAGATGAGCC ACAACTTCAG
CAATTCATTT CGCGAGTCAC CAAAGAAGAT CTTTATTACC GCTACTTTAG CGAGATCAAC
GAATTTACCC ATGAAGATTT AGCCAACATG ACACAGATCG ACTACGATCG GGAAATGGCG
TTTGTAGCGG TACGACGTAT TGATCAAACG GAAGAGATCC TCGGCGTCAC GCGTGCGATT
TCCGATCCTG ATAACATCGA TGCCGAATTT GCTGTACTGG TTCGCTCGGA TCTCAAAGGG
TTAGGCTTAG GTCGACGCTT AATGGAAAAG TTGATTACCT ATACGCGAGA TCACGGACTA
CAACGTCTGA ATGGTATTAC GATGCCAAAC AATCGTGGCA TGGTGGCGCT AGCCCGCAAG
CTCGGGTTTA ACGTTGATAT CCAGCTCGAA GAGGGGATCG TTGGGCTTAC GCTAAATCTT
GCCCAGCGCG AGGAATCATG A
 
Protein sequence
MSQRGLEALL RPKSIAVIGA SMKPNRAGYL MMRNLLAGGF NGPVLPVTPA WKAVLGVLAW 
PDIASLPFTP DLAVLCTNAS RNLALLEELG EKGCKTCIIL SAPASQHEDL RACALRHNMR
LLGPNSLGLL APWQGLNASF SPVPIKRGKL AFISQSAAVS NTILDWAQQR KMGFSYFIAL
GDSLDIDVDE LLDYLARDSK TSAILLYLEQ LSDARRFVSA ARSASRNKPI LVIKSGRSPA
AQRLLNTTAG MDPAWDAAIQ RAGLLRVQDT HELFSAVETL SHMRPLRGDR LMIISNGAAP
AALALDALWS RNGKLATLSE ETCQKLRDAL PEHVAISNPL DLRDDASSEH YIKTLDILLH
SQDFDALMVI HSPSAAAPAT ESAQVLIEAV KHHPRSKYVS LLTNWCGEHS SQEARRLFSE
AGLPTYRTPE GTITAFMHMV EYRRNQKQLR ETPALPSNLT SNTAEAHLLL QQAIAEGATS
LDTHEVQPIL QAYGMNTLPT WIASDSTEAV HIAEQIGYPV ALKLRSPDIP HKSEVQGVML
YLRTANEVQQ AANAIFDRVK MAWPQARVHG LLVQSMANRA GAQELRVVVE HDPVFGPLIM
LGEGGVEWRP EDQAVVALPP LNTNLARYLV IQGIKSKKIR ARSALRPLDV AGLSQLLVQV
SNLIVDCPEI QRLDIHPLLA SGSEFTALDV TLDISPFEGD NESRLAVRPY PHQLEEWVEL
KNGERCLFRP ILPEDEPQLQ QFISRVTKED LYYRYFSEIN EFTHEDLANM TQIDYDREMA
FVAVRRIDQT EEILGVTRAI SDPDNIDAEF AVLVRSDLKG LGLGRRLMEK LITYTRDHGL
QRLNGITMPN NRGMVALARK LGFNVDIQLE EGIVGLTLNL AQREES