Gene Information |
Locus tag | EcolC_1093 |
Symbol | |
ID | 6065932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1190309 |
End bp | 1192969 |
Gene Length | 2661 bp |
Protein Length | 886 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641600509 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_001724087 |
Protein GI | 170019133 |
COG category | [C] Energy production and conversion [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins |
TIGRFAM ID | |
|
|
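The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are inclusive 1-based coordinates and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for EcolC_1093.
length = gene_length_bp(1190309, 1192969)
print(length)                     # 2661
print(protein_length_aa(length))  # 886
```

Both results agree with the Gene Length (2661 bp) and Protein Length (886 aa) entries listed in the table.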
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.312736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.038329 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAGC GAGGACTGGA AGCACTACTG CGACCAAAAT CGATAGCGGT AATTGGCGCG TCGATGAAAC CCAATCGCGC AGGTTACCTG ATGATGCGTA ACCTGCTGGC GGGAGGCTTT AACGGACCGG TACTCCCGGT GACGCCAGCC TGGAAAGCGG TGTTGGGTGT GTTGGCCTGG CCGGATATTG CCAGCTTGCC CTTTACACCC GACCTTGCGG TTTTATGTAC CAATGCCAGC CGTAATCTTG CTCTTCTGGA AGAGCTCGGC GAGAAAGGCT GTAAAACCTG CATTATTCTT TCCGCCCCAG CATCGCAACA CGAAGATCTC CGCGCCTGCG CCCTGCGCCA TAACATGCGC CTGCTTGGAC CAAACAGTCT GGGTTTACTG GCTCCCTGGC AAGGTCTGAA TGCCAGCTTT TCGCCTGTGC CGATTAAACG CGGCAAGCTG GCGTTTATTT CGCAATCGGC TGCCGTCTCC AACACCATCC TCGACTGGGC GCAACAGCGT AAGATGGGCT TTTCCTACTT TATTGCGCTC GGCGACAGCC TGGATATCGA CGTTGATGAA TTGCTTGACT ATCTGGCACG CGACAGTAAA ACCAGCGCCA TCCTGCTCTA TCTCGAACAG TTAAGCGACG CGCGACGCTT TGTTTCGGCG GCCCGTAGTG CCTCGCGTAA TAAACCGATT CTGGTGATTA AAAGCGGACG TAGCCCGGCG GCACAGCGAC TGCTCAACAC GACGGCAGGA ATGGACCCGG CATGGGATGC GGCTATTCAG CGTGCCGGTT TGTTGCGGGT ACAGGACACC CACGAGCTGT TTTCGGCGGT GGAAACCCTT AGCCATATGC GCCCGCTACG TGGCGACCGG CTGATGATTA TCAGCAACGG TGCTGCGCCT GCCGCGCTGG CGCTGGATGC CTTATGGTCA CGCAATGGCA AGCTGGCAAC GCTAAGCGAA GAAACCTGCC AGAAACTGCG CGATGCACTG CCAGAACATG TGGCAATATC TAACCCGCTC GATCTACGCG ATGACGCCAG CAGTGAGCAC TATATTAAAA CGCTGGATAT TCTGCTCCAC AGCCAGGATT TTGACGCGCT GATGGTTATT CATTCGCCCA GCGCCGCTGC TCCCGCAACA GAAAGCGCGC AAGTATTAAT TGAAGCGGTA AAGCATCATC CCCGCAGCAA ATATGTCTCT TTGCTGACGA ACTGGTGCGG CGAGCACTCC TCGCAAGAGG CACGACGTTT ATTCAGCGAA GCCGGGCTGC CGACCTACCG TACCCCGGAA GGAACCATCA CTGCTTTTAT GCATATGGTG GAGTACCGGC GTAATCAGAA GCAACTACGC GAAACGCCGG CGTTGCCCAG CAATCTGACT TCCAATACCG CAGAAGCGCA TCTTCTGTTG CAACAGGCGA TTGCCGAAGG GGCTACGTCG CTCGATACCC ATGAAGTTCA GCCCATCCTG CAAGCGTATG GCATGAACAC GCTCCCTACC TGGATTGCCA GCGATAGCAC CGAAGCGGTG CATATTGCCG AACAGATTGG TTATCCGGTG GCGCTGAAAT TGCGTTCGCC GGATATTCCA CATAAATCGG AAGTTCAGGG CGTCATGCTT TACCTGCGTA CAGCCAATGA AGTCCAGCAA GCGGCGAACG CTATTTTCGA TCGCGTAAAA ATGGCCTGGC CACAGGCGCG GGTCCACGGC CTGTTGGTGC AAAGTATGGC TAACCGTGCT GGCGCTCAGG AGTTGCGGGT TGTGGTTGAG CACGATCCGG TTTTCGGGCC GTTGATCATG 
CTGGGTGAAG GCGGTGTGGA GTGGCGTCCT GAAGATCAAG CCGTCGTCGC ACTGCCGCCG CTGAACACGA ACCTGGCCCG CTATCTGGTT ATTCAGGGGA TCAAAAGTAA AAAGATTCGT GCGCGCAGTG CGCTACGCCC ATTGGATGTT GCAGGCTTGA GCCAGCTTCT GGTGCAGGTT TCCAACTTGA TTGTCGATTG CCCGGAAATT CAGCGTCTGG ATATTCATCC TTTGCTGGCT TCTGGCAGTG AATTTACCGC GCTGGATGTC ACGCTGGATA TCTCGCCGTT TGAAGGCGAT AACGAGAGTC GGCTGGCAGT GCGCCCTTAT CCGCATCAGC TGGAAGAATG GGTAGAATTG AAAAACGGTG AACGCTGCTT GTTCCGCCCG ATTTTGCCAG AAGATGAGCC ACAACTTCAG CAATTCATTT CGCGAGTCAC CAAAGAAGAT CTTTATTACC GCTACTTTAG CGAGATCAAC GAATTTACCC ATGAAGATTT AGCCAACATG ACACAGATCG ACTACGATCG GGAAATGGCG TTTGTAGCGG TACGACGTAT TGATCAAACG GAAGAGATCC TCGGCGTCAC GCGTGCGATT TCCGATCCTG ATAACATCGA TGCCGAATTT GCTGTACTGG TTCGCTCGGA TCTCAAAGGG TTAGGCTTAG GTCGACGCTT AATGGAAAAG TTGATTACCT ATACGCGAGA TCACGGACTA CAACGTCTGA ATGGTATTAC GATGCCAAAC AATCGTGGCA TGGTGGCGCT AGCCCGCAAG CTCGGGTTTA ACGTTGATAT CCAGCTCGAA GAGGGGATCG TTGGGCTTAC GCTAAATCTT GCCCAGCGCG AGGAATCATG A
|
Protein sequence | MSQRGLEALL RPKSIAVIGA SMKPNRAGYL MMRNLLAGGF NGPVLPVTPA WKAVLGVLAW PDIASLPFTP DLAVLCTNAS RNLALLEELG EKGCKTCIIL SAPASQHEDL RACALRHNMR LLGPNSLGLL APWQGLNASF SPVPIKRGKL AFISQSAAVS NTILDWAQQR KMGFSYFIAL GDSLDIDVDE LLDYLARDSK TSAILLYLEQ LSDARRFVSA ARSASRNKPI LVIKSGRSPA AQRLLNTTAG MDPAWDAAIQ RAGLLRVQDT HELFSAVETL SHMRPLRGDR LMIISNGAAP AALALDALWS RNGKLATLSE ETCQKLRDAL PEHVAISNPL DLRDDASSEH YIKTLDILLH SQDFDALMVI HSPSAAAPAT ESAQVLIEAV KHHPRSKYVS LLTNWCGEHS SQEARRLFSE AGLPTYRTPE GTITAFMHMV EYRRNQKQLR ETPALPSNLT SNTAEAHLLL QQAIAEGATS LDTHEVQPIL QAYGMNTLPT WIASDSTEAV HIAEQIGYPV ALKLRSPDIP HKSEVQGVML YLRTANEVQQ AANAIFDRVK MAWPQARVHG LLVQSMANRA GAQELRVVVE HDPVFGPLIM LGEGGVEWRP EDQAVVALPP LNTNLARYLV IQGIKSKKIR ARSALRPLDV AGLSQLLVQV SNLIVDCPEI QRLDIHPLLA SGSEFTALDV TLDISPFEGD NESRLAVRPY PHQLEEWVEL KNGERCLFRP ILPEDEPQLQ QFISRVTKED LYYRYFSEIN EFTHEDLANM TQIDYDREMA FVAVRRIDQT EEILGVTRAI SDPDNIDAEF AVLVRSDLKG LGLGRRLMEK LITYTRDHGL QRLNGITMPN NRGMVALARK LGFNVDIQLE EGIVGLTLNL AQREES