Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_3069 |
Symbol | |
ID | 5112604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 3338097 |
End bp | 3340760 |
Gene Length | 2664 bp |
Protein Length | 887 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640493263 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_001177784 |
Protein GI | 146312710 |
COG category | [C] Energy production and conversion [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.835722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00746938 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCAGC GAGGGCTAGA AGCGCTACTC CGTCCAAAAT CGATTGCCGT TATTGGCGCA TCCATGAAAC CCGATCGTGC GGGATATTTA ATGATGCGCA ATCTGCTGGC CGGGGGTTTT AATGGCCCGG TGATGCCCGT TACGCCGGCA TATAAAGCGG TTCAAGGTGT GCTTGCATGG CCCGATATAG ACAGTCTACC TTTTGTGCCC GATCTTGCTG TGCTGTGTAC GCATGCCAAA CGCAACATCA CGTTGCTGGA AGCGTTGGGT GAGAAAGGCT GTAAAACCTG CATCATTCTC TCTTCCCCGC CTGAACAGCA GGCAGAACTC ATGGCCTGCG CCAGCCGTTA CCAAATGCGC CTGCTGGGGC CAAACAGTCT GGGATTGCTG GCCCCCTGGC AGGGCTTAAA CGCCAGTTTC TCTCCAGTCC CGATACGTAA AGGTAAGTTA GCGTTCATTT CTCAATCGGC AGCCGTTTCT AACACGATTC TCGATTGGGC CCAGCAGCGT GAAATGGGGT TTTCGTATTT TATCGCGCTG GGTGATAGCC TCGATATCGA TGTGGACGAA CTGCTGGATT TCCTCGCGCG TGACAGTAAA ACCAGCGCCA TACTTCTTTA TCTGGAACAT TTAAGCGACG CGCGACGCTT TGTCTCCGCT GCTCGCAGTG CCGCGCGTAA CAAACCTATT CTGGTCATTA AAAGCGGTCG TAGCCCGGCA GCGCAGCGTC TGCTGCATTC CCATTCAGGT ATGGATCCTG CGTGGGACGC CGCCATTCAG CGTGCCGGTC TGCTGCGTGT GCAGGATACT CACGAACTTT TCTCTGCCGT CGAGACGTTG AGCCATATGC GTCCGTTGCG TGGCGAAAGG CTGATGATCA TCAGCAACGG TGCGGCTCCC GCTGCATTGG CGCTGGATGA GCTGTGGCTA CGCAATGGAA AATTGGCCAC GTTGAGTGAA GAGATGCTTG GCAAACTACG TGAAGTATTT CCGGACAGCG TGACGCCTGG TAATCCACTC GATTTACGAG ACGATGCAAG CAGCGAGCGT TATATCAAAG CCATTACATT ACTGCTGGAT AGTCAGGACT TTGATGCGCT GATGGTGATC CATTCCCCTA GCGCCGCTGC TCCAGGAAGC GAAAGCGCCA GAGCGTTAAT CGATGCTGTC AAAAATCACC CACGCGGAAA ATATGTCACT CTGCTAACCA ACTGGTGCGG CGAGTTTTCC TCGCAAGAGG CACGGCGCTT ATTCAGCGAA GCGGGACTCC CGACGTATCG CACGCCAGAA GGGACGATTA CCGCTTTTAT GCACATGGTT GAGTATCGTC GTAACCAAAA ACAACTGCGT GAAACGCCCG CGTTACCTGA CAATCTTACG GCAAACGCGA CCGCGGCTCA TAATCTGTTG CGTTCGGCAA TCGAAGACGG CGCACGTGCG CTTGATACGC ATGAGGTCCA GCCGATCCTT GATGCCTACA GCATACACAC TCTGTCGACC TGGATTGCTG GGGATAGCGC CGAGGCTGTT CACATTGCCG AACAGATTGG TTATCCCGTT GCATTGAAAC TGCGCTCCCC TGATATTCCG CATAAGTCGG ATGTACAAGG GGTGATGCTT TACCTGCGTA CTGCAGCGGA AGTGCAACAG GCCGCTGAAG CGATTTTTGA TCGCGTCAAA ATGGCCTGGC CTCAGGCGAG GGTTCATGGG CTATTGGTAC AAAGCATGGC TAATCGGGCC GGAGCGCAAG AACTGAGAGT TGTTGTCGAG CAGGATCCTG TATTTGGCCC GCTGATTATG TTGGGCGAAG GGGGCGTAGA GTGGCACCCA GAAGAACAGG CTGTCGTGGC ATTGCCTCCG CTGAATATGA ACCTGGCGCG CTATCTGGTA ATACAGGCTA TCAAGAGTAA AAAAATCCGT GGCCGCAGTG CTCTTCAGCC TCTTGATATT GCCGGGTTAA GCCAATTTTT AGTGCAAGTA TCAAACCTGA TTGTCGATTG TGCAGAAATA CAGCGATTGG ATATTCATCC GCTTTTGGCT TCAGGTAACG AGTTTACGGC TTTGGATGTG ACGCTGGATA TCGCTCCATT TGACGGCGAC CGGGAGAATC GGCTGGCCAT CCGTCCCTAT CCTTTGCACC TGGAAGAGTG GGTTGAGTTA AAGAACGGGG AAAGCGTGCT ATTTCGCCCT ATCCTTCCTG AAGACGAGCC GCAGCTGCGA GTATTTATCG AACAGGTCAC TAAAGAAGAT TTGTATTATC GTTACTTTAG CGAGATCAGC GAATTTACCC ATGAAGATTT AGCCAATATG ACCCAGATCG ACTACGATCG GGAAATGGCT TTTGTGGCCG TTCGTCATCA TGATAGCGGC GATGAGATCC TGGGCGTGAC GCGCGCGATC TCCGATCCTG ACAATGTAGA TGCGGAATTT GCGGTGCTTG TTCGGTCTGA TCTGAAAGGT CTGGGTCTGG GACGGCGACT GCTAGAAAAA CTGATCAGTT ACACCCGCGA TCACGGATTG TTGTGCCTGA ATGGCATTAC GATGCCCCAT AACCGCGGCA TGATTACCCT GGCGCGTAAA CTTGGATTTG ACGTCGATAT TCAGTTGGAC GAAGGGATTG TTGCTTTGTC GCTTAGTCTG ACACCGCCGT TGAGTCGGAA GTAA
|
Protein sequence | MSQRGLEALL RPKSIAVIGA SMKPDRAGYL MMRNLLAGGF NGPVMPVTPA YKAVQGVLAW PDIDSLPFVP DLAVLCTHAK RNITLLEALG EKGCKTCIIL SSPPEQQAEL MACASRYQMR LLGPNSLGLL APWQGLNASF SPVPIRKGKL AFISQSAAVS NTILDWAQQR EMGFSYFIAL GDSLDIDVDE LLDFLARDSK TSAILLYLEH LSDARRFVSA ARSAARNKPI LVIKSGRSPA AQRLLHSHSG MDPAWDAAIQ RAGLLRVQDT HELFSAVETL SHMRPLRGER LMIISNGAAP AALALDELWL RNGKLATLSE EMLGKLREVF PDSVTPGNPL DLRDDASSER YIKAITLLLD SQDFDALMVI HSPSAAAPGS ESARALIDAV KNHPRGKYVT LLTNWCGEFS SQEARRLFSE AGLPTYRTPE GTITAFMHMV EYRRNQKQLR ETPALPDNLT ANATAAHNLL RSAIEDGARA LDTHEVQPIL DAYSIHTLST WIAGDSAEAV HIAEQIGYPV ALKLRSPDIP HKSDVQGVML YLRTAAEVQQ AAEAIFDRVK MAWPQARVHG LLVQSMANRA GAQELRVVVE QDPVFGPLIM LGEGGVEWHP EEQAVVALPP LNMNLARYLV IQAIKSKKIR GRSALQPLDI AGLSQFLVQV SNLIVDCAEI QRLDIHPLLA SGNEFTALDV TLDIAPFDGD RENRLAIRPY PLHLEEWVEL KNGESVLFRP ILPEDEPQLR VFIEQVTKED LYYRYFSEIS EFTHEDLANM TQIDYDREMA FVAVRHHDSG DEILGVTRAI SDPDNVDAEF AVLVRSDLKG LGLGRRLLEK LISYTRDHGL LCLNGITMPH NRGMITLARK LGFDVDIQLD EGIVALSLSL TPPLSRK
|
| |