Gene Information
Locus tag | Mlg_1037 |
Symbol | |
ID | 4269778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1185266 |
End bp | 1187929 |
Gene Length | 2664 bp |
Protein Length | 887 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638125789 |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_741880 |
Protein GI | 114320197 |
COG category | [C] Energy production and conversion; [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming); [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins |
TIGRFAM ID | |
| |
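The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values copied from the record for Mlg_1037.
START_BP, END_BP = 1185266, 1187929

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2664
print(protein_length(length_bp))  # 887
```

Both results agree with the Gene Length and Protein Length fields listed in the record.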
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.737751 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.467815 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTCA GAAACCTGGA ACACCTCTTT CAACCCCGGG CGATCGCCCT GATTGGCACC GGTGGTGATA CCGACAGCCT GCTGTTGCGC AACCTGGTCG GTGCCGGTTT CCGTGGTCCG GTGATGCCGG TCATGCCCGG TAAGCGGGCC CTGCATGGGG TGTTGTGTTA CCCGGATGTG GAGAGCCTGC CAATGGTCCC GGATCTGGCG GTCCTGGATG TGCCGTTCAA TCGGGTACCC GACGCCATCC GTGCGCTGGG CGAGAAGGGC ACCCGGGCGG CGGTGCTGGT CGGCAAACCG GCACCGAACC TGAGCCCGGA GGACTACCGG GCCCAGATCC AGGCGGTGCT GGATGCGGCT AAGCCCTTCC TGCTGCGCCT GTTGGGTCCC GGTTGCATCG ACCTGACGGT ACCGCGCACC GGGGTCAACG CCAGTGTCGC TCCCTTCCGG CCCGGGGCTG GGCGCGCGGC CCTGGTCACG GAGTCGGCGG CCGTGGCCAC CCGTGCGTTG GACTGGTGCC AGTCAGAGGG CTTGGGCTTG AGTCATCTGA TCCACCTGGG TGGCGCGATG GACGTGGATA CCGGCGACGT CCTGGATTAT CTGGCTAGCG ATGTGCACAG CCGCGCCATC CTGCTCTACC TCGAGTACAT CGACGATGCC CGGAAGTTCA TGTCGGCGGC GCGGCGCGCG GCGCGCGTCA AGCCGGTGGT GGTGCTCAAG CCGCGCCGCG GCAGCAAGGG GGCGGCCGAG GACGCCGTCT ATGAGGCCGC TTTCCGGCGG GCCGGGCTGG TCCGGGTGGC CGACTTGGAT GAGTTGTTCA ACGCGGTGGA GATTCTCACT TCAGCGAGAA AACCGGGGCG CCAGGGGCCG TTGGCGGTGC TCGGTAACAG CCGCAGTCTG GGCCTGTTGG CGGCGAATGA GTTGGAGGCC TACGGCGGTA CGCTGGAGGG CCTGTCGGAG GAGAGCAGTG AGGGCCTGGC GCTACTGGCC CGCGACCCGG AATCCACCGC CAATCCGCTG GACCTGGGCG GTGACGCCGA CGCCGACGCC TACGGCAAGG CACTGGACAC CCTGGCCGGG GACAAGCGCA TCGGTGGCAC CCTGGTGATC AACCAACCCA ACGAGCTGGT GGACAACACG GCCATTGTCG ATGTGCTGGA GGCGCATGCC CGTAAGAGCC GCCGTGCCGT GCTGGCGGTC TGGTCCGGTC CACGCGCTGG TGCCAGGGGC CGCGAGCGGC TGAAGCAGAC CATGCCGGCC TTCGAGGGTC CGGAAGAGGC GGTGCGCGCC TATATGCGCC TGGTCCAGTA CCAGCGCAAC CAGGAATTGT TGATGGAGAC CCCCACCTCC ATGCCGGAGG CGTTCGAGAC CGATCCGGAG TCGGCCCGGC TGCTGATCAG CGCGGCACTG ACCGCCGGCC GCGATCAGCT CAACGAGTAC CAGGCGCAGC AATTGCTGAC TGCCTACGAG ATTCCCTGTG TCCCCAGCCG GCGGGCCACC ACCCCCGAGG AGGCCGGGCG GGAGGCGGCG GCCATGGAAG GGCCCCTGGC GCTGAAGATC ATGTCGCCGG ATATCGTGCA CAAGTCCGAG GTGCGGGGGG TGGCACTGGA CCTGGAGTCA CCGGAGGCGG TGGTCCAAGA GGCCCACGCC ATGGAGGCGC GGCTGCGTGA GCTCTATCCC GATGCCCGGG TGGATGGCTA CCTGCTGCAG CCGATGACCC CCCGGGAAGG GGCGTTTGAA TTATGTGTCG CTGTTATGCC GGGGGGGCGC TTCGGACCAG TGATCCGGTT CGGACACGGG 
GGCACCGAGG CGCAGGTTAT TGCCGATGTG GCCTACGGCT TGCCGCCGCT CAACATGCAT CTGGCCCGCG AGATGATGAG CCAGACGCGG ATCTACTCCA TGCTCGCCAG CAACCGGTTG CGGGCGGCGG ATCTGGACGC CCTGGCGCTG ACGCTGATCA AGGTCTCGCA GATGGTGATT GACTTCGAGG CCATCGAGTC GTTGGAGATC AACCCCCTCT GGGCTACCGC CGAGGGCGTG GTGGCGCTGG ACTCACGGGT GGTCATCCGC CCGCCCTACA CCGGTGATCC CGCCCGCCGG CTCGCCATCC GGCCCTATCC CAAGGAGCTC GAGGAGGAGC TGAACCTCCC CAACGGCCGG CGTTTCCTGC TGCGGCCGAT CCTGCCGGAG GATGAGCCGG CCCTGACCAA GATGGTGGAA CGGACCCCGC CCGAACAGCT TCGCCTGCGC TTTTTCCGCA CCATCCGCAC GCTGCCCCAT GAGATGGCGG CCCGGCTCAC GCAGATCGAC TATGACCGGG AGATGGCGTT GGCGGTGACC GATCCGGGGT TGCCAGGCCA GGTGGAATTG TGGGGCGTGG TGCGGATCAG CGCGGACCCG GATAACGAGA CGGCCGAATA CGCCATCATG GTGGACAACA ATGTCACCGG CATGGGCCTG GGGCCGCTGT TGATGCGGCG GATCGTGGAG TATGCCCGCC AGCGCGGCAT CCGCGAGGTT TACGGGGAGG TGCTGCGCGA GAACCGGCCT ATGTTACGGA TCAACGAGGC CATGGGCTTT ACGGTAAAGA CGTCGGTGGA TGATCCCAAC GTCATGCATG TCACCCTGCG GTTGGATGGC AACGGTGACG ACACGGCGGG CTGA
|
Protein sequence | MTVRNLEHLF QPRAIALIGT GGDTDSLLLR NLVGAGFRGP VMPVMPGKRA LHGVLCYPDV ESLPMVPDLA VLDVPFNRVP DAIRALGEKG TRAAVLVGKP APNLSPEDYR AQIQAVLDAA KPFLLRLLGP GCIDLTVPRT GVNASVAPFR PGAGRAALVT ESAAVATRAL DWCQSEGLGL SHLIHLGGAM DVDTGDVLDY LASDVHSRAI LLYLEYIDDA RKFMSAARRA ARVKPVVVLK PRRGSKGAAE DAVYEAAFRR AGLVRVADLD ELFNAVEILT SARKPGRQGP LAVLGNSRSL GLLAANELEA YGGTLEGLSE ESSEGLALLA RDPESTANPL DLGGDADADA YGKALDTLAG DKRIGGTLVI NQPNELVDNT AIVDVLEAHA RKSRRAVLAV WSGPRAGARG RERLKQTMPA FEGPEEAVRA YMRLVQYQRN QELLMETPTS MPEAFETDPE SARLLISAAL TAGRDQLNEY QAQQLLTAYE IPCVPSRRAT TPEEAGREAA AMEGPLALKI MSPDIVHKSE VRGVALDLES PEAVVQEAHA MEARLRELYP DARVDGYLLQ PMTPREGAFE LCVAVMPGGR FGPVIRFGHG GTEAQVIADV AYGLPPLNMH LAREMMSQTR IYSMLASNRL RAADLDALAL TLIKVSQMVI DFEAIESLEI NPLWATAEGV VALDSRVVIR PPYTGDPARR LAIRPYPKEL EEELNLPNGR RFLLRPILPE DEPALTKMVE RTPPEQLRLR FFRTIRTLPH EMAARLTQID YDREMALAVT DPGLPGQVEL WGVVRISADP DNETAEYAIM VDNNVTGMGL GPLLMRRIVE YARQRGIREV YGEVLRENRP MLRINEAMGF TVKTSVDDPN VMHVTLRLDG NGDDTAG
|
| |
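The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, scored for GC content, and translated. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which this sketch does not handle). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed in the record.
prefix = clean_sequence("ATGACGGTCA GAAACCTGGA ACACCTCTTT")
print(translate(prefix))  # MTVRNLEHLF
```

The translated prefix matches the first ten residues of the listed protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.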