Gene Mlg_1037 details

Gene Information

Locus tag: Mlg_1037
Symbol:
ID: 4269778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1185266
End bp: 1187929
Gene Length: 2664 bp
Protein Length: 887 aa
Translation table: 11
GC content: 67%
IMG OID: 638125789
Product: GCN5-related N-acetyltransferase
Protein accession: YP_741880
Protein GI: 114320197
COG category: [C] Energy production and conversion
    [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1042] Acyl-CoA synthetase (NDP forming)
    [COG1670] Acetyltransferases, including N-acetylases of ribosomal proteins
TIGRFAM ID:
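
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above.
length = gene_length(1185266, 1187929)
print(length)                  # 2664
print(protein_length(length))  # 887
```

Both results agree with the Gene Length and Protein Length fields of this record.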


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.737751
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.467815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTCA GAAACCTGGA ACACCTCTTT CAACCCCGGG CGATCGCCCT GATTGGCACC 
GGTGGTGATA CCGACAGCCT GCTGTTGCGC AACCTGGTCG GTGCCGGTTT CCGTGGTCCG
GTGATGCCGG TCATGCCCGG TAAGCGGGCC CTGCATGGGG TGTTGTGTTA CCCGGATGTG
GAGAGCCTGC CAATGGTCCC GGATCTGGCG GTCCTGGATG TGCCGTTCAA TCGGGTACCC
GACGCCATCC GTGCGCTGGG CGAGAAGGGC ACCCGGGCGG CGGTGCTGGT CGGCAAACCG
GCACCGAACC TGAGCCCGGA GGACTACCGG GCCCAGATCC AGGCGGTGCT GGATGCGGCT
AAGCCCTTCC TGCTGCGCCT GTTGGGTCCC GGTTGCATCG ACCTGACGGT ACCGCGCACC
GGGGTCAACG CCAGTGTCGC TCCCTTCCGG CCCGGGGCTG GGCGCGCGGC CCTGGTCACG
GAGTCGGCGG CCGTGGCCAC CCGTGCGTTG GACTGGTGCC AGTCAGAGGG CTTGGGCTTG
AGTCATCTGA TCCACCTGGG TGGCGCGATG GACGTGGATA CCGGCGACGT CCTGGATTAT
CTGGCTAGCG ATGTGCACAG CCGCGCCATC CTGCTCTACC TCGAGTACAT CGACGATGCC
CGGAAGTTCA TGTCGGCGGC GCGGCGCGCG GCGCGCGTCA AGCCGGTGGT GGTGCTCAAG
CCGCGCCGCG GCAGCAAGGG GGCGGCCGAG GACGCCGTCT ATGAGGCCGC TTTCCGGCGG
GCCGGGCTGG TCCGGGTGGC CGACTTGGAT GAGTTGTTCA ACGCGGTGGA GATTCTCACT
TCAGCGAGAA AACCGGGGCG CCAGGGGCCG TTGGCGGTGC TCGGTAACAG CCGCAGTCTG
GGCCTGTTGG CGGCGAATGA GTTGGAGGCC TACGGCGGTA CGCTGGAGGG CCTGTCGGAG
GAGAGCAGTG AGGGCCTGGC GCTACTGGCC CGCGACCCGG AATCCACCGC CAATCCGCTG
GACCTGGGCG GTGACGCCGA CGCCGACGCC TACGGCAAGG CACTGGACAC CCTGGCCGGG
GACAAGCGCA TCGGTGGCAC CCTGGTGATC AACCAACCCA ACGAGCTGGT GGACAACACG
GCCATTGTCG ATGTGCTGGA GGCGCATGCC CGTAAGAGCC GCCGTGCCGT GCTGGCGGTC
TGGTCCGGTC CACGCGCTGG TGCCAGGGGC CGCGAGCGGC TGAAGCAGAC CATGCCGGCC
TTCGAGGGTC CGGAAGAGGC GGTGCGCGCC TATATGCGCC TGGTCCAGTA CCAGCGCAAC
CAGGAATTGT TGATGGAGAC CCCCACCTCC ATGCCGGAGG CGTTCGAGAC CGATCCGGAG
TCGGCCCGGC TGCTGATCAG CGCGGCACTG ACCGCCGGCC GCGATCAGCT CAACGAGTAC
CAGGCGCAGC AATTGCTGAC TGCCTACGAG ATTCCCTGTG TCCCCAGCCG GCGGGCCACC
ACCCCCGAGG AGGCCGGGCG GGAGGCGGCG GCCATGGAAG GGCCCCTGGC GCTGAAGATC
ATGTCGCCGG ATATCGTGCA CAAGTCCGAG GTGCGGGGGG TGGCACTGGA CCTGGAGTCA
CCGGAGGCGG TGGTCCAAGA GGCCCACGCC ATGGAGGCGC GGCTGCGTGA GCTCTATCCC
GATGCCCGGG TGGATGGCTA CCTGCTGCAG CCGATGACCC CCCGGGAAGG GGCGTTTGAA
TTATGTGTCG CTGTTATGCC GGGGGGGCGC TTCGGACCAG TGATCCGGTT CGGACACGGG
GGCACCGAGG CGCAGGTTAT TGCCGATGTG GCCTACGGCT TGCCGCCGCT CAACATGCAT
CTGGCCCGCG AGATGATGAG CCAGACGCGG ATCTACTCCA TGCTCGCCAG CAACCGGTTG
CGGGCGGCGG ATCTGGACGC CCTGGCGCTG ACGCTGATCA AGGTCTCGCA GATGGTGATT
GACTTCGAGG CCATCGAGTC GTTGGAGATC AACCCCCTCT GGGCTACCGC CGAGGGCGTG
GTGGCGCTGG ACTCACGGGT GGTCATCCGC CCGCCCTACA CCGGTGATCC CGCCCGCCGG
CTCGCCATCC GGCCCTATCC CAAGGAGCTC GAGGAGGAGC TGAACCTCCC CAACGGCCGG
CGTTTCCTGC TGCGGCCGAT CCTGCCGGAG GATGAGCCGG CCCTGACCAA GATGGTGGAA
CGGACCCCGC CCGAACAGCT TCGCCTGCGC TTTTTCCGCA CCATCCGCAC GCTGCCCCAT
GAGATGGCGG CCCGGCTCAC GCAGATCGAC TATGACCGGG AGATGGCGTT GGCGGTGACC
GATCCGGGGT TGCCAGGCCA GGTGGAATTG TGGGGCGTGG TGCGGATCAG CGCGGACCCG
GATAACGAGA CGGCCGAATA CGCCATCATG GTGGACAACA ATGTCACCGG CATGGGCCTG
GGGCCGCTGT TGATGCGGCG GATCGTGGAG TATGCCCGCC AGCGCGGCAT CCGCGAGGTT
TACGGGGAGG TGCTGCGCGA GAACCGGCCT ATGTTACGGA TCAACGAGGC CATGGGCTTT
ACGGTAAAGA CGTCGGTGGA TGATCCCAAC GTCATGCATG TCACCCTGCG GTTGGATGGC
AACGGTGACG ACACGGCGGG CTGA
 
Protein sequence
MTVRNLEHLF QPRAIALIGT GGDTDSLLLR NLVGAGFRGP VMPVMPGKRA LHGVLCYPDV 
ESLPMVPDLA VLDVPFNRVP DAIRALGEKG TRAAVLVGKP APNLSPEDYR AQIQAVLDAA
KPFLLRLLGP GCIDLTVPRT GVNASVAPFR PGAGRAALVT ESAAVATRAL DWCQSEGLGL
SHLIHLGGAM DVDTGDVLDY LASDVHSRAI LLYLEYIDDA RKFMSAARRA ARVKPVVVLK
PRRGSKGAAE DAVYEAAFRR AGLVRVADLD ELFNAVEILT SARKPGRQGP LAVLGNSRSL
GLLAANELEA YGGTLEGLSE ESSEGLALLA RDPESTANPL DLGGDADADA YGKALDTLAG
DKRIGGTLVI NQPNELVDNT AIVDVLEAHA RKSRRAVLAV WSGPRAGARG RERLKQTMPA
FEGPEEAVRA YMRLVQYQRN QELLMETPTS MPEAFETDPE SARLLISAAL TAGRDQLNEY
QAQQLLTAYE IPCVPSRRAT TPEEAGREAA AMEGPLALKI MSPDIVHKSE VRGVALDLES
PEAVVQEAHA MEARLRELYP DARVDGYLLQ PMTPREGAFE LCVAVMPGGR FGPVIRFGHG
GTEAQVIADV AYGLPPLNMH LAREMMSQTR IYSMLASNRL RAADLDALAL TLIKVSQMVI
DFEAIESLEI NPLWATAEGV VALDSRVVIR PPYTGDPARR LAIRPYPKEL EEELNLPNGR
RFLLRPILPE DEPALTKMVE RTPPEQLRLR FFRTIRTLPH EMAARLTQID YDREMALAVT
DPGLPGQVEL WGVVRISADP DNETAEYAIM VDNNVTGMGL GPLLMRRIVE YARQRGIREV
YGEVLRENRP MLRINEAMGF TVKTSVDDPN VMHVTLRLDG NGDDTAG
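
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how such text could be cleaned, checked for GC content, and translated. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which is not modeled here, since this gene begins with ATG). All names in the code are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGACGGTCA GAAACCTGGA ACACCTCTTT CAACCCCGGG CGATCGCCCT GATTGGCACC"
print(translate(first_line))  # MTVRNLEHLFQPRAIALIGT
```

The printed residues match the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field of the record.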