Gene Mlg_2085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2085 
Symbol 
ID4269404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2362849 
End bp2364045 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content72% 
IMG OID638126841 
Productglutamate N-acetyltransferase 
Protein accessionYP_742917 
Protein GI114321234 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) 
TIGRFAM ID[TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.526394 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC CGCAGCCGCT TCCCGTGCCC GGTGTACGGC TCGGCACGGC CCAGGCGGGT 
ATCAAGCGGG TCGGGCAACG GGACTTGGTG GTCATGGAAC TGGCCGCCGG CAGCCGCTGC
GCCGCGGTCT TTACCCGCAA CCGCTTCTGC GCCGCACCGG TGCACGTGGC GCGCGAGCAC
CTGGCCGCCG GTAGCCCCCG CTGGCTGCTG ATCAATACCG GCAACGCCAA CGCCGGCACC
GGCGAGGCCG GGATGCGCGA CGCCCGCGCC TGCTGCCAGG CCCTGGCCCA GCAGGTGGGC
GTGGCGCCCG AGGCGGTCCT GCCCTTCTCC ACCGGTGTCA TCGGTGAGCC GTTGCCGGTG
GACCGCATCG TCGCCGGGCT GCCGGACGCG GTGGCGGCCC TGAGTGAGGC GGGCTGGCAG
GAGGCCGGTT GGGGCATTCT CACCACCGAC ACCCGGCCCA AGCTGGCCTC AGCCACGGTC
CAGCTGGCGG GCGGGGCGGT GACGCTGACC GGCATGGCCA AGGGCTCGGG CATGATCCGG
CCCAACATGG CCACCATGCT GGCCTTCGTG GCCACCGACG CCGACATCCC GCAGGCCACC
CTGCAAGGGT TGCTGGGCGA GGCGGTCGCC CAATCCTTCA ACCGGGTGAC GGTGGACGGC
GACACCTCCA CCAACGATGC CTGTACGCTG GTGGCCACCG GCCACTCCGG TGTGGCGCTT
GCCGGCGAGG GGGACCGTGA GCGGTTGGCC TCCGCCCTGA CCGACCTCTG CGTCACCCTG
GCACGGGCGA TCGCCCGCGA CGGCGAGGGC GCCACCCGGC TGATCAATGT CGTCGTGGAG
GGCGCGCAGG CGGTCGCCGA GGCCGAGCGG GTGGCCTTCA CCGTGGCCGA GTCGCCCCTG
GTGAAGACGG CCCTGTTCGC CGCCGACCCC AACTGGGGGC GCATCCTGGC CGCGGTGGGC
AGGGCGGGCA TCGATGATCT GGACGTCGCC GGCGTGACCA TCGACCTGGA CGATTACCGG
ATCGCCGAAC AGGGGGGACG GGCCGCCGGG TACGATGAGG CCGAGGCCTC CCGCCGCATC
CAGGGCTCGG AGGTGACCAT CCGCATCGGC CTGGGGCGGG GCGCGGCCGC CGCCACCGTC
TGGACCTGCG ATTTCTCCTA CGACTACGTG CGCATCAACG CGGAGTACCG TACCTGA
 
Protein sequence
MSEPQPLPVP GVRLGTAQAG IKRVGQRDLV VMELAAGSRC AAVFTRNRFC AAPVHVAREH 
LAAGSPRWLL INTGNANAGT GEAGMRDARA CCQALAQQVG VAPEAVLPFS TGVIGEPLPV
DRIVAGLPDA VAALSEAGWQ EAGWGILTTD TRPKLASATV QLAGGAVTLT GMAKGSGMIR
PNMATMLAFV ATDADIPQAT LQGLLGEAVA QSFNRVTVDG DTSTNDACTL VATGHSGVAL
AGEGDRERLA SALTDLCVTL ARAIARDGEG ATRLINVVVE GAQAVAEAER VAFTVAESPL
VKTALFAADP NWGRILAAVG RAGIDDLDVA GVTIDLDDYR IAEQGGRAAG YDEAEASRRI
QGSEVTIRIG LGRGAAAATV WTCDFSYDYV RINAEYRT