Gene Mlg_0204 details

Gene Information

Locus tag: Mlg_0204
Symbol:
ID: 4269650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 237308
End bp: 238369
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 69%
IMG OID: 638124928
Product: succinylglutamate desuccinylase/aspartoacylase
Protein accession: YP_741049
Protein GI: 114319366
COG category: [R] General function prediction only
COG ID: [COG3608] Predicted deacylase
TIGRFAM ID:
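
The length fields in this record agree with each other if the start and end coordinates are read as 1-based and inclusive. That reading is an assumption here, not something the record states. A minimal Python sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the record.
start_bp = 237308
end_bp = 238369

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Every three bases form one codon; the final stop codon is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1062
print(protein_length)  # 353
```

Both results match the Gene Length and Protein Length fields of the record.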


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGCC GGGCGCCCTT CCGCATCCTG GACACGGAGG TCCGCCCCGG ACAGCGCGCC 
ACGGTGGATG TGCCGCTGGC CCAACTCTAC ACCCACACCC AGCTGCACAT GCCGGTGCAG
GTGGTGCACG GCCGACGCGA GGGGCCGGTG CTGCTGGTCA GTGCCGCGCT CCACGGCGAC
GAGATCAACG GCGTGGAGAT CATCCGCCGG CTGCTCAAGC TCTCGGCCCT GCGCCAGCTG
GCCGGCACTC TGGTGGCAGT GCCCATCGTC AACGTCTTCG GGTTCATCCA CCGCTCCCGC
TACCTGCCTG ACCGGCGCGA TCTCAACCGC TGTTTCCCGG GCAGCGAGCG AGGCTCCCTG
GGCGCCCGCA CCGCCTACCT GTTCCGCACC GGGATCGTCG AGCGCTGCAA CCACGTCATC
GACCTGCACA CCGCCGCCAT CCATCGGGAC AACCTCCCCC AGATCCGGGT CAACCTGGAG
AATGCCGAAG CCGCCGCCAT GGCCCGCGCC TTTGGCATGC CGCTCACCCT GAACAGTGGG
CTGATTGAGG GCAGCCTGCG GGCGGTGGCG GACGATGCCG GCATCCCGGT GATCACCTAT
GAGGCGGGTG AGGCCCTGCG CTTCCAGGAG CCGGCCATCA AGGCCGGACT GGCCGGCACC
GTGCGGGTGA TGCGCAGTCT GGGTATGCTG CCGTCACGGA GCGGGCGGCA CACCGGTGGC
TCCCGCCAGA GCTATGTCGC CAATGCCTCG CAATGGGTGC GCGCCGAACA AGACGGCATC
TTCCGCACCG TCAGCCCTCT CGGGACCCAC GTGAAACAAA GGCAGGTACT GGGTTATATT
GCAGACCCCT TCGGCGAGCG CGAGCTGCCC GTCCATGCGC CCTTCAGCGG GATCGTGGTG
GGCCGCAATA ACCTGCCGCT GGTGAACGAG GGCGAGGCGC TGTACCACGT GGCCCGATAC
GATCAGGCCG CCCGCGCCGA ACGGGTGGCG GCCCAGTGGG CCGCATTCGA GGAGGGGCTG
AACGGCGACT ACCCGCCCTC CGAGGAGCCG CCCATCGTCT GA
 
Protein sequence
MARRAPFRIL DTEVRPGQRA TVDVPLAQLY THTQLHMPVQ VVHGRREGPV LLVSAALHGD 
EINGVEIIRR LLKLSALRQL AGTLVAVPIV NVFGFIHRSR YLPDRRDLNR CFPGSERGSL
GARTAYLFRT GIVERCNHVI DLHTAAIHRD NLPQIRVNLE NAEAAAMARA FGMPLTLNSG
LIEGSLRAVA DDAGIPVITY EAGEALRFQE PAIKAGLAGT VRVMRSLGML PSRSGRHTGG
SRQSYVANAS QWVRAEQDGI FRTVSPLGTH VKQRQVLGYI ADPFGERELP VHAPFSGIVV
GRNNLPLVNE GEALYHVARY DQAARAERVA AQWAAFEEGL NGDYPPSEEP PIV
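
As a spot check, the first and last lines of the gene sequence can be translated and compared with the two ends of the protein sequence. The sketch below is plain Python with a hand-built codon table, and the helper name `translate` is hypothetical. The table holds the standard codon assignments, which NCBI translation table 11 shares. Table 11 differs from the standard table only in which start codons it permits, and the sketch does not handle that.

```python
from itertools import product

# Codon table in TCAG order, one amino acid letter per codon ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCGCGCC GGGCGCCCTT CCGCATCCTG GACACGGAGG TCCGCCCCGG ACAGCGCGCC"
# The last line starts in frame because the 1020 bases before it are a multiple of 3.
last_line = "AACGGCGACT ACCCGCCCTC CGAGGAGCCG CCCATCGTCT GA"

print(translate(first_line))  # MARRAPFRILDTEVRPGQRA
print(translate(last_line))   # NGDYPPSEEPPIV*
```

The first output matches the first 20 residues of the protein sequence. The second matches its final 13 residues, followed by the stop codon.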