Gene Mlg_1104 details


Gene Information

Locus tag: Mlg_1104
Symbol:
ID: 4269811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1290016
End bp: 1291551
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 71%
IMG OID: 638125856
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_741946
Protein GI: 114320263
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit
TIGRFAM ID:
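
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 1290016
END_BP = 1291551
GENE_LENGTH_BP = 1536
PROTEIN_LENGTH_AA = 511


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1291551 - 1290016 + 1 gives 1536 bp, and 1536 / 3 - 1 gives 511 aa, matching the listed gene and protein lengths.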


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.723439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCCGA TTGCCCTCAG TGTCTTTTCC AGCCGCATGG CCGCCGTCTG CGAGGAGATG 
GGCGCCGTGC TGCGGCGCAC CGCCTTCTCG CCCAACATCC GCGACCGGCT GGATTTTTCC
TGCGCGGTGT TCGATGCCGA CGGCGGGCTG GCGGCGCAGG CCGCGCACAT CCCGGTGCAT
CTGGGCAGCA TGGCCTACGC CATGGCGGGG GTGATCCGGC GCTTCGACTG GCAGCCGGGG
GACATGGTGG TCTTTAACGA CCCCTTCCTG GGCGGTACCC ACCTGCCCGA TGTCACGCTC
GTCTCGCCGC TGTTCGTGGA TGGCGAACGG GTCGCCTTCC TGGCCAACCG CGCCCACCAC
GCGGATATCG GTGCGGTGAC CCCGGGCTCC ATGCCCCTGT CCACCACCCT GGAGGAGGAG
GGGGTGCTCA TCAGTCCGGT GCGGCTCTAC CGGGCAGGCC GGCGCGACGA GGCCGTGTTG
CAGCGCATCG TCTCGCGCAC CCGCAACCCC CGGCAGGCGG GGGGCGACTT TGCCGCCCAG
GCCAGTTCGG TCTCCAGCGG TGTGCACCGG CTGCAGGAAC TGGTCGGGCG CATGGGGATG
TCTGAGTTCC GGGCCGCCCT TGCGGCGTTG AACGATTACG CCGAGCGGCT GGTGCGGGCC
GCCTTGGTGG ACCTGCCCGA CGGCAGGTGG ACCTTTACCG ACTACCTGGA TGACGACGGC
CAGGGGCAGC AGGACCTGCC CATCCAGGTG GCCCTGACCC TCGATCACCA CGATGCCCAC
GTGGACTTCG CCGGCTCCGC CGACCAGGTG CGGGGCAACC TGAACTGCCC CTTATCGGTG
GCGGCCGCCG CCGTGTTCTA CGCCTTCCGC TGTCTGATGC CGGAGCAGAC CCCGGCCTGT
GCCGGTGCCT TCCGGCCTAT CACCCTGAGC GCCCCCGAGG GCAGCCTGCT CAACGCCCGC
CACCCGGCCG CGGTGGCTGC CGGCAATGTC GAGACCAGCC AGCGGGTGGT GGACGCGGTG
CTGGGGGCGC TGGCGCCGGC GCTGCCGGAC CGCATCCCCG CGGCCAGCCA TGGCGGCATG
AACAACCTGG CCATGGGGGC GCTGGCGGAG GACTCGCCCT GGGACTACTA CGAGACCCTG
GGCGGCGGCA TGGGCGGCGG TCCCCACCAT CGCGGCCGGT CGGGCGTTCA GGTGCACATG
ACCAATACCC TCAACACCCC CCTGGAGGCG CTGGAGATGG CCTATCCGCT GCGTCTGCGC
CGCTATGCCC TGCGCCGTGG CTCCGGCGGT GCCGGGCGCC ATCCCGGAGG CGAGGGGGTG
ATCCGGGAGT ACGAGTTTCT CACCCCCGCA TCGGTCACGC TGATCACCGA ACGGCGCCGC
CATGCCCCCT GGGGGCTACA AGGTGGCGCG CCGGGCGCGG TGGGCGAGAA CCGGCTCAAC
GGCGAGCTGC TGCCGGGCAA GGTGCGCCTG GAGGTGGCGG CCGGGGACCG GCTCACTGTC
ATGACCCCCG GGGGTGGCGG CTGGGGCGGT TCATAG
 
Protein sequence
MDPIALSVFS SRMAAVCEEM GAVLRRTAFS PNIRDRLDFS CAVFDADGGL AAQAAHIPVH 
LGSMAYAMAG VIRRFDWQPG DMVVFNDPFL GGTHLPDVTL VSPLFVDGER VAFLANRAHH
ADIGAVTPGS MPLSTTLEEE GVLISPVRLY RAGRRDEAVL QRIVSRTRNP RQAGGDFAAQ
ASSVSSGVHR LQELVGRMGM SEFRAALAAL NDYAERLVRA ALVDLPDGRW TFTDYLDDDG
QGQQDLPIQV ALTLDHHDAH VDFAGSADQV RGNLNCPLSV AAAAVFYAFR CLMPEQTPAC
AGAFRPITLS APEGSLLNAR HPAAVAAGNV ETSQRVVDAV LGALAPALPD RIPAASHGGM
NNLAMGALAE DSPWDYYETL GGGMGGGPHH RGRSGVQVHM TNTLNTPLEA LEMAYPLRLR
RYALRRGSGG AGRHPGGEGV IREYEFLTPA SVTLITERRR HAPWGLQGGA PGAVGENRLN
GELLPGKVRL EVAAGDRLTV MTPGGGGWGG S
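
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sense-codon assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_fraction` are illustrative only. The input is the first line of the gene sequence from this record.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence in this record (60 bases).
FIRST_LINE = "ATGGACCCGA TTGCCCTCAG TGTCTTTTCC AGCCGCATGG CCGCCGTCTG CGAGGAGATG"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MDPIALSVFSSRMAAVCEEM
    print(round(gc_fraction(FIRST_LINE), 3))
```

The 20 residues produced from the first 60 bases match the first two blocks of the protein sequence (MDPIALSVFS SRMAAVCEEM). Applying `gc_fraction` to the full gene sequence, rather than this 60-base excerpt, is what would be compared against the listed GC content.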