Gene Mlg_2644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2644 
Symbol 
ID4268534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2993351 
End bp2995306 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content73% 
IMG OID638127403 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_743474 
Protein GI114321791 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAACTAC TGGGTGTCGA TACCGGAGGG ACATTCACCG ACTTCGTTTG TTCCGACGGC 
GAGCGGCTGC GGATCCACAA GGTGTTGTCC ACCCCCGAGC GCCCGGAGGC GGCGATCCTG
CAGGGGGTGC GGGAACTGGG CCTGGCGGAT GAGCCGTTGC GGCTTATGCA CGGGTCCACG
GTGGCCACCA ACGCCGTCCT GGAGGGGCAG GGTGCGCGGG TGATGTACGT GACCGGCCGC
GGACTGGGGG ATGTGCTCAC CCTGGGGCGC CAACACCGCG AGCGACTCTA CGCCCTGGAG
CTGGCCCCCG CCGAACCGCC GGTGCCGCCG GAGCTGTGCT GGGAGACCGG CGGCCGGCTG
GGCCCGGACG GCGCACTGGT GGATCCGCTC AGCGACGAGG ACCTGGCCGC CTTTGATCAG
GCCCTTGCCG AACGGCGCCC CGAAGCGGTG GCCATCAACC TGCCCTTCTC CTTCGTCGAT
GGCGGTCCGG AAGACACCCT GGCCGCGCGG GTGCCGGAGG GGGTCTTCGT CGCCCGCTCG
CACAAAGTGC TGGCGGAATA CGGGGAGTAC GAACGCGGCA TCGCCACCTG GCTGAATGCC
CGCGTGGGGC CGGTCATGTC CGGTTACCTG AACCGGCTCA CCCAGGCCCT GCCCAAGGCG
TCGGTCTCGG TGATGCAGAG CTCTGGCGAG CGGGTGGCCG CCGACCAGGC GGCGCGCATG
GCGGTCAACC TGTTGCTCTC CGGCCCGGCC GGTGGGCTGA TGGCGGGGCG CTATCTGGGT
GAGCTGGCCG GTGAGCCCCG GCTGCTCAGC TTCGACATGG GTGGCACCTC CACCGACGTG
GCGGTGATCG ACGGCGAGCC GGCCCTGACC TCCGAGGGCC ACATCGGCGG CTGGCCGGTG
GCCGTGCCCA TGGTGGATAT GCACACGATC GGGGCCGGCG GCGGCTCGCT GGCCTCGGTG
GACGCCGGGG GCTTGCTGCA GGTGGGGCCA CGCTCCGCAG GTGCCGACCC GGGGCCGGCC
TGTTACGGGC GTGGCGGGGC CGGGGCTACC GTGACCGACG CCCACCTGGT GTTGGGTCGG
CTGCGCCCCG ACGCCTTCCT GGGCGGTGAT ATGACCTTGG AGCCGGCCGC GGCCCGGCAG
GCCCTGACCC GCCTGGGTGA GGGGCTGGCC CTGTCCCCGG AGCAGGCCGC CGAGGGGGTG
CTGCGCCTGG CCAACGAGCA TATGGCACGG GCCCTGCGGG TGATCTCCGT GGAACGGGGG
CTGGATCCGC GGGACTTCAC CCTGCTCTCC TTCGGTGGCG CCGGCGGGCT GCATGTCTGC
GCCCTGGCCG AGGCCCTGGA TATGTCCCGC GCCCTGGTCC CGGTGCACGC CGGGGTGCTC
TCCGCCCTGG GCATGCTGGC GGCGCCCCGC GGTCGGCAGC TCTCGCGTAC CCTGACCGGG
CCACTGAGCG CGCTGGGTGC GGAGCGGGTG GAGCAGGTGC TGGCCGCGCT GGCCGACTCC
GGGCGCCGGG CCTTGAGCGC CGAGGGGGTG GCCGTGGCCG AGCAGCGGAC CCATCCCTCG
CTGGATCTGC GCTACCAGGG GCAGGCCTAC ACCCTGAACG TGGCCTGGTC GGGGGTGGCG
GCCACCCTGG AGGCCTTCCA TCGGCTGCAC GAGCAGCGCT ACGGCCATCG GTTGGAGGAG
CCGGTCGAGC TGGTCAACGT CCGCCAGTCG GTGACGGGCC CCCGGCCGGT GCCGCCGCTA
CCGCGGTTGC TGCCGGGTGA GGGCGGTCCG GTGGCCCTGG GCCGGGTGCA CGGGGTGGCC
GGAGAGGTGC CGGTCTACGA TCGGGCACGG CTGGGGGCGG GGCAGGTGCT GCGTGGCCCG
GCGCTGGTCA CCGAGCAGGT CGCCACCACC TGGCTGGCGC CGGGCTGGAC GGCGCGGGTG
GATGAGTACG GCAACCTGCT GATGGAGCGG GCATAA
 
Protein sequence
MQLLGVDTGG TFTDFVCSDG ERLRIHKVLS TPERPEAAIL QGVRELGLAD EPLRLMHGST 
VATNAVLEGQ GARVMYVTGR GLGDVLTLGR QHRERLYALE LAPAEPPVPP ELCWETGGRL
GPDGALVDPL SDEDLAAFDQ ALAERRPEAV AINLPFSFVD GGPEDTLAAR VPEGVFVARS
HKVLAEYGEY ERGIATWLNA RVGPVMSGYL NRLTQALPKA SVSVMQSSGE RVAADQAARM
AVNLLLSGPA GGLMAGRYLG ELAGEPRLLS FDMGGTSTDV AVIDGEPALT SEGHIGGWPV
AVPMVDMHTI GAGGGSLASV DAGGLLQVGP RSAGADPGPA CYGRGGAGAT VTDAHLVLGR
LRPDAFLGGD MTLEPAAARQ ALTRLGEGLA LSPEQAAEGV LRLANEHMAR ALRVISVERG
LDPRDFTLLS FGGAGGLHVC ALAEALDMSR ALVPVHAGVL SALGMLAAPR GRQLSRTLTG
PLSALGAERV EQVLAALADS GRRALSAEGV AVAEQRTHPS LDLRYQGQAY TLNVAWSGVA
ATLEAFHRLH EQRYGHRLEE PVELVNVRQS VTGPRPVPPL PRLLPGEGGP VALGRVHGVA
GEVPVYDRAR LGAGQVLRGP ALVTEQVATT WLAPGWTARV DEYGNLLMER A