Gene Mlg_1723 details

Gene Information

Locus tag: Mlg_1723
Symbol:
ID: 4268972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1970630
End bp: 1972663
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 70%
IMG OID: 638126481
Product: peptidase S15
Protein accession: YP_742559
Protein GI: 114320876
COG category: [R] General function prediction only
COG ID: [COG2936] Predicted acyl esterases
TIGRFAM ID: [TIGR00976] putative hydrolase, CocE/NonD family
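
The coordinates and lengths in this table are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is the only reading that yields 2034 bp) and that the final codon is a stop codon that does not count toward the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1970630
END_BP = 1972663


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 2034 677
```

The result matches the listed gene length (2034 bp) and protein length (677 aa).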


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.463427
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.841285
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGTGA TCGAATCTTT CGACGACGAT GTCCGGGTGA TCGAGAACGT CTGGATCCCC 
ATGTCCGACG GCGTGCGCTT GGCCGCGCGG GTCTGGCTGC CGGTGGGCTC CGCCGACCAC
CCGGTGCCGG CGGTGCTGGA GTACATGCCC TACCGCAAGC GCGACTTCAC ACGCCTGCGT
GACGAACCGC TGCACCACTA CTTCGCCGGC CACGGCTACG CCGCCATCCG CCTGGACGTG
CGCGGCACCG GCGACTCCGA GGGCATCCTG CGCGATGAGT ACCTGGCCCA GGAGCAGGAC
GACGCCGAGG AGGCCATCGC CTGGATCGCC GAACAGTCCT GGTGCAACGG CCGGGTGGGC
ATGATCGGCC TGTCCTGGGC CGGCTTCAAC GCCCTGCAGG TGGCGGCCCG CCAGCCGCCG
GCGCTGAAGG CCATCATCAC CATGTGCTCC ACCGACGACC GCTATGCCGA CGATGCCCAC
TACAAGGGCG GCTGTCTGCT CAACGAGAAC CTGACCTGGG GCGCGGCCTT CTTCTCGCTC
AACGCCTGCC CCCCCGACCC GGAGATCGCC GGTGAGCCCT GGCGCGAGCA ATGGCTGGAA
CGGCTGGCAC ACAACCGCCT CTTCCCCGCC CTGTGGATGC GTCACCCCCA CCGGGATGAT
TACTGGAAGC AGGGCTCGGT GTGCGAGGAC TATTCGGCCA TCCAGTGCGC GGTCTACGCC
GTGGGCGGCT GGGCGGACGG CTACGTCAAC GCCATCCCAC GCCTGATGGC CGGCCTACAG
GCGCCGCGCA AGGCGCTGAT CGGCCCCTGG CCCCACGCCT TTCCCCACGC CGCCGAGCCG
GGGCCGCGCA TCGGCTTCTT CCAGGAAGCG GTGCGCTGGT GGGACTATTG GCTGAAGGAG
GAAGAGAACG GCATCATGGA CGAGCCGCTG ATCCGTGCCT GGATGGAGGA CTGGATCGCC
CCGGCGCCCC GCCACGACGA GCGCCCCGGC CGCTGGGTGG CGGAAACCGA GTGGCCCTCG
CCGCGCATCA CCCCGCGCAC CTGGCACCTC AACGTGCTCT CGCTGGGCGA TCACCCGGAC
CCGGAGGATC GCATGCGCCT GCGCTCACCG CAGACCACCG GGCTGCGCGC CGGCGATTTC
TACGGCTTCG GCGCCGAGGG CGACGCCCCC ATGGACCAGC GCACCGACGA CGGCAAGTCG
CTGGTCTTCG ACTCCGACCC GCTCAGCGAG CCGGTGGAGA TGCTCGGCAC GCCGGTCGTC
ACCCTGGAAT TGGCCTGCGA CACGCCACTG GCCCACGTGA TTGTTCGCCT GAACGACGTG
GCGCCGGACG GGGCCTCCGG CCGGGTCAGC TATGGGGTGC TCAACCTGGC GCACCGGGAC
AGCCATGAGC GCCCCGCCCC GCTGGTGCCG GGGCAACGCT ACCGGATCAC CGTACGACTG
AACGACATGG CCTACCGCTT CGCCCCCGGG CACACCATCC GGCTGGCGAT CTCCAGCGCC
TATTGGCCGA TCATCTGGCC GGCACCGGAA CGGTCGGAGA TCACCCTGAT CACCGGGGCG
AGCACGCTGG CGCTGCCCCT GCGACCGCCG CGACCGGAGG ATGACCAGCT ACCGGCCTTC
GGGCCGCCGG AGCGCTGCCC CATCCCGACC CACACCATCC TCGAGACCGC CGAGCCGGAG
CGCAGTATCG CCGTGGACCT GACCAACGAC GAGACCACCT ATACCGCCTT CGGGGACGCC
GGTGACGTGG GTGGCGCGGC CCTGGCCCGG ATCGAGGACA TCGATCTCAC CCTGGGCTCC
ACCATGCGTC GGGTGTTCCG TATCCAGGAG CAGGACCCGT TGTCCGCCGA GGCCGTCATC
GAGCAGGAGA CCCGCTTCCA GCGCGGCGAC TGGGCGGTGC GCATCGACGC CCGCATCCGC
CTCACCGCCG ATGCCGAGCA TTTCTTCATC CAGGCGATCC TGGACGCCTA CGAGAACGGG
GCCCGCGTAG CCAGCCGGGA GTGGAACGAG GCCATCCCCC GGCTGCTCCT GTAA
 
Protein sequence
MRVIESFDDD VRVIENVWIP MSDGVRLAAR VWLPVGSADH PVPAVLEYMP YRKRDFTRLR 
DEPLHHYFAG HGYAAIRLDV RGTGDSEGIL RDEYLAQEQD DAEEAIAWIA EQSWCNGRVG
MIGLSWAGFN ALQVAARQPP ALKAIITMCS TDDRYADDAH YKGGCLLNEN LTWGAAFFSL
NACPPDPEIA GEPWREQWLE RLAHNRLFPA LWMRHPHRDD YWKQGSVCED YSAIQCAVYA
VGGWADGYVN AIPRLMAGLQ APRKALIGPW PHAFPHAAEP GPRIGFFQEA VRWWDYWLKE
EENGIMDEPL IRAWMEDWIA PAPRHDERPG RWVAETEWPS PRITPRTWHL NVLSLGDHPD
PEDRMRLRSP QTTGLRAGDF YGFGAEGDAP MDQRTDDGKS LVFDSDPLSE PVEMLGTPVV
TLELACDTPL AHVIVRLNDV APDGASGRVS YGVLNLAHRD SHERPAPLVP GQRYRITVRL
NDMAYRFAPG HTIRLAISSA YWPIIWPAPE RSEITLITGA STLALPLRPP RPEDDQLPAF
GPPERCPIPT HTILETAEPE RSIAVDLTND ETTYTAFGDA GDVGGAALAR IEDIDLTLGS
TMRRVFRIQE QDPLSAEAVI EQETRFQRGD WAVRIDARIR LTADAEHFFI QAILDAYENG
ARVASREWNE AIPRLLL
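
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal Python sketch. It assumes that the amino-acid assignments of translation table 11 (bacterial, archaeal and plant plastid code) are the same as those of the standard code, with the two tables differing only in which codons may act as start codons. This gene starts with ATG, so that difference does not matter here. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of each base position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown in this record.
first_line = "ATGCGGGTGA TCGAATCTTT CGACGACGAT GTCCGGGTGA TCGAGAACGT CTGGATCCCC"
print(translate(first_line))  # prints: MRVIESFDDDVRVIENVWIP
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the Gene Information table.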