Gene Mlg_1343 details

Gene Information

Locus tag: Mlg_1343
Symbol:
ID: 4270016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1541412
End bp: 1542857
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 68%
IMG OID: 638126096
Product: protease Do
Protein accession: YP_742182
Protein GI: 114320499
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
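
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1541412
end_bp = 1542857
protein_length_aa = 481

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS, one codon per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)   # 1446
print(expected_cds_bp)  # 1446
```

Both figures agree with the listed gene length of 1446 bp.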


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.139056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.641282
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGAC CGTCAAGCTA CGGATACCTG GCTGCGCTGC TGGCGCTGGC CCTGCTGACG 
TTTGGCGCCA GCGCCGGCGC CAAGTCGCAT CTGCCGGACT TCACCGAACT GGTTGAGGAG
AACAGCCCGG CAGTGGTCAA TATCAGCACC CGCCAAACCC CCGAGAGCCG GGGCGGGAGC
GGGCGTCTGC CGGACCATTT CGATATCCCG GAGGACCACC CGCTGCGTGA TTTCATGGAG
CGGTTCTTTG GCGAGCGGGG TGAGCGGCCA CCCGAGCACG GCCAGCGCCG CCCGCGCTCT
CTCGGGTCAG GTTTCATCAT CTCCGAAGAC GGGTATGTCC TGACCAATCA CCACGTCATC
GACGGTGCGG ATGAGGTCAA CGTCCGGCTG AGTGACCGGC GCGAGTTCGT GGCCGAGGTC
ATCGGCAGCG ATGAGCGCAG TGACGTGGCG GTGCTCAAGA TCGATGCCGA GGGGCTGCCC
ACGGTGCGCA TCGGCCAGTC CGACACGTTG CGCGTCGGCG AATGGGTGTT GGCCATCGGC
TCGCCCTTCG GTTTTGAGCA CTCGGCCACC GCCGGGATTG TCAGCGCCAA GGGGCGCAGC
CTGCCCAGCG GCAACTATGT GCCCTATCTG CAGACCGATG TGGCGATTAA TCCCGGCAAT
TCCGGCGGCC CGCTGTTCAA CCTGGACGGT GAGGTGGTCG GCATCAACTC CCAGATCTAT
AGCCGCACCG GCGGTTTCAT GGGGGTCTCC TTCTCCATCC CCATCGAGCT GGCCATGGAC
GTGGCCACCC AGCTGCGGGA GACCGGGCGT GTGGCTCGGG GCTGGCTCGG GGTGATCATC
CAGGACGTCA CCCGGGACCT GGCCGAGTCG TTCGATATGG ACCGGCCGCG TGGCGCCCTG
GTGGCCCAGG TGCTTTCGGA CAGCCCGGCC CTTGAGGCGG ATCTGCAACC CGGCGATATC
ATCGTCGAGT TCGACGGCGA GGCGGTGGAG ACCTCCGGCA GCCTGCCGCC GATGGTGGGG
GCTACGCCGG TGGGTGCGGA GGTGCAGGTA AAGGTGCTGC GCGAGGGCCG CGAGGTGATG
GTTGATGTCA CCATCGGCGA GCTGCCCGAG GAGCAGGCGC GGGCCCAGCG CCCGCCCCGG
GGTGAGCCGG AGCGTGCGCC GGATACCGCC GCGGAGCGAC TGGGGCTGCG GGTGGAGCCG
GTGCCCGCCG AGCGCCTGGA GGAGCTGCGC GTCGACAGTG GTGTGCTGGT CCGGCGCGTG
GAGAGCGGCC CGGCGCGGGA GGCCGGCATC CGCCCCGGTG ATGTGATCAC CTCCATCGAT
CAGCAGTCCG TGGAAGGCGT GGAGCAATTC GCAGAGTTGG TCGAAGGCCT GGCCTCCGGG
CGCAGTGTCC CGGTGCTGGT GCTGCGGGAG GGGGGGGCGC GTTTCTTCGC CCTGCGTATC
CCCTGA
 
Protein sequence
MNRPSSYGYL AALLALALLT FGASAGAKSH LPDFTELVEE NSPAVVNIST RQTPESRGGS 
GRLPDHFDIP EDHPLRDFME RFFGERGERP PEHGQRRPRS LGSGFIISED GYVLTNHHVI
DGADEVNVRL SDRREFVAEV IGSDERSDVA VLKIDAEGLP TVRIGQSDTL RVGEWVLAIG
SPFGFEHSAT AGIVSAKGRS LPSGNYVPYL QTDVAINPGN SGGPLFNLDG EVVGINSQIY
SRTGGFMGVS FSIPIELAMD VATQLRETGR VARGWLGVII QDVTRDLAES FDMDRPRGAL
VAQVLSDSPA LEADLQPGDI IVEFDGEAVE TSGSLPPMVG ATPVGAEVQV KVLREGREVM
VDVTIGELPE EQARAQRPPR GEPERAPDTA AERLGLRVEP VPAERLEELR VDSGVLVRRV
ESGPAREAGI RPGDVITSID QQSVEGVEQF AELVEGLASG RSVPVLVLRE GGARFFALRI
P
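
The sequences above are displayed in blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a rough sketch of that cleanup together with a GC-content helper. The function names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last display lines of the gene sequence are used here, so the printed GC value describes that fragment and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for the 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last display lines of the gene sequence above (not the full gene).
first_line = "ATGAATAGAC CGTCAAGCTA CGGATACCTG GCTGCGCTGC TGGCGCTGGC CCTGCTGACG"
last_line = "CCCTGA"

fragment = clean_sequence(first_line)
print(len(fragment))                   # 60
print(fragment[:3])                    # ATG (start codon)
print(clean_sequence(last_line)[-3:])  # TGA (stop codon)
print(round(gc_percent(fragment), 1))  # GC share of this fragment only
```

Running `gc_percent` on the full cleaned gene sequence would give a figure comparable to the GC content field in the record.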