Gene Mlg_1593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1593 
Symbol 
ID4268564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1821513 
End bp1823639 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content67% 
IMG OID638126350 
ProductC-terminal processing peptidase-1 
Protein accessionYP_742430 
Protein GI114320747 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTGGA CTGAATCTAT GAAACGGCAA GTCATACGCC CCGCCCTGGT CATCGCCGGC 
ATACTGCTGG CCCTGGTCCT GGTCATCAGC GGTAACGATG GCCCCGGGCG CCCGGCACCG
GCGGTCGCCC ACTCGGAGCT GGCCCCTGAT TCGGAGCAAC GGGAGAAGGC CAGCGTCATC
GCCGACCTGC TCACCCGCTA CCACTATCGC GGCCAGCCGT TGGATGCCGG GCTCTCGGAG
CGGGTTTTCG ACGCCTGGCT GGACCAGCTC GACCGAGAGC GCTTTTACCT GCTCCAAGAG
GACATTGACG CCTTCGATGA GCACCGGATC GGGCTCCACG AGCAGCTCCG CCACGGCGAT
CTCAGCGTGC CCTTCGCTCT CTATGAGCGT TACCGCGAGC GTGTGGCCGA GCGCACCGAA
TATGCCATCG GGCTGCTGGA GGCCGGTCTG GACTTCGACA CCGACCTGCG CTTTGAGCAG
GACCGCAGGG ACGCCGACTG GGCCGAATCC CGGGAGGCCC TCGACCGGCT CTGGCGCAAG
CGCGTCACCC ATGATGCCCT CACCCAGAAG CTGGCGGGCC GTGACGAGGA GCAGGTCATC
CAGACCCTCA CTCAGCGCTA CGAACGCATC CGCCGCACCA CCGAGCAGGA GAGCGGCGAG
GATGTGTTCC AGCGCTACAT GGACGCCTGG GCACACGCCT TCGATCCGCA TAGCAGCTAC
CTCTCGCCGC GTCGCTCCGA GGACTTCGAC ATCAATATGA GTCTCTCGCT GGAGGGCATC
GGTGCGATGC TGCAGAGCGA GCACGACTTC GTCACCATCG TCGAGCTGGT GCCGGGCGGC
CCGGCCGCCC AGAGTGAGGC GCTCTCACCC GGCGACCGCA TTATCGGTGT CGCCGAGGGC
GAGGACGGTG AGATGAAGGA CGTGGTCGGC TGGCGCCTGT CCGACGTGGT CGATCTCATC
CGCGGTCCGC GCGGCTCGGT CGTGTGCCTG CTGGTCCTGC CCGAGGCGGG CAGTGGTAAC
GCCACGCCCC GCGAGGTGGT CCTGGAGCGC AACGAGATCA AGCTCGAGGA CCAGGCGGCC
AGCGCCGAGG TCATCGAGGT ACCGGGTGAG GGGAGTGGCA AGGATCGCAT CGGGGTAATC
ACAATCCCTG CCTTCTACAT GGATTTCGAG GCGGCCGAGG CAGGCGATCC GGACTACCGC
AGCACCACCC GCGATGTCCG TCGGTTGCTC AACGAGCTCA AGGAAGGGGG CATTGACGGC
CTGGTGCTCG ATCTGCGCGG CAACTCCGGG GGCTCGCTGC GGGAGGCCGC GTCACTGTCC
GGCCTGTTTA TGGGCGGCGG CCCCATTGTC CAGGTTCGCC GCAGCAGCGG CGAGCTGGAG
GTGCTCCGCG GCGGTGACCG CGCTAATTCC GCACCGCTCT ACGATGGCCC GCTGGGGGTG
ATGGTGGACG GGTTCAGCGC CTCCGCCTCG GAGATTTTGG CCGGGGCCAT CCAGGACTAC
GGCCGGGGCG TGATCATGGG CAAGGACACC TTTGGCAAGG GCACCGTGCA GACCATGATC
AACCTGGACC GCTTTGGCCT GGGCAACGGC GAGGACGGCG CCGGGCGGCT CAAACTGACG
GTGGCCAAGT TCTACCGTGT CACCGGCGAC AGCACCCAGA AGAAAGGGGT GCAGCCCGAT
ATCATCCTGC CCTCGCCCAT TGATGCGTCG GAGTTCGGCG AACGCGGCAT CGACAATGCC
CTGCCCTGGA ACCAGATCTC TGCGGTCGAC TACCGGCGCG ACGACACGCT GGAGGAGCTC
ATCCCCGCCC TGCGCAGCCG GTACCAGACC CGGGCGGAGG ACGATCCGCA GTTCCAGGCC
CTGCTGCGTG ACTTCGAGTA CCAGCTCCAG CGGCGTGAGC GGACCGATGT CTCCCTCAAC
GAGACAACCC GGAAGCAGGA GCGGGACGCC GAGGAGCAGG AGCGGCTGGC ACTGCACAAT
GCCCGTCGGG AGGCCGCCGG GTTGGAGCCG CTGGAGGACG CCGAATCCAG GGATGACGAT
GAGCTGCCCG ATGTACTCCT GGAGGCGGCC GCCTCGGTCA TCGCCGACTT GAGACGGTTG
CAGGCCGCCT ATATGGCGCA GCGCTGA
 
Protein sequence
MHWTESMKRQ VIRPALVIAG ILLALVLVIS GNDGPGRPAP AVAHSELAPD SEQREKASVI 
ADLLTRYHYR GQPLDAGLSE RVFDAWLDQL DRERFYLLQE DIDAFDEHRI GLHEQLRHGD
LSVPFALYER YRERVAERTE YAIGLLEAGL DFDTDLRFEQ DRRDADWAES REALDRLWRK
RVTHDALTQK LAGRDEEQVI QTLTQRYERI RRTTEQESGE DVFQRYMDAW AHAFDPHSSY
LSPRRSEDFD INMSLSLEGI GAMLQSEHDF VTIVELVPGG PAAQSEALSP GDRIIGVAEG
EDGEMKDVVG WRLSDVVDLI RGPRGSVVCL LVLPEAGSGN ATPREVVLER NEIKLEDQAA
SAEVIEVPGE GSGKDRIGVI TIPAFYMDFE AAEAGDPDYR STTRDVRRLL NELKEGGIDG
LVLDLRGNSG GSLREAASLS GLFMGGGPIV QVRRSSGELE VLRGGDRANS APLYDGPLGV
MVDGFSASAS EILAGAIQDY GRGVIMGKDT FGKGTVQTMI NLDRFGLGNG EDGAGRLKLT
VAKFYRVTGD STQKKGVQPD IILPSPIDAS EFGERGIDNA LPWNQISAVD YRRDDTLEEL
IPALRSRYQT RAEDDPQFQA LLRDFEYQLQ RRERTDVSLN ETTRKQERDA EEQERLALHN
ARREAAGLEP LEDAESRDDD ELPDVLLEAA ASVIADLRRL QAAYMAQR