Gene Mlg_2568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2568 
Symbol 
ID4269671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2908951 
End bp2910231 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content67% 
IMG OID638127327 
ProductC-terminal processing peptidase-3 
Protein accessionYP_743398 
Protein GI114321715 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.257628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00928405 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCATGA TTCGACCGCT CTATTCTACC GCCATACTGC TGATCCTGGC CGCCGGGCTG 
GGGTTGGGGC AGACCGTATG GGCGGAGCGG CAACAGGCCA ACGCGGATCT GCCGTTGGAA
GAGCTTCAGG CCGTCGCCGA GGTCTACGCG CGGATTCGCA GCCACTACGT GGACGAGGTG
GATGACAAGG CGCTCCTGGA GGCGGCCGTG CGCGGCATGG TCAGCAGTCT GGACCCCCAC
TCCACCTTTC TGGATTCCAG CGAGTTCCAG GCCCTGCAGG AGGGCACCCG CGGTGAGTTC
GGCGGGCTGG GTATCGAGGT GGGCCAGGAG GACGGCTTCA TCAAGGTCAT CGCGCCCATC
GATGACACCC CCGCGAGCCG TGCCGGCCTG CGTCCGGGGG ACCTGATCAC CCGCATCGAC
GATAAACCGG TCAAAGGGAT GTCGTTGACG GAGGCGGTCA AGCAGATGCG CGGTGAGCCG
GGCAGCCAGA TAACTCTCAC CGTGGTGCGC GAAGGCGAGG ATCGCCCGCT GACCTTCGAG
ATCACCCGCG CCGTTATCCA GGTGGAGAGC GTGCGGGCCC GGATGCTGGA GCCCGGCTAC
GGCTATCTGC GCATCAGCCA GTTCCAGGAG CGCACCGGTC GTGACGTGCG GGAAGCGCTC
AGCGAACTGA AGCGGGAGGC CGACGGCAGC CTGCGCGGTC TGGTTCTGGA TCTGCGCAAC
AACCCCGGTG GGGTGCTGGA CGGTGCGGTC AGTGTCGCCG ACGTCTTTCT CAGCAACGGC
CGGATCGTCT ACACCGAGGG CCGGGACGAG CGCGCGGAGA TGAGCTTTAG CGCCACCCCG
GTGGATATGC TGCACGGCGC CCCGCTGGTG GTGCTGGTCA ACCAGGGGTC CGCCTCCGCC
TCGGAGATCG TCGCCGGGGC CCTGCAGGAT CATGGACGCG CGGTGGTCAT GGGGTCACCC
ACCTTTGGCA AGGGCTCGGT GCAGAGCATC CTGCCGCTGG GCCGCGGTGC GGCGGTCAAG
CTCACCACGG CGCGCTACTA CACCCCGGGC GGTCGCTCCA TTCAGGATAA GGGCATCCAG
CCCGATATCC TCTCCGAGGA GCTCAGGGTG GCCCGGGTGG AACGCGAGGA CATGAGCCCG
GCCGAGCTGG AGCGCCACGG CCTGCGCCAA CGGCCGGACG TGGATCGGGA CGACGAGACC
GAGAGCCTGG CGCAGCGCGA TTTCACGCTG TACGAAGCGC TGAACCTGCT GAAGGGCGTG
GGTATCTTTA CCGGTCGCTG A
 
Protein sequence
MRMIRPLYST AILLILAAGL GLGQTVWAER QQANADLPLE ELQAVAEVYA RIRSHYVDEV 
DDKALLEAAV RGMVSSLDPH STFLDSSEFQ ALQEGTRGEF GGLGIEVGQE DGFIKVIAPI
DDTPASRAGL RPGDLITRID DKPVKGMSLT EAVKQMRGEP GSQITLTVVR EGEDRPLTFE
ITRAVIQVES VRARMLEPGY GYLRISQFQE RTGRDVREAL SELKREADGS LRGLVLDLRN
NPGGVLDGAV SVADVFLSNG RIVYTEGRDE RAEMSFSATP VDMLHGAPLV VLVNQGSASA
SEIVAGALQD HGRAVVMGSP TFGKGSVQSI LPLGRGAAVK LTTARYYTPG GRSIQDKGIQ
PDILSEELRV ARVEREDMSP AELERHGLRQ RPDVDRDDET ESLAQRDFTL YEALNLLKGV
GIFTGR