Gene Mlg_1428 details


Gene Information

Locus tag: Mlg_1428
Symbol:
ID: 4270426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1631895
End bp: 1632830
Gene Length: 936 bp
Protein Length: 311 aa
Translation table: 11
GC content: 65%
IMG OID: 638126184
Product: peptidase S49
Protein accession: YP_742267
Protein GI: 114320584
COG category: [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
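
The reported gene and protein lengths can be cross-checked from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the 936 bp stated in the record) and that the final codon is a stop codon that adds no residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1631895
end_bp = 1632830

# Assumption: coordinates are inclusive on both ends, hence the +1.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the last codon is assumed to be the
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 936 0 311
```

A remainder of 0 indicates the coding region is a whole number of codons, and the derived values agree with the Gene Length and Protein Length fields.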


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.574443
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAAA ACGACCGCTG GGAACGGGAT GCCATGCGCG AGATCGCCCT TGAGGGCATC 
AAGGAGCGCC AACGGGCGCG TCGCTGGGGC ATATTCTTCA AGCTGGCGTT CCTGGCCTAT
CTGGTCGCGT TGCTTATCCC CTTTGCCAGC GGGTTCCTGT TCGAGCGGCC CACAGGGCCG
CACCTGGCCA AGGTGAACGT TACCGGGCTG ATCAGCGCCG ACGAGCTGGC GAGTGCCGAG
CTGGTCAATC AGGGGCTGCA GGCCGCATTC AACGCCCCCC GGGCCGAGGG GGTGGTGCTG
TACATCAACA GCCCGGGTGG CAGCCCGGTG CAGTCAAACC GCATCTACTC CGAGATCAAT
CGCCTGCGCG AACAGCATCA GGGGATGGCG GTCTATGCGG TCATCGACGA CGTGGGCGCG
TCCGGGGCCT ATTATATCGC CTCTGCGGCG GATGAGATCT TCGTCAATCC GGCAAGCGTG
GTGGGCTCCA TCGGGGTCAT CTCCGGCGGT TTCGGCTTCA CGGAGGCGAT GGAGAAACTG
GGCGTGGAGC GGCGCATCTA CACCGCCGGC GAGAACAAGG CCTTTTTAGA CCCCTTTGCC
CCGGAGGAGG AAGCGCACCA GGCGCACATG GAGCGCCTGC TGGAGGAGGT GCATTCCCAG
TTCATCGCCG ACGTCCGCGC CGGGCGGGGT GAGCGCCTGG CCGATGATGA TCGCCTGTTC
AGCGGGCTCA TCTGGACCGG TGAAAGCAGT GTCGAACTGG GCCTGGCCGA CGGATTCGGC
GATATCGCGC ACGTGGCCCG CGAGGTGGCC GGTGTCGACC AGGTGTTGGA CTACAGTCGC
CACCCCGGCC TGCTGCGCTT CATCACCGAC CGGTTGGGCA TGAGTATCGG CAAGGCCATC
ACCCGCGCCC TGACCGAGGG CCACGAACTG CGCTGA
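
The gene sequence above is grouped in blocks of ten bases, so the spacing has to be removed before computing statistics such as GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as input. To compare against the 65% reported in the record, the whole sequence would need to be passed in.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAACGAAA ACGACCGCTG GGAACGGGAT GCCATGCGCG AGATCGCCCT TGAGGGCATC"
seq = clean_sequence(first_line)

print(len(seq), round(gc_fraction(seq), 2))  # 60 0.6
```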
 
Protein sequence
MNENDRWERD AMREIALEGI KERQRARRWG IFFKLAFLAY LVALLIPFAS GFLFERPTGP 
HLAKVNVTGL ISADELASAE LVNQGLQAAF NAPRAEGVVL YINSPGGSPV QSNRIYSEIN
RLREQHQGMA VYAVIDDVGA SGAYYIASAA DEIFVNPASV VGSIGVISGG FGFTEAMEKL
GVERRIYTAG ENKAFLDPFA PEEEAHQAHM ERLLEEVHSQ FIADVRAGRG ERLADDDRLF
SGLIWTGESS VELGLADGFG DIAHVAREVA GVDQVLDYSR HPGLLRFITD RLGMSIGKAI
TRALTEGHEL R
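
The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons; alternative start codons permitted by table 11 are not handled here. The `translate` function and the table construction are illustrative, not the tool the database itself uses.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGAACGAAA ACGACCGCTG GGAACGGGAT"))  # MNENDRWERD
print(translate("CACGAACTG CGCTGA"))                  # HELR
```

Both fragments match the corresponding ends of the protein sequence listed above, and the closing TGA is read as the stop codon.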