Gene Mlg_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2081 
Symbol 
ID4269400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2359476 
End bp2360357 
Gene Length882 bp 
Protein Length293 aa 
Translation table11 
GC content69% 
IMG OID638126837 
Producttype 4 prepilin peptidase 1. Aspartic peptidase. MEROPS family A24A 
Protein accessionYP_742913 
Protein GI114321230 
COG category[N] Cell motility
[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1989] Type II secretory pathway, prepilin signal peptidase PulO and related peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.587572 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGGCG AACTCAACGC CCTCCTGGCC GACAACGCCC CGCTTGCGGC GGCCCTGGCC 
CTGGTGCTGG GGCTGCTGGT CGGCAGCTTT CTCAATGTGG TCATACTGCG CCTACCGATC
ATGATGGAGC GGGCGTGGGC ATCCGAGGTG GCGGCCAGCC GGGGTGAGGT GCACGAGGAC
GCGCAGTCCA CCCCCTTCAA CCTGGTCACG CCCCGCTCCC ACTGCCCGCA GTGCGGGCAC
ACCCTCTCCG CGCTGGAGAA CATCCCGGTG GTGAGCTGGT TGCTGCTGCG CGGCCGCTGC
CGAGCCTGTG GCACCCGGAT CAGCGGCCGC TACCCGCTGG TGGAACTGAC CACCGGCCTG
CTCTCCGCGC TGGTGGTGCT GCAACTGGGC TGGACCCCGG AGACCGCGGC CGCACTGCTG
CTCACCTGGA CGCTGGTGGC CCTTTCCGGG ATCGACCTCG ATCACCAATT GCTGCCCGAC
AGCCTCACCC TGCCGCTGCT CTGGGCCGGG CTGCTGGTGA ACAGCACCGG TCTGTTCGCC
GAACTCACGG ACGCCGTCTG GGGCGCGGCC CTGGGCTATC TGGTGCTGTG GGGGGTATTC
CATGCCTTCC GCCTGCTCAC CGGTAAGGAG GGTATGGGCT ACGGCGACTT CAAACTGCTC
GCCGCCCTCG GCGCCTGGCT GGGCTGGCAG GCCCTGCCGT TGATCATTCT GCTCTCGTCC
CTGGTCGGTG CTGCAGTGGG CATCGCCCTG ATAGCGCTCA AGGGCCGGGG CCGCGAGGTG
CCCATCCCCT TCGGGCCCTA CATCGCCGCC GCCGGCTTCA TCACCCTGCT CTGGGGAGAG
GCCCTGGTGC ACTGGTATTT CCGGGCCTCG GGGTTGGCCT GA
 
Protein sequence
MVGELNALLA DNAPLAAALA LVLGLLVGSF LNVVILRLPI MMERAWASEV AASRGEVHED 
AQSTPFNLVT PRSHCPQCGH TLSALENIPV VSWLLLRGRC RACGTRISGR YPLVELTTGL
LSALVVLQLG WTPETAAALL LTWTLVALSG IDLDHQLLPD SLTLPLLWAG LLVNSTGLFA
ELTDAVWGAA LGYLVLWGVF HAFRLLTGKE GMGYGDFKLL AALGAWLGWQ ALPLIILLSS
LVGAAVGIAL IALKGRGREV PIPFGPYIAA AGFITLLWGE ALVHWYFRAS GLA