Gene Mlg_1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1368 
Symbol 
ID4268131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1564990 
End bp1566177 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content66% 
IMG OID638126124 
Productpeptidase M50 
Protein accessionYP_742207 
Protein GI114320524 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00624726 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCGGG CACAGCGGGC GGTAAGATCG GGGCATGGAG CCGAGGCCGG GAGTGTGATC 
ATGTTCAAAT CAGTGCTGGT GCTGGGGTAC TACCGGGGCA TCCGGCTGGA GGTGCACGTC
AGCTGGCTGG TCATCTTTGC CCTGCTTCTG GTCACCATGA GTGCCGGGTT TCACCACCAC
TACGATCATT GGCCGCTGCC GGTAGCCATA CTCACGGCGC TGTTCACGTC GCTCACCTTT
TTCGCCTCCA TCGTCGCCCA CGAACTCGGC CACAGCCTGG TGGCCATCCG TCGTGGGGTC
CCGGTCAAGG CCATCACCCT GTTCATCTTC GGCGGGGTGG CCCAGATGAG CCGCGATGCC
GACAGCCCCG ATGATGAGTT CTGGATCGCC ATTGCCGGAC CCGCGGTCAG TTTTGCCCTG
GCGCTGCTTT TCGCCGCCCT GGCCCAGATC ACGGCGGGGA TTTTTGAGCC GCTGACCGTG
GCCCTGGGCT GGTTGGCGGT GATCAACCTG GTGGTGGCCG TGTTCAACCT CATCCCCGGC
TTCCCGCTGG ATGGCGGACG GGTCTTCCGC GCCGCGGTCT GGAAGTTCAC CGGCAGCGCG
CGCAAGGGGA TCGAGGCCGC CGTGGCGGGT GGCCGGCTGG TCGCCTACGG GCTGTTTGCC
CTGGCCTTGT GGAACATCCT GGTGCTGGGC AACCTAATCG GGGGGTTGTG GATCACCCTG
ATCGCCTGGT TCCTGTTCAA TATGGCCCAG GCCCAGGGGC GAATGTTCGA CCTGCGCGAG
CGCCTTTCCG GGGTGCGGGC CCGCGATCTG GCGCGGCCCG ACATCCCCCA GGTCGAGCCC
GGGACAGCCG TCAGTGACTG GGTCCATCAC CAGGTGCTGC CGGGGGGGCA GCGCGCCCAT
ATCGTTGGCA ATCGCGAGCA CGCCCATGGG CTGGTCTCTC TCTCCGATGC CCGGGCGGTG
CCACAGGCGC AGTGGGCCAC CACCCGCGTC GACGACATCA TGACCCCGGC GGAGGCCCTG
GTCAGTGCGA CGCCGGAGAC CGACGCGGCC CAGGTCCTAC AACTGATCAC CGAGCACAAC
CTCAATCAGC TCCCGGTGAT GGAGGGGCGC CGCGTGTTGG GTTGGATCGA CCGCCATCAA
CTGCTGCATA CCATCGATCT GCACATGGAG CTGAGGCGGC CGGAGTGA
 
Protein sequence
MRRAQRAVRS GHGAEAGSVI MFKSVLVLGY YRGIRLEVHV SWLVIFALLL VTMSAGFHHH 
YDHWPLPVAI LTALFTSLTF FASIVAHELG HSLVAIRRGV PVKAITLFIF GGVAQMSRDA
DSPDDEFWIA IAGPAVSFAL ALLFAALAQI TAGIFEPLTV ALGWLAVINL VVAVFNLIPG
FPLDGGRVFR AAVWKFTGSA RKGIEAAVAG GRLVAYGLFA LALWNILVLG NLIGGLWITL
IAWFLFNMAQ AQGRMFDLRE RLSGVRARDL ARPDIPQVEP GTAVSDWVHH QVLPGGQRAH
IVGNREHAHG LVSLSDARAV PQAQWATTRV DDIMTPAEAL VSATPETDAA QVLQLITEHN
LNQLPVMEGR RVLGWIDRHQ LLHTIDLHME LRRPE