Gene Mlg_2658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2658 
Symbol 
ID4268548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3009925 
End bp3011307 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content70% 
IMG OID638127417 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_743488 
Protein GI114321805 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.287658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.000609596 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAGCCC GAATTCTGAT GGCCACCGGC CTGGCCCTCG GCCTGGCCTG GCTGGTCCCG 
CCGGCGGTGG CCGGCACGCC CGCGGTCCAC GAGTACACGC TGGACAACGG CATGACGGTG
GTGGTGCGCG AGGACCACCG GGCGCCGGTG GTGGTGAGCA TGGTCTGGTT TGCCGTCGGC
TCCAGCTACG AACAGCGGCC GCTGACCGGC ATCTCCCACG TGGTCGAGCA CATGATGTTC
AAAGGCACGG AGACCCGCCC GACCGGCGAG TTTTCCCGTC TTATCGCCGA GCGTGGGGGG
CGCCAGAACG CCTTCACCGG CCGGGATTTT ACCGGCTACC ACCAGCAGCT GGCGGTGGAG
CACCTGCCTT TGGCCTTCGA GTTGGAGGCC GACCGCATGC AGAACCTGGT CTTCGATCAG
GGTGAGTACG AGCGTGAGAT GGAGGTGGTG CGCGAAGAGC GCCGTCAACG GGTGGAGGAC
AACCCCACCG CCAAGTTCAT GGAGCGCTTC CGGGCCGTGG CCTGGAGCGC CAGTCCCTAC
GGCCAGCCGG TGATCGGCTG GATGGAGGAC CTGGACCGGT TGCGCCTGTC CGAGGTGGAG
GACTGGTACC GGCGCTGGCA CGGCCCGGAG AGCGCCACCC TGGTGGTCGT CGGCGCCGTG
GACCCGGATG CGGTTTTTGC CCTGGCCGAG GAGCATTTTG GTCCAGTCCC GGCCCGCGAG
CGGCCCGAAC CCATCCCCGG CGGCGATATC CCCGACCCGG GTGAGCGCGC CGTGACCGTG
CGTATCCCGG CGGAACTCCC CTACCTGGCC ATGGGCTGGC GGGTGCCCAC CCTGGGCAGT
ATCGACCGGG AAGACGAGGA GGCCCTGCGT GAGGTCTACG CCCTGGCGCT GCTTCGCGCC
GTGCTCTCCG GCGGCCAGGC GGCCATCCTG CCCGAGCGCC TGGAGCGGCA GCAGGGCGTG
GCCGTGGGCG CCGGGGCCAG CTATTCCGCC ACCGCGCGCC TCCAGGATCT GTTCCTGCTT
GCCGGCCGCC CCGCACCCGG CGCCGGACTG GACGAGCTGG AGGCCGCCCT GCGCGAGGAA
GTGCAGCGGT TGCAGGAGGA GCCGCTGGAC GAGGAGCGGT TGGTCCGCGC CCGCCGCCAG
TACGTGGCGG ATGAACTCTT CAGTCAGGAC TCCATGCGGG CGCAGGCGAT GCGTCTGGGG
GCGCTGGAGA GCACCGGGAT CGGCTGGGAG GCCGGTGAGC GCTTCCTGGA GGGGGTGCAG
ACCGTGACCG CTGAGGACAT CCAGCGCGTC GCCCGGCGCT ACCTGGTGGA TGATCAGCTC
ACGGTGGGTC GCCTGGTGCC CGCCGACCGC GAGGCGTCCA CTGACGCCGG GGAGGAGCAA
TGA
 
Protein sequence
MRARILMATG LALGLAWLVP PAVAGTPAVH EYTLDNGMTV VVREDHRAPV VVSMVWFAVG 
SSYEQRPLTG ISHVVEHMMF KGTETRPTGE FSRLIAERGG RQNAFTGRDF TGYHQQLAVE
HLPLAFELEA DRMQNLVFDQ GEYEREMEVV REERRQRVED NPTAKFMERF RAVAWSASPY
GQPVIGWMED LDRLRLSEVE DWYRRWHGPE SATLVVVGAV DPDAVFALAE EHFGPVPARE
RPEPIPGGDI PDPGERAVTV RIPAELPYLA MGWRVPTLGS IDREDEEALR EVYALALLRA
VLSGGQAAIL PERLERQQGV AVGAGASYSA TARLQDLFLL AGRPAPGAGL DELEAALREE
VQRLQEEPLD EERLVRARRQ YVADELFSQD SMRAQAMRLG ALESTGIGWE AGERFLEGVQ
TVTAEDIQRV ARRYLVDDQL TVGRLVPADR EASTDAGEEQ