Gene Mlg_0559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0559 
Symbol 
ID4270314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp607309 
End bp608427 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content71% 
IMG OID638125300 
Productpermease YjgP/YjgQ family protein 
Protein accessionYP_741403 
Protein GI114319720 
COG category[R] General function prediction only 
COG ID[COG0795] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.298895 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00000418027 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGTCGCCCA TCCGCATTCC CGTCGTCGAT CGCTACCTGG TGCGCGAGGT GTTGTTCGCC 
TGGCTGGCGG TGCTGTTGGT GCTGTTCGCG GTGCTGGCGA CCAACCGGCT GATCGGCTAT
CTGGGCGATG CCGCCGGTGG CGAACTGCCC GGTGGGGTCA TCCTCACCCT GCTGGGCCTG
CAGACGGTTC GCTATCTGGG CATGATTCTG CCCGCCAGCT TCTTTCTGGG CATCGTCCTC
GCCTTTGGCC GGCTCTACCG GGACAGCGAG ATGGCGGTGA TGTCCGCCTG CGGGATCGGG
CCCTGGCGGC AGTTTCGCGC GCTGCTCTGG TTGGCCCTGC CGCTGGCAGG GCTGGTGGGG
CTGCTGTCGC TGTACTGGGG GCCGGCGGCG ACCCAGAAGG CGGAGCAGGT GCAGGCCGAG
GCGGAGGCCC AGGTGGAGTT CGCCGCCCTG CAGGCCGGCC GTTTCCTGCA GGCGCGCGGG
GCCACCGAAG GAACGCTCTA CCTGGAACGG CTCAGCGAAG ACCAGCGCGA GATGGAGGAC
GTCTTCATCC GCGCCGGTGG CACCGCGGAC CGGGTGGTCC TGGCGGCCCG GCGCGGGGTA
CAGGAGAAGG ACCCGGAGAC CGGTGACCGC TACCTGGTGC TGCTGGATGG CTGGCGTTAC
GACGGCCGAC CGGGCGCTGC GGACTGGCGG GTGACCCGCT TCGAGCGCCA CGGGGTCCTG
GTGGCGGAGG GTTCGGAGGA GGTGGCCGTG CGCCTGCGCC GCAATGCTCA GCCCACCGCC
GAGCTGTGGG GTTCCGACCA TCCGGCCGAC CGGGCCGAGG TGCAATGGCG GCTGGCGATG
CCGGCCATGA CCCTCCTGCT GGCGCTGCTG GCGGTGCCGC TCAGCAAGAG CGCGCCGCGG
GACGGGCGTT ACGGGCGCCT GCTCTCTGCC GTGTTGGTCT ATGTGGGCTA TTTCCAGTTT
CTGACCGTGG GCCAGGATTG GCTGGAGACG GGCCAGGTCC CGGCTGCCCT TGGGCTCTGG
TGGTTGCATG GGGCGGTGCT GGCCGTGGGC GTGCTGGGCC TGCTCTGGCG CTTCGACCTG
CTGCCCGCGC GTGGCGGGCA CAAGGGGCGG GCACCATGA
 
Protein sequence
MSPIRIPVVD RYLVREVLFA WLAVLLVLFA VLATNRLIGY LGDAAGGELP GGVILTLLGL 
QTVRYLGMIL PASFFLGIVL AFGRLYRDSE MAVMSACGIG PWRQFRALLW LALPLAGLVG
LLSLYWGPAA TQKAEQVQAE AEAQVEFAAL QAGRFLQARG ATEGTLYLER LSEDQREMED
VFIRAGGTAD RVVLAARRGV QEKDPETGDR YLVLLDGWRY DGRPGAADWR VTRFERHGVL
VAEGSEEVAV RLRRNAQPTA ELWGSDHPAD RAEVQWRLAM PAMTLLLALL AVPLSKSAPR
DGRYGRLLSA VLVYVGYFQF LTVGQDWLET GQVPAALGLW WLHGAVLAVG VLGLLWRFDL
LPARGGHKGR AP