Gene Mlg_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1094 
Symbol 
ID4269801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1276114 
End bp1277427 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content75% 
IMG OID638125846 
Producthypothetical protein 
Protein accessionYP_741936 
Protein GI114320253 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.095311 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTGGC TGGCCCTCCA CCTGCCCGAC CAGGCCCTGC CCGAGTCGCC CTGCACGCCC 
CCGGCAACGG GGGCGTGCAG GGCGACCGCC GATCAGGGGG CCGACGACCC CCTCGAAAAC
CTGGCGGCCT GGGCCTACCA GTACAGCAGC CGGGTCTGCC CCTGGCCGCC GGCCACGCTG
GTGCTGGAGA TTGGTGCCAG CCTGAACCTA TTCGGCGGCC TCCAGGCCCT GCTGTCCAGG
ATTGACACCG GACTGCGCCG GCTGGACCGG ACGGCCCGGC GGGCGGTGGC GCCCACGCCC
CGGGGGGCCT GTTGGCTGGC GCGCTGCGGG CAACAGCGGA TCCTCCAGAG CCCCGAGGCC
CTGCGCCAGG CCCTGGATCC GCTGCCCCTC GCGGTGCTGG ACCTGGAACC GCGCCAGCAT
CAGGCCCTGC ACGGTCTGGG CCTGCGCCGC CTGGGCGACT GTCTGGCCCT GCCCCGCCGG
GAACTGGCCC GTCGTCTGGG TCCGGCGCTC CACCAGCAAC TGGACCAGGC CCTGGGCCAC
CGCCCCGAAC CGCTGCCCGA ATGGCGGCCG CCGGCCCGCT ACCGGGGCCG CCGGGAACTG
GTGCGGGAGA CGGACAACCT CACCCCTCTG CTGCCCCTGC TGGAACGGCT GCTGCACGAA
CTGCAGGGCC TGCTGCGCGG CCTGGACGCC GGTGTCCCCC GCTTTGAGCT GGTGCTGGAG
CACCTCCACC GCCCCGCCAG TCGGCTGACC GTCGGCCTGA CGGAGCCGGA CCGCGACCCG
GAGCGTCTGC TGCGGGTGGC TGGCGAACGC CTGGCCCGGG AGCCGCTGGC CGCCCCGGTA
CAGGCGATCA CACTGCTGGC CGAGGACATC CAACCCCTGC GGCCCGAGCC GGAGGCCCTG
CCCGGCACTC GCGCCGCCCA CGATCATCAC CCCATGCGGG TGCTGCTGGA GCGACTGACC
GCACGCCTGG GCGAGACCCG CGCCCACGGG CTGGCGGTGC ATCCGGAGCA CCGCCCGGAA
CGGGCCTGGC GCCGGGTGCC ACCGGGCCAG GCCGGTGCCG CCGCCCCCCA GAAACCCCGC
CCCACCTGGC TGTTGGAGCG GCCGCGCATC CTCGGCCAGC AGCAGGGCCA GCCGGTCTGC
CGCGGCCCGC TGATCCTGGA GCGGGGGCCG GAGCGCATCG AGAGCGGCTG GTGGGACGGC
GCCGACGTGG CCCGCGACTA CTACGTGGCC CGCGACCATG ACGGCGCCCG CCTGTGGATC
TTCCGCGAGC GCCGTGGCCG CCGGCGCTGG TTCCTGCACG GCCTGTTCGG CTGA
 
Protein sequence
MLWLALHLPD QALPESPCTP PATGACRATA DQGADDPLEN LAAWAYQYSS RVCPWPPATL 
VLEIGASLNL FGGLQALLSR IDTGLRRLDR TARRAVAPTP RGACWLARCG QQRILQSPEA
LRQALDPLPL AVLDLEPRQH QALHGLGLRR LGDCLALPRR ELARRLGPAL HQQLDQALGH
RPEPLPEWRP PARYRGRREL VRETDNLTPL LPLLERLLHE LQGLLRGLDA GVPRFELVLE
HLHRPASRLT VGLTEPDRDP ERLLRVAGER LAREPLAAPV QAITLLAEDI QPLRPEPEAL
PGTRAAHDHH PMRVLLERLT ARLGETRAHG LAVHPEHRPE RAWRRVPPGQ AGAAAPQKPR
PTWLLERPRI LGQQQGQPVC RGPLILERGP ERIESGWWDG ADVARDYYVA RDHDGARLWI
FRERRGRRRW FLHGLFG