Gene Mlg_0542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0542 
Symbol 
ID4268071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp590079 
End bp591404 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content67% 
IMG OID638125283 
Productmembrane protein 
Protein accessionYP_741386 
Protein GI114319703 
COG category[S] Function unknown 
COG ID[COG3174] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00215324 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCGCGAC AGGGAGGGGC CCGCCCCGGG CCACAGCCGC CAGCAATGGA AGAACTCACC 
GAGCAGTTCA TCGCCGGCAA CGAAACCATC CTGCAGTTGG CTGTGGCACT GCTGCTGGGC
GCGCTCATCG GTCTGGAGCG CGGTTGGGAA TCGCGGGAGC TGGCCGCCGG GCGCCGGGTG
GCAGGGATCC GCACCTATGC CCTGCTCGGG CTGTTGGGTG GCCTGTCGGC GGTGCTTTCC
GAGGCCCTCA GCCCCTGGGC CTTCCCGGTG ATGCTGATCG GCGTGGCGGC GCTGACGCTG
GTGGCCTACC GCACCCAGGC GGAGCAGGAG CGCAACGTCA GTATCACCGG CGCGGTGGGC
CAGATACTCA CCTTCTCGTT CGGGGCGATC GCGGTGGCGG TGGACATGGT GGTGGCCACC
GCGGGCGCAG TGGTCACGGT GCTGATCCTG GACAACAAGC GGGAGATCCA CGGCCTGATC
AACCAGCTTC ATGCCCATGA GCTGGACGCG GCCTTCAAGC TGCTGTTGAT CTCCGTGGTC
ATGCTGCCCC TGCTGCCGGA CGAGGGCATG GGCCCCGGCA GGGCCATCAA CCCTTACGAG
ATCTGGTGGC TGGTGGTGTT GATCGCCTCG GTCTCCTTCG TCGGCTACTT CGCCGTCCGG
GTCGGCGGCA CCGAGAAGGG GATCCTGTTC ACCAGCCTGT TCGCGGGGCT GAGCTCCTCC
ACCGCGCTGA CCCTGCACTT CTCCCGACAG TCGCGCCAGG CCGCCGAACT CAGCCCACTG
CTGGCCGCCG GCATCCTCAT CGCCTGCGGC ACCATGTTCC CGCGCATCCT GCTCTATGCG
CTGATCATCA ATCCGGCACT GATCCCGGCG CTGGTGCTGC CGGTCATCGT CATGGCCACC
CTGCTCTACC TGCCCGCGCT GGTCATCTGG CACCGGCAAC GCCGGCGCCA GGACGTGGCC
CAGCCGACCC TAAAACAGAA CCCGCTGGAT CTGAAGTCGG CGTTGATGTT CGGTGCCTTG
CTCACCGCCA TCATGTTCCT CGGTGAATGG CTGCGGGAAT GGCTGGGGGA CGCCGGCATC
TATCTGCTGG CCGCCTCCTC CGGGGTGGCG GACGTGGACG CCATCACGTT GTCGTTGACC
CGGATGTCCA ACGTCTCCAT CACCCTGGAC ACGGCGGTGA TGGGCATCGT CATCGCCGCC
TCGGTGAACA ACCTGATCAA GGGCGGCCTG GCCGCGGTGA TCGGCACCGG CGCGCTGGGC
AAACGGGTCA CCGGTCCCAT GCTGTTGTCG CTGGCCGCCG GGCTGGCCGT GGCTTGGTGG
CAATAG
 
Protein sequence
MPRQGGARPG PQPPAMEELT EQFIAGNETI LQLAVALLLG ALIGLERGWE SRELAAGRRV 
AGIRTYALLG LLGGLSAVLS EALSPWAFPV MLIGVAALTL VAYRTQAEQE RNVSITGAVG
QILTFSFGAI AVAVDMVVAT AGAVVTVLIL DNKREIHGLI NQLHAHELDA AFKLLLISVV
MLPLLPDEGM GPGRAINPYE IWWLVVLIAS VSFVGYFAVR VGGTEKGILF TSLFAGLSSS
TALTLHFSRQ SRQAAELSPL LAAGILIACG TMFPRILLYA LIINPALIPA LVLPVIVMAT
LLYLPALVIW HRQRRRQDVA QPTLKQNPLD LKSALMFGAL LTAIMFLGEW LREWLGDAGI
YLLAASSGVA DVDAITLSLT RMSNVSITLD TAVMGIVIAA SVNNLIKGGL AAVIGTGALG
KRVTGPMLLS LAAGLAVAWW Q