Gene Mlg_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1045 
Symbol 
ID4270518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1195795 
End bp1197279 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content69% 
IMG OID638125797 
Producthypothetical protein 
Protein accessionYP_741888 
Protein GI114320205 
COG category[S] Function unknown 
COG ID[COG0397] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0432954 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGCAG AACCTATCTG GCCCTTTGAC AACAGTTACG CCCGGCTGCC CGAGCGCTTC 
TTTGCCCGTG TGCGGCCGAC CCCGGTGGCA CAGCCCGGTC TGGTGCGGCT CAACGAGCCT
CTGGCCGAAG CATTGGGGCT GGAGGTGGCG GCCTTACGCG GTAAGGCGGG CCTGGCGATG
TTCGCCGGCA ACCGTCTGCC TGAGGGGGCG GAACCCATCG CCCTGGCCTA TGCCGGCCAC
CAATTCGGGC AGTGGGTGCC GCAACTGGGT GATGGCCGGG CGGTGCTGTT GGGCGAGGTA
GTGGACAGGG ACGGCCGGCG CCGGGACATT CAGCTCAAGG GCTCCGGCAT CACCCCCTTC
TCCCGGGGTG GTGACGGGCG GGCGCCCATC GGACCGGTGG TCCGCGAATA CCTGGCGAGC
GAGGCCATGC ACGCCCTGGG CATCCCCACC ACCCGCTCGC TGGCGGCGGT GACCACCGGG
GAGCCGGTGC TGCGCGAGCG GGTGGAGCCC GGCGGCATCC TCACCCGGGT GGCGCACAGC
CATGTGCGGG TGGGCACCTT CGAGTACTTC CACTGGCGGG AGGATGTCGA CGCCCTGAGG
ACCCTGGCCG ATTACGTTAT CGCCCGCCAT TACCCGGAAC TGGCAGACGA CGCGCGGCCC
CATCTCGCGT TATTGAAGGC GGTGATCGAT CGCACTGCCG AGCTGGTGGC CCACTGGATC
AGCGTGGGCT TCATCCACGG GGTGATGAAC ACCGATAACA CCTCGCTGGT GGGCGAGACC
CTGGATTACG GGCCCTTCGG CTTCCTGGAC GCCTACCACC CCAGGACCTG CTACAGCGCC
ATCGACATTG AAAACCGTTA CGCCTTCGAC CAACAGCCGC GGATCGCGCA CTGGAACCTC
ACCCGGTTGG CGGAGACCCT GCTGCCATTG CTGCACGAGG ATGAGGACGA GGCCGTGGCG
CGGGCCGGGG AGGCGCTGAA CGGCTTCCTC CCGCGCTTCG AGGCCTGCCA CCATGCCCGA
CTGCGGGCCA AGCTGGGCCT TGCCGAAAGC CGCCGCGGGG ACATCGACCT GGCGCACGAG
TTGCTTGATC TCATGGCTCG GCAACAGGCG GACTTCACCC AGGTCTTCCG CGCCCTTTCC
GACGAGCGGA TGGATGATCC CGACGAAGGG CCCGCCCGAC GCTGCTTCGC CCGGCCCGAG
GCCCTGGATG GCTGGCGCGC ACGCTGGATC CAGCGATTAC GCCAGGAGGG ACGGCCGGAG
CCGGCACGCC AGGCCGCCAT GCGGGCGGTA AACCCCAAGT TCATCCTGCG CAACCACTTG
GCCCAATGGG CGGTGGATGC CGCCACCGAG CGGGGGGATT TCGGCCCCAT GGACCGGCTG
CTGCAGGTGC TGACCCGCCC CTACGACCCG CAGCCGGAGG CGGAGGCACT GGCCGCCCCG
CCCCGGCCGG AGCAGCAGGT CTATCAGACC TTCTGCGGTA CCTGA
 
Protein sequence
MPAEPIWPFD NSYARLPERF FARVRPTPVA QPGLVRLNEP LAEALGLEVA ALRGKAGLAM 
FAGNRLPEGA EPIALAYAGH QFGQWVPQLG DGRAVLLGEV VDRDGRRRDI QLKGSGITPF
SRGGDGRAPI GPVVREYLAS EAMHALGIPT TRSLAAVTTG EPVLRERVEP GGILTRVAHS
HVRVGTFEYF HWREDVDALR TLADYVIARH YPELADDARP HLALLKAVID RTAELVAHWI
SVGFIHGVMN TDNTSLVGET LDYGPFGFLD AYHPRTCYSA IDIENRYAFD QQPRIAHWNL
TRLAETLLPL LHEDEDEAVA RAGEALNGFL PRFEACHHAR LRAKLGLAES RRGDIDLAHE
LLDLMARQQA DFTQVFRALS DERMDDPDEG PARRCFARPE ALDGWRARWI QRLRQEGRPE
PARQAAMRAV NPKFILRNHL AQWAVDAATE RGDFGPMDRL LQVLTRPYDP QPEAEALAAP
PRPEQQVYQT FCGT