Gene Mlg_0078 details

Gene Information

Locus tag: Mlg_0078
Symbol:
ID: 4269906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 84577
End bp: 87321
Gene Length: 2745 bp
Protein Length: 914 aa
Translation table: 11
GC content: 73%
IMG OID: 638124803
Product: transcriptional activator domain-containing protein
Protein accession: YP_740925
Protein GI: 114319242
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
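
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mlg_0078.
length = gene_length_bp(84577, 87321)
print(length)                     # 2745
print(protein_length_aa(length))  # 914
```

Both results agree with the Gene Length and Protein Length fields of this record.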


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.323405
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCAGGCG GAATCCACCT GCCGGCTTGG GGGAGCGTTG TGAGGCTGGA ACTGCGGACG 
CTGGGCAGGA CCGAGGCCCG GCTTGATGGC GAGGCATTGC CGGGGCTGTG CCATACCAAG
ATCGGGCTGC TCCTGGTCTA TCTGGCCGTT GAGCGGCCGC GCCGCGTGCC CCGCCTGGAA
CTGGCCGGGC GCTTCTGGCC GGACCAGGCG GCCGACGCCG GCCGGGCCAA CCTGCGCCAA
GCGCTGTTCC AACTGCGCCG GCAACTGGGC GGGGCGGCGG CAGACCTGCT GCAGGCGGAC
GCGCGCCAGG TGAGATTGCG GCCGGAGGCC GGGGTGTGGG TGGACGGTCT AAGGCTGGAG
GCCTGCGTCC GGCCATCGGC GGCGCGGCAG CGACTCGACC CCGCGCAGTT GCTTACGGAG
CTGAACGCCT ACCAGGGGCA CTTTCTGGTC GACCTCGGTG TGTCCCCGGA GGCCACCGGG
CTGGAGGACT GGCTGACCCA AAACCGGCAG CGTTTCTGGC GTGCGGCGCT CACCCTCCTG
GAGCAGTGCC TGGAGGAGCG GCCGAGCCCG GAGGACAACG AGCGGATCCT GCGGGAGCTG
CAACGCTTTC TGGACCTGGA ACCCTTCGAT GAGGTATTGC ACCGGCTGAT GATGCGGGGG
CTGGCCGCCG GCCGCCAGCC CACCCGCGCC CTCGACCATT TCCTGGCCTT CGAGGCATTG
CTGCGGCGCG AGCTGGACAC CCGGCCCGGA CCGACCCTGC GCCGGCTCTA TAGCGAGCTG
CGCGAGCGCC TGGCCCCCGC TGCCCAGGCC TCGGTGGAGG ACGTCCGTCT GCGCCCTGAG
CGCCGGCGCG TGGCCTTGGT GGTCTGTGAC CTCCGCCCGC CCCCGCGCAC TGCCGGGCAG
ACCGATGTGG AGACCCTGAC CGCTGCCGTG CAGCAGGGGG TCACCCGGGT GGAGAGCGCG
TTGTTCGAGC ACCGGGGCCA CATCGTCCGC CTGCCCTGGG CCTGCCTGCT GGGCTATTTC
GGTTATCCCA GCGGACGCGA GGAGGCCACC CTGGATGCCC TGCGCGCGGC CTGGGAGGCG
TTGCAGAGCG GGCCCGCTGG GGTGCGCCCG CGGATCGCCG TGGACTCCGA CGTGATCATC
ACCGGCAGTG AGCCGGAGGT GCCGGATCCG GCCGGTCTGC TCACCGCCCG CGCGCTGAGC
CTGGCGGAGC GCACTGCCCC GGGCACGGTA GCGGTCTCGC CGGGGATTCG CGACCGTTTC
CAGGGGCGCT TCCGTTTCGC CGGCGGTGGG ACGTCGGCAC CACGGCTGGT CAACAACCCC
GGTCATGCCG ACCGGGTCGA GGCCCGGCTG GCGCGGGCAG CGGTCCCGCT GATCGGGCGG
GCCGAGCCGC TGGCCCGGCT GCGCCGCGCC TGGCGTCAGG CCGCAGATGG CCGTTGCACC
AGCCTGGTGA TCCGCAGTGA GGCCGGGCTG GGGAAAAGCC GCCTGGCCCG CGGCCTGGCG
GAAGAGGTCC GGGAGGAGGC GGCGCTGGTG CTGATGCTGC CCTGCCGCCA GCGCCTCAAG
CGACAGTTGC TGATGCCGCT GCGCCAGGCC CTGGCCGGCT GGATCGGTCC CCGGCCGGAC
CAGCGCCGGG CCCGCGCCCG GCGGCGGCTA CTGCAGGCGC GGCTGCGCCC CCATTTGGAT
CATGGCGCCG CCGCCCGCCG TCTGGCGGCC TGGCTGACCG ACACCGGCGA CCGCGGTCTG
ATCCGCCAGG GCAGTGATCG CGACCGGGTG CTGGACAGCC TGGTCCAGGT ACTGGCCGGT
GAGACCCGGC GCGGGCCGGT GCTTATCGTC TGTGATGACG CCCATTGGAT CGACTCCGGT
ACCGCCGAAC TGTTTCGCCG CATTCAGCGC CGCCTGGCGC GGCACCCGGT GCTGCTGATC
CTCACCGGCC GCATGAGCTT CCGCGCCGAG TGGCTACATA CCGCGCCTGG GGAGTTGCGC
CTGAGCGGCC TCTCCGGGCC GGACAGCCAG CGGCTGATCC GCGCCCTGGA CGGCGATGGC
GTGCTGCCGG AGGCCACCCG GCGCGCGATC AGCGCACGCG GCGAGGGTGT CCCGCTGTTC
CTGGAGGAGC TGACCCTGCA CGCCCTGCAG CGGCACCGGG GCGGCGAGCC CGGGGCGGCC
CTGCCGCCGG GGTTGTCCGA CCTGTTGGTC GCGCGGTTGG AGAGCCTGGG ACCCTGTCGC
GAGCTGGCCC ACGCTGCTGC CGTCATCGGC CGGGAGTTCG ACAGCCGCCT GCTGGCCCGC
CTCACCGGCA GCAACCTGGA CCAGGTGCAG ACCCAGGTGA AGCGGTTGAT GGCCCAGGGT
TTCGTCGAGC GTGACGGCAG CCGCCTGCGG TTCCGGCACG CCCTGTTCCA TCAGGCCGCC
TACGAGGCCC TGTTGGCCAC CGAGCGCCGG GCGCTGCACG GTCGCCTCGC GCAGTTGCTG
GAGGAGGACG CCGCGCTCGG GCTGGACGCC CCGGACCACG AGGTACTGGC CGAGCACCTG
CGTGAGGCGG GGCGTCCGGA GGAGGCGGTG GAGCACTGGC TGATGGCCGG GGAGCATGCC
CTGGCTCTGG GCATGCTCCC CGAGGCGGAG CAGCATTGCG CCGATGCACT GGCATTACTG
GGACGGCTCA CCGAGGCGGG GGGGCCGCCT GGGCGTCAGC GGAGCGATGG GCAGGCCCGA
AAAAAAGGCG GGCCGGCTGA GCCGACCCGC CCAACACAAC GCTGA
 
Protein sequence
MAGGIHLPAW GSVVRLELRT LGRTEARLDG EALPGLCHTK IGLLLVYLAV ERPRRVPRLE 
LAGRFWPDQA ADAGRANLRQ ALFQLRRQLG GAAADLLQAD ARQVRLRPEA GVWVDGLRLE
ACVRPSAARQ RLDPAQLLTE LNAYQGHFLV DLGVSPEATG LEDWLTQNRQ RFWRAALTLL
EQCLEERPSP EDNERILREL QRFLDLEPFD EVLHRLMMRG LAAGRQPTRA LDHFLAFEAL
LRRELDTRPG PTLRRLYSEL RERLAPAAQA SVEDVRLRPE RRRVALVVCD LRPPPRTAGQ
TDVETLTAAV QQGVTRVESA LFEHRGHIVR LPWACLLGYF GYPSGREEAT LDALRAAWEA
LQSGPAGVRP RIAVDSDVII TGSEPEVPDP AGLLTARALS LAERTAPGTV AVSPGIRDRF
QGRFRFAGGG TSAPRLVNNP GHADRVEARL ARAAVPLIGR AEPLARLRRA WRQAADGRCT
SLVIRSEAGL GKSRLARGLA EEVREEAALV LMLPCRQRLK RQLLMPLRQA LAGWIGPRPD
QRRARARRRL LQARLRPHLD HGAAARRLAA WLTDTGDRGL IRQGSDRDRV LDSLVQVLAG
ETRRGPVLIV CDDAHWIDSG TAELFRRIQR RLARHPVLLI LTGRMSFRAE WLHTAPGELR
LSGLSGPDSQ RLIRALDGDG VLPEATRRAI SARGEGVPLF LEELTLHALQ RHRGGEPGAA
LPPGLSDLLV ARLESLGPCR ELAHAAAVIG REFDSRLLAR LTGSNLDQVQ TQVKRLMAQG
FVERDGSRLR FRHALFHQAA YEALLATERR ALHGRLAQLL EEDAALGLDA PDHEVLAEHL
REAGRPEEAV EHWLMAGEHA LALGMLPEAE QHCADALALL GRLTEAGGPP GRQRSDGQAR
KKGGPAEPTR PTQR
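
The sequences above are printed in space-separated blocks of 10 residues, 60 per line, so they need to be joined before any downstream use. The following is a minimal sketch of that step together with a GC-fraction calculation that could be compared against the GC content field; the helper names are hypothetical, and only the first line of the gene sequence is used here for illustration.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGGCAGGCG GAATCCACCT GCCGGCTTGG GGGAGCGTTG TGAGGCTGGA ACTGCGGACG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(seq[:3])   # ATG
print(f"{gc_fraction(seq):.2f}")
```

Passing the full gene sequence through `join_blocks` and `gc_fraction` would give a value to compare with the reported GC content.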