Gene Mlg_0699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0699 
Symbol 
ID4268858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp778630 
End bp781395 
Gene Length2766 bp 
Protein Length921 aa 
Translation table11 
GC content60% 
IMG OID638125448 
ProductFkbM family methyltransferase 
Protein accessionYP_741543 
Protein GI114319860 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID[TIGR01444] methyltransferase, FkbM family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.25199 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGAT ATGCAATCCA GCGCCACTTT CCTTTACAGT GCCCGAATAC ACACCAAGGG 
ATTACTACAG GCTCAACCGG CATGCCCAAA CATTCAGCTC CCATCCAGAC GTACCGCGCC
ATCGCCCAAG CACTCAATGC CGGAGATATC CACGGGGCAC GGATCCTCTG CGATAAGGCC
CTTGAGCGCA ACAGCAGCGA CGGCTGCGTG CTCACCCTGC ACGCCCGCCT CCTCTGGCAA
GAGGGACGGC CCGAAGAGGC GCTCTCAGCG TCAAGGGAGG CATTGGATCA AGGGGGGGAA
GGCGCAGCCG ATGCGGCCGC GATTCACGCA CAATTCCTCC TGGAACTCAA CCACACCGAC
GACGCCCTCG GGTTCCTCCG CCGGGCGGCG GAGAAGGCCC CTGAGCACCG TCGCTTGCAA
GCCCTCCTCG CGCAGGCCCT GGTCAAGGCC AACCGCCGCG AACAGGCGCG CGACATCCTT
GTGGAGCTGA TCAAGACGCA ACCCGACGCC GATACCTTAA ACAGCCTGGG GCACGTCCTG
TTCGACCTTG GTGACGGTGA GGCCGGGTTT CAGCAGTTGC AGTTGGCCGG GGTGCTTGAG
CCAGGAAATC ATGTCCTCTG GTCTAACATA CTGATGATGG CCCACTACCT GCCATCTCAA
AGTGCCAGGG ACCTACGCAA ACTGCATGCG CGCTGGTACG CAAACTGCGC CGCCCATCTA
GAGACAGAAC GCCATTTCGA GCGGGATCGC GACCCGGAGC GCCCACTGCG CATCGGTTTC
ATTTCCAATG GGTTCCACTC GCATCCCGCC GGCTGGCTCA GCTTCGGTGC GATCAATACC
CTTGCCCGGT ATTTTGATCA CACGCTGCAT CTCTACTCCA CTGCACCACC CCGACCCAAG
GACTTCCTAT CACACCGCTT CCAGAACATG ACCGGCCACT GGTGCCAAAT GGTCCATTGG
TCGCAAGAGG CCATCCATGC GCAACTGTTG AAGGACCGAC TCGACATCCT GGTGGACATG
ACCGGACACA GTGGGTATTC GGCACTGTCC GCCATAGCGA GAAGGGCCGC CCCGGTTCAA
GTCAAATGGG TCGGCGGGCT CTTCAACACC TCAGCAGTGC CAACCATCGA TTACCTGCTC
ACCGATTGGA TGGAGACCCC GGAAGGGGTC GAAGAGTTTT ATACCGAAAA GCTGATCCGG
CTGCCAACGG GGTATGTCAC CTATGCCCCA CCCCCTTACC TGCCGGACAT CGCCCCATTA
CCCGCAAAGG AGAATGGGTA CATCACATTC GCGTGCATGA ACAACCTGCA TAAGGTCAAC
CGGGAAATCG CCGGCGTCTG GGCAGGGATC CTGAAAGCAG TTCCGGACAG CCGCCTGCTT
ATCAAGGACA AGAAGCTATC CGACCCCGGC GCCCGGAAGC AGCTTTGGCA CATGCTGGTG
GAGGCGGGGG TTCCGGACAC AAGGCTGATC CTGGAGGAGG GAGCGCCCCA CCGGTACCTG
CTAGAGACCT ATCATCGGGT GGACATCGCC CTTGACCCCT GGCCTTATTC CGGCGGGTTG
ACGACTATTG AGGGGTTATA CATGGGGGTG CCGGTGATCA CGTGCCCCGG ACCAACTTTT
GCGGGCCGTC ATGCCGCCAG TCACCTTCAC AACTCAGGCC TGGACCAGTT CATCGCCGCC
GATTTCCACG ACTACAAGCA GATCGCCGTT GAGACGGCGG GTGACATCGA AAAGCTGGAA
GCACTGCGCG CGGGACTGCG GAAACAGTGC CAGAACTCTC CGCTGGGGAA TCACGCCCAA
TTTGCCACCA ACCTGGACCG GGCCTTCCGG ACCATCTGGC GGCGGTGGTG CGCCGACGCC
CAACCGGCAC ACCTGCATTT CTCCAAGCCG GCGCGCATCC CGTCCCATAT CACGGCCCGA
ATGAAGGCCG AGTTGCAGGA CCGGGATTCG CAACGAGCCG GGAACAGCAG AACGGCACCT
CGCTTCAGGC TGGCTGGCTT GGAGGCTGCG GCAATACAGC AGCCCAATGA CCGGGATGAA
ACAAACGGCG ATGTCGCACC CACCTTGCCA TCGGAAGCAG TCAAAGCCAC GGTCACTGGT
CCCGACAATC AGTCCCGTTC TTTATTGATC CCCAAGGGGG AAACCTTCCG CCTGAAAAAC
ATCTTCGAAG AGGAGGAGTA CGCGCTGCCA GCGGGTTTCC ACATATCGCC TGAGATGGTG
GTAGTGGATG TCGGTGCGAA CATCGGCGCG TTCGCCTTGT ACGCGGACCT TTGGTCACCA
CACTGCATCG TCCATTGTTT CGAGCCCAAC CCCCAGGTCC TGCCGCTGCT CGAGCGCAAT
AAACAGGAAG CGCGGGGCAC CATCCAGATC CATCCGTTCG CACTGTCCGA CGAAGATGGC
GAGCTGACCC TTTGGCAGCA CCCCAGGAAC ACGGGGGAGA CCTCACTGGC GAGACGCTCG
GATGGCGCGA CCCAAGTCCA AGTGCCCGTC CGAAACGCGC TGGATGCGCT GACCGCGGCG
GGAGTGGATC ATATCGATGT CCTGAAGATC GATACCGAAG GTTCAGAGGT CCCCGTTGTG
CAGACACTCG TACCGTTTCT GCCGAAAGTA TCGATCGTCA TGCTCGAATA CCACAGCGAG
GCCGACCGAC GAGCACTGGA CCGTCTGCTC TCCGATTTTC AGCTTTATGA CTGCACCGTA
ATGGGTGCCA GCGGCGTGGG AACCGTCAAA TATTTCAACA ACGCCCTGAA GCAGAGTAAA
GTGTAA
 
Protein sequence
MAGYAIQRHF PLQCPNTHQG ITTGSTGMPK HSAPIQTYRA IAQALNAGDI HGARILCDKA 
LERNSSDGCV LTLHARLLWQ EGRPEEALSA SREALDQGGE GAADAAAIHA QFLLELNHTD
DALGFLRRAA EKAPEHRRLQ ALLAQALVKA NRREQARDIL VELIKTQPDA DTLNSLGHVL
FDLGDGEAGF QQLQLAGVLE PGNHVLWSNI LMMAHYLPSQ SARDLRKLHA RWYANCAAHL
ETERHFERDR DPERPLRIGF ISNGFHSHPA GWLSFGAINT LARYFDHTLH LYSTAPPRPK
DFLSHRFQNM TGHWCQMVHW SQEAIHAQLL KDRLDILVDM TGHSGYSALS AIARRAAPVQ
VKWVGGLFNT SAVPTIDYLL TDWMETPEGV EEFYTEKLIR LPTGYVTYAP PPYLPDIAPL
PAKENGYITF ACMNNLHKVN REIAGVWAGI LKAVPDSRLL IKDKKLSDPG ARKQLWHMLV
EAGVPDTRLI LEEGAPHRYL LETYHRVDIA LDPWPYSGGL TTIEGLYMGV PVITCPGPTF
AGRHAASHLH NSGLDQFIAA DFHDYKQIAV ETAGDIEKLE ALRAGLRKQC QNSPLGNHAQ
FATNLDRAFR TIWRRWCADA QPAHLHFSKP ARIPSHITAR MKAELQDRDS QRAGNSRTAP
RFRLAGLEAA AIQQPNDRDE TNGDVAPTLP SEAVKATVTG PDNQSRSLLI PKGETFRLKN
IFEEEEYALP AGFHISPEMV VVDVGANIGA FALYADLWSP HCIVHCFEPN PQVLPLLERN
KQEARGTIQI HPFALSDEDG ELTLWQHPRN TGETSLARRS DGATQVQVPV RNALDALTAA
GVDHIDVLKI DTEGSEVPVV QTLVPFLPKV SIVMLEYHSE ADRRALDRLL SDFQLYDCTV
MGASGVGTVK YFNNALKQSK V