Gene Mlg_0827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0827 
Symbol 
ID4268252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp933247 
End bp936660 
Gene Length3414 bp 
Protein Length1137 aa 
Translation table11 
GC content74% 
IMG OID638125578 
Producthypothetical protein 
Protein accessionYP_741671 
Protein GI114319988 
COG category[S] Function unknown 
COG ID[COG4717] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCG AGAGCCTGCA CATTCGGCAA CTGCCCGGCA TCCACCCCGG ATTCGACCTG 
GAGGGGCTGG ACCCGCGGAT CAACCTGCTG CTGGGGCCCA ACGCCTCCGG CAAGAGCAGC
GTGGTGCGGG CCCTGCGCCA TCTGATGGAG ACCCGCACCG ACGACCCCGC CAACCTGGTG
CTGTCGGCCA CCTTCAGCGA GGGAGAGCGG CAATGGCAGG TGGATCGGCT GGGCCGCGAC
ATCCACTGGC GCTGCGACGG CGAACCCTGC GCGCCCCCAC CCCTGCCCGG CGCCGAGGGG
CTCGGCTATT ACTGGATCGG GCTCGACCGC CTGCTCGAAA CCACCCGCGA GGACCAGGCC
GCCGAGGCCA CCCTGCGCCG CGAGATGCAG GGCGGCTACG ACCTGCAGGC CCTGCGGGAG
AGTCAGCCGT TCAGCCTGTC CGCCCAGCGG GGGCGCATCG CCGCCCGGGA ACTGCAACAG
CGCCAGCAGG CCCTGCGCCG GGTGGAGCGC GAACACCAGG CGCTGGCCAG CGACGAGGCC
CGACTGCCGG AACTCAGCGA GGCCGTGGCC GAGGCGCAAC GGGCCCGGGA ACGCCGGCAG
GCCTGTGAGC AGGCCTTGCT GGCCCTCGAC GCCGCGCGGG AACTGCGCCA CCAGGAGGAT
CGCCTGGCCC GCTATCCCGA TCCCATGCCC GTCGGCCTGA CCCGCGACCG GCTGGAGACC
CTGGAGCAGC AGGAGGCGGA ACTGGCACGC GCCCTGGCCA GCGCCCGCGA CGACCAGGCC
CGGGCGGAGG CCGACCGGCA GGCGACCGGT CTGGCGGATG GCGGCCCGCC ACCGGCGGAG
CTGCAAGCCG CCGGCGAGCA GGCCCGTCGG CTCACCCGGC TGGAGGAACA GGTCCGCCAT
ACCCACGAGC AACTGGCTGC CACCCGCCGG GCCGCGGCCC AGGCGGCCCG CGCCCTGGGC
CGCGGCCCCG ACGACGACCG GGCGCAACCG GCGCTCTCCC CGGAGGAGCT GGCCGGCCTG
GAGCGGCTGG CCCACGAGGC CCACGCGGCA CGGGAACGCC TCCAGGCGCT GGACGCCCGC
CTGGCAACCC TGGAGCCCAA CGCTGAGCCC TTCGACCCCG CGCCACTGGA ACGGGGCTGC
CATGAACTGC GCCGCTGGCT GCGCCAGCCT GCCCCGCGGC CGCTGCACTG GCTGGGCCTC
GGGCTGACCG GGCTGGGCGG CGCGGGTACC GCGGCACTGG GGCTCACCCT TGGCCACTGG
CCCACCGTGG CCAGCAGCCT GGTGGTGCTC GCCGGCCTCG GTGCCAGTGC CGTGGCAATG
GGGCGCCGCC GGGACCGGCG CGAGAGCGAG ACCCGGTTTA CTGAACTGCC GCTCCCCGCC
CCGGAGGTCT GGGAGGAGGC CGCCGTCCGG CAACGCCTGG ATGAGCTGGA ACAGGCCTGG
CACCAGGCCC GAAGCCTTCA GCAGCGGCGC CAGGAGGCCG ACCAACTCCA CGCGCAAAGG
GGCCGGGCGC GCACAGAACA GGCTGAGGCG GAGCAGGCCC TCCACCGGCA TGCCGCCGCC
CTGGGCCTGG ACGCGGAACT GCCGCTGGCC TTCGACCGCG CCGTCCGCCT GCTGAGCCGC
CATCAGGAGG CCCGCGAACA ACAGGCCGCC CAGGAGGCCG TACTGGACCG CCAGGCCCAG
GAACTCGACA CCCTGCGCGA CACCCTGCAC CACTTCCTGG CGACCTGGCA CACCGCCCCG
CGCGAGGAGC GCAGCGAGGC ACTGGGCGCC GCCCTGGACG ACCTGCGGCA ACGCTGCGAG
GCCGCCGAGC GTGCCCGCCA GCAGGCGGAC AACGCCACTC GACTGATCCG GGAATTGGAG
CGCCAGCTTC GGGATACCCG CCGGCAGCTC GACCAGCTCT ACCACGACGC CGGCCTGCAA
CCCGACCAGC GGGCCACCCT GCTGGAACGC ATCGACCAGC TCCCTGAGTG GCAGGCCTGC
CGGCAGGCCC TGGAGGCCGC ACGGAGCAAT TACCGGCTGC GCCGTGAGGA CCTGGAGGGG
CAGGCCGACC CCGACATCAT CGAGTGGCTG GAGGCCGGCG ATGAGCCCGC CCTGCGCCGC
GCCGCTGAGG ACGCCGCCGA GGCCGCCGGA CGGCTGGAGC CGCTGCGCGA GGAGCGCGCC
GGCATCCGCA CCCGCCTGGA ACAGACCCGC CAGCGCCACG ACCTGGAGGA GGCCCTGGCC
CAGCGCGAGG AGGCGCGCGA GGCGCTGGCC GGAGAGCGCG AACAGGCCCT ATCCGCGGCC
GCGGCCCGGT TCCTGCTGGA GCGCGTGGCC CGCCGGCACG AACAGGCCCA CCGCCCCGAG
GCCCTGGCCC GGGCGGACCG GCTCTTTGCC CGCTTCACCC ACCAGCGCTA TGGGTTACGC
CTGGGCCCGG ACCAGCGACT GCAGGCAATG GACCACCATA GCGAACAGCC CCAGCCGCTG
GAGCGCCTCT CCACCGGCAC CCGCATGCAA CTGCTGATCG CCCTGCGAGT GGCGTGGCTG
GAGCAACTGG AGCGCCAGAC CCGGCCGCTA CCCCTGATCC TCGACGAGGC CCTCACCACC
ACCGACCCGG AACGCTTCCA GGCGGTGGCC GGCAGCCTCG CAGCGCTACT GGAGACCGGC
CGCCAGATCT TCTACCTGAG CGCCCAGCCG GAGGACGCCC GGCGCTGGGA GCTCGCCCTG
GGCCAGCGGC CCCACTGCAT CGAACTGGCC CAACTGCGCG GTACCGGTTC CGCACTGTCC
GACCAAGCCC TGCAACTGCC GGAGGCGGAG CCGGTCCCGG CCCCGGACGG GCACACCGCC
GAAAGCTACG CCCGCGCCCT GCAGGTGCCC GGCATCGACC CCTGGCGTCC GGCCGGCGAG
ATCCACCTCT TTCACCTGTT GCGCGACCGG CTCGACACCC TCCACCGGCT GCTGCGCGAC
TACCGCGTCC ACCACAGCGG CGAGTTGCAG CGGCTGCTGG AGGACCCGGC CCTTAAACAC
CACCTGCCAG CGGAACTGCG CGAGCAGCTC TCGCGCCGCG TGCAACTGGC CCACCACTGG
CTGCAGGCCT GGCGCCAGGG TCGCGGCCGG CCGGTCACCC GGGCGGTGCT GGAGGCCAGC
GGGGCGGTGA GCGACACCTT TATGCCACGG GTCGCCGAAC TCAACGACGC CCACCACGGC
GATGCCCGGG CCCTGCTGGA GGCCCTGGGT GCCGGCGAGG TCTCAGGCTT CCGCCGGGCC
AAGCTCGAGG AGCTGGAGAG CTACCTGGAG GCGGAGGGCC ACCTGGACCC GCAGCCCCGA
TTGGAGCGCG ACGAGCGCTA TCGCGCCGCG CTGGCCAAGG TGGAGCTGCC CCTGGAGGCG
ATCGGGCGCG ACGCTGAAAT CATCGACTGG CTGGAGGCGG CGCTACTGGC CTGA
 
Protein sequence
MKLESLHIRQ LPGIHPGFDL EGLDPRINLL LGPNASGKSS VVRALRHLME TRTDDPANLV 
LSATFSEGER QWQVDRLGRD IHWRCDGEPC APPPLPGAEG LGYYWIGLDR LLETTREDQA
AEATLRREMQ GGYDLQALRE SQPFSLSAQR GRIAARELQQ RQQALRRVER EHQALASDEA
RLPELSEAVA EAQRARERRQ ACEQALLALD AARELRHQED RLARYPDPMP VGLTRDRLET
LEQQEAELAR ALASARDDQA RAEADRQATG LADGGPPPAE LQAAGEQARR LTRLEEQVRH
THEQLAATRR AAAQAARALG RGPDDDRAQP ALSPEELAGL ERLAHEAHAA RERLQALDAR
LATLEPNAEP FDPAPLERGC HELRRWLRQP APRPLHWLGL GLTGLGGAGT AALGLTLGHW
PTVASSLVVL AGLGASAVAM GRRRDRRESE TRFTELPLPA PEVWEEAAVR QRLDELEQAW
HQARSLQQRR QEADQLHAQR GRARTEQAEA EQALHRHAAA LGLDAELPLA FDRAVRLLSR
HQEAREQQAA QEAVLDRQAQ ELDTLRDTLH HFLATWHTAP REERSEALGA ALDDLRQRCE
AAERARQQAD NATRLIRELE RQLRDTRRQL DQLYHDAGLQ PDQRATLLER IDQLPEWQAC
RQALEAARSN YRLRREDLEG QADPDIIEWL EAGDEPALRR AAEDAAEAAG RLEPLREERA
GIRTRLEQTR QRHDLEEALA QREEAREALA GEREQALSAA AARFLLERVA RRHEQAHRPE
ALARADRLFA RFTHQRYGLR LGPDQRLQAM DHHSEQPQPL ERLSTGTRMQ LLIALRVAWL
EQLERQTRPL PLILDEALTT TDPERFQAVA GSLAALLETG RQIFYLSAQP EDARRWELAL
GQRPHCIELA QLRGTGSALS DQALQLPEAE PVPAPDGHTA ESYARALQVP GIDPWRPAGE
IHLFHLLRDR LDTLHRLLRD YRVHHSGELQ RLLEDPALKH HLPAELREQL SRRVQLAHHW
LQAWRQGRGR PVTRAVLEAS GAVSDTFMPR VAELNDAHHG DARALLEALG AGEVSGFRRA
KLEELESYLE AEGHLDPQPR LERDERYRAA LAKVELPLEA IGRDAEIIDW LEAALLA