Gene Mlg_1049 details


Gene Information

Locus tag: Mlg_1049
Symbol:
ID: 4270522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1218945
End bp: 1223327
Gene Length: 4383 bp
Protein Length: 1460 aa
Translation table: 11
GC content: 71%
IMG OID: 638125801
Product: hypothetical protein
Protein accession: YP_741892
Protein GI: 114320209
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
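
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical and not part of the record.

```python
from typing import Tuple


def expected_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = expected_lengths(1218945, 1223327)
print(gene_len, protein_len)  # 4383 1460
```

The result agrees with the listed gene length of 4383 bp and protein length of 1460 aa.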


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.523001
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.173427
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATTC ACCTGCACTT CGTCGGCCGC GAGGTCGAGT CGGACGATCC GGATGCCGCG 
GTCTTCGAGG TGGGCTCGCC CTCCCGGAGC TACGATCTCA GTTGGGTCCG CAGGGACGTC
GAATGCGACG CCACTGCGCT GATCGAGATA GAGGATGACG ACGAGTGCAT CCTCTGGACC
ACGGCGGAAG ATCTGGCCGA GGACGAGGCC GCCCGCCGGG GGCACCTGCG CGGAGAGGGC
GACGCCGATA TTGCGCTGGA TGCATTGCTG GCGGGCGCGT CCACGAGCTC GCGGGGCGTG
CTGCGCCGGT TTGGCCGGGT GGTCCTGCGG ATCTTCGGGC GCCGGGTCGC CGACGAATTT
GCCGGCAGGT CCGCCAAGGC CATCGCCGAG TACTGGGAGG GCCAGGTCAC ACCGGGCCAC
GGGCTGTACC GCTGCGACCG CCCCGAGGGC CCGGACGCCG TCGACGAGGC GCAACGTGTG
CGCCAGGGCG AGGGCCTGCC CGGTGACGGG CCACTGCTGC TGCTCCTCCA CGGCACGGCG
TCCAGTATCG CGGGCAGTTT CTCCAAGCTC GGTGGGTCCC CCAGAGGTGA TGAGCCCACC
GTATGGGCCA CGCTACAGGC CCATTACCGG GCACCGGACG GCCGCTCGCG CATCTGCGCC
TTCCAGCACC CCACCCTCAG CGCCAGTCCC ATCGACAATG CCCTGCAGTT GGTTGACGAC
CTGCCCCGGG GGGCTGAACT CCACCTGGTC ACCCACTCCC GTGGCGGGTT GATCGGCGAT
CTGCTCTGCC TTGCCGGGCA GGATCCGTCT GCGCTCCGGG GTGGCCTGGA GCAGCAGTAC
CGGCGCGACG AGCAGTGGCT GGCGGCCACC GGTCGTGATC TGGACGATCC CGGGATCGCT
GCCCTCGCGG AACAGGACCG GGCCGACCGG GAGGCGCTGC TGCAACTGCT GGAACGGCTC
CGGGACAAAG GGATCCGGGT GCTCCGCTAC GTGCGGGTGG CCTGCCCGGC GGCCGGCACC
TCTTTGGCCT CCGAGCGCCT GGACCGACTG CTGAGCCTGC TGGGCAACCT CATCCGGTTG
CTGCCGCCGG CCAACGGGCC CTTCTTCAAC GGCGTGATCC GGCCGATGAT CCGGGCCCTG
GTCGCGCAGC GATTCAGCCC GGACCGGTTG CCGGGGCTTG AGGCGATGCT GCCCCACGGG
CCCTTGGTGC GCTGGCTGAA CCTGGCGGCC CCCGAGGCGG ATGCCGAGCT GGCGGTGATC
GCCGGCGACG CCGAGGGCAG CGGGTTGATC CGCCGGCTGG GGGTGCTGTT GCTGGACCGG
CTGATCTTTC CCGGCCGGCA CGATCTGGTG GTGGACACCC GCTCCATGTA CGAAGGCCTG
CGGCGCCGGC AAGGGGTGTG GCAATACTTC CAGAGCCGGC CCGACATCTC CCACTTCAAC
TATTTCGACA ACCCTGACTC ACAGCGGGCG CTGGTCGGTT GGCTGCTGGA GACCGACGAC
CGGGCGCTGT TCCGAAGCCG CGATGCCGAG GCACCCTCCC TGGCGGCCGA GACCTTCAGC
GGTGTGCTGG GCGTGAGCCG GGGTGACCCC CACCGGCCGC TGCTCTTCCT GCTCCCGGGG
ATCATGGGCT CGCAACTGAG GGCGCGGGAA ACCCGCATCT GGATGGTCCG CCGGCAGATC
CTGCTGGGCC GCATCACCCG GCTGGCCTAC GGCCGCAGTG GGGTGGAGAC CGATGGGGTC
TTCAGTCGGT ATTACCAGCC GCTGGCCGAG TATCTGGCCG AGACCCATGA GGTGCATGTC
TTCCCCTACG ACTGGCGGCT CTCCATCGAC GACGCGGCCC GGCGCCTGGC CGCAGAGGTG
CGCCGGGCCC GCGGGAGCGG GGGGCGCCCG GTGCGGTTCG TGGCCCACTC CATGGGCGGG
TTGGTGACCC GGGCCATGAT GGCCAGCGAC CCGACCTTGT GGGCGGAGAT CTCGGAGCAG
CCGGGCTCGC GGCTGCTGAT GCTGGGCACG CCGAACCGCG GTTCGTTTGC CATCGTCGAG
GCACTCCTCG GGCACGACCG GCTGGTGCGA ATGCTGGCGC TGCTGGATCT GCCGAACAGC
CTGAACCGGA TCCTCGGCCT TTTGGCCGAG TACCCGGGGG TCATGGATCT GCTGCCCCAG
GACGGCGAAC TGGACCTCTA CGACGTGGCC ACCTGGAGCC GCCTGACCCA CCACAACGGC
CGCTTCGGGC ACGTGCCCGA AGCAGAGCGG TTGAGGGGTG CGCGGCTGCG CCGCGACCGG
CTGCCGGCGC GGGTCTCGGG GAATGACCCC ATCGTCTACG TGGCCGGTCA GGCGCCGCTG
ACGCCGGCCG GTATCCGTAT GGACGGCAGC GGCGACGGCC GGCTGCGCTT TGTCGGGACC
ACCGAGGGGG ATGGCCGGGT CACCCACGCC GCCGGCGAGT TGCCGGGGGT GCCGCGTTGG
TTCTGCGACG CGGTGCACGG CGATCTCCCC GCCACCCGCT CAGCCTTCCC CGCCTATCTC
GACCTGCTGG AGACCGGGCG GACAGACCGT TTGCCCACGG AACCGCTCAG CCGCCGGCGT
GGCCTGGGGG CAAGGGAGTT CCTCTACGAC CCTTCGCCGG CCACCCAGCC GGATGAGGCG
ACACTGATCG CGGAGGCCCT GGGGGCCTCC GGGGACGCAG ACGAGGAGGC GGGCCGCCCC
GGCCTGACGG TCTCGCTGGT ATATGCCGAT CTGGCCCAGG CGCGCCACCC GGTGCTCTGC
GGGCATGCCG TGGGCGATGT GATCTCCGGG GCCGAGGCGG CCCTAGACGA GGCCCTGGGC
GGGGCACTGT CCCGGCGCTA CCAACTGGAC CGCATCCCGG GCCGGGTCGG CAGCACGGCG
GTGATTCTGC CCGACGGATC GCAGCGCTCG CCGGTGCCCG GCGCCGTGGT GATGGGCCTC
GGTGAGCTGG GCGAGCTCAG CGCCAACGTG ATCGCCGAGG CCAGCTACGC CGCCTGCCTG
GACTATCTGT TGGCGGTGGC CAGCCCGAGC GACGGCGCCT GGGTCGGGCT CAGCGCGATC
CTGGTGGGCG GCAATAGCCC GGCCGCCCTG CACGAGGCGG ATATCATTCG ATCGATCATC
CAGGGCGTGG TCCGGGCCAA CCGCGCCATG GTCGAGCAGG CGAATGGCAC CGGTGTGGCG
ATCACGGAGT TGGAGATCCT GGAGCACGAC GGCCTTCGTT TCCAGGAGGC CTGCCAGTTC
CTGCGCGCCG ATACCGGCCA ACTCAGCGAG GAGCTGGGCA CGGACCTCGT GCTGCGCCGG
GAGGAAAGGC ATCCCGAGCC GCGGGTGGAG ACCGGCAGCC TGGCAGCGCG GCGGGCGACC
CGCCGTTGGC GCCGGCTGAT CGTCACCGAG CAGGAGCGGC CCCCGCTGTT GAACTACGAA
CTGCTGGGCC TGCGTGCGCG AATCCCCGGT CGGGCGGTGG ACTTCGATCC GGTCACGGTG
GCGCGGCTAC TGGCGGAGGC CATGAGCGAT CCGCGCTTCG ACCGCGAGAC CGCCAATGCC
CTCTACCATC TGCTGATCCC CAACCACCTG GGCGATCAGA TCGATCATTT CGATGCCCTG
ATGCTGATCG TGGACGAGCG GACCGCGCGC CTGCCCTTCG AGATGCTGAG CAACAACCGC
CGCCCCGGGG CGCGGCCCTG GGCGGTGCGC CAGGCCATGG TGCGCCAGTT CCGGACGCCC
GAGTTCCGTC AGACCCTGAA GCCGGTGCGG GCGCAGAGCG CGCTGGTCCT GGGTGACCCG
GTCACCCCCT TGCCGGAGTT GCCCGGCGCC CGGGCGGAGG CGCAGGCCGT GGCCACCGCG
CTGCTGGGCC AGGGCTACGA TGTGCGGCTT CTGTTACAGC CGCGCGGGGC CGATGTGCTG
CGTGCGCTGT TCGCCAAGCC CTGGCGGATA CTGCATGTGG CCGGGCACGG CATCCACGAG
GCCCACGGGG AGGCCCCCGG GGCGTTGCGC AACGGCATCC TGCTGGACGA CGGCGCCCTG
CTGGGCCGCC CGCAGATCAG CGAGCTGCCG GAGACCCCGG ACTTCGTGTT CCTGAACTGT
TGCCACCTGG GTCGCATGGT CCAGCTCACC GCAGAGCAGC GGGTACGGTT TCAGCACTTC
GCCGGTACCA TCGCCCGCAG CTTCATCAAC ATCGGCGTGC GCGGCCTGGT GGTCGGGGGC
TGGGAGGTCA ACGACCGGGC CGCCCAGCGC TTCGCGGAGG TCTTCTACCA GGCCCTGCTC
AACGGCGAAG CATTGGGCCT GGCCGTGTTG CGTGCCCGCG CGGCCTGCTG GGCTCAGTTT
CCCGATTACA ACACCTGGGG GGCCTACCAG TGCTACGGGG ACCCGGACCT GCGGCTGGTG
TGA
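
The GC content field can be recomputed from a sequence printed in the 10-base blocked layout used above. This is a minimal sketch, assuming whitespace is the only separator and that GC content means the percentage of G and C bases. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Return the percentage of G and C bases in a whitespace-blocked sequence."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 60 bases of the gene sequence, copied from the record above.
first_line = "ATGGCGATTC ACCTGCACTT CGTCGGCCGC GAGGTCGAGT CGGACGATCC GGATGCCGCG"
print(round(gc_percent(first_line), 1))  # 66.7
```

The value printed here covers only the first 60 bases, so it is not expected to match the whole-gene figure of 71%. Passing the complete gene sequence would give the figure to compare against that field.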
 
Protein sequence
MAIHLHFVGR EVESDDPDAA VFEVGSPSRS YDLSWVRRDV ECDATALIEI EDDDECILWT 
TAEDLAEDEA ARRGHLRGEG DADIALDALL AGASTSSRGV LRRFGRVVLR IFGRRVADEF
AGRSAKAIAE YWEGQVTPGH GLYRCDRPEG PDAVDEAQRV RQGEGLPGDG PLLLLLHGTA
SSIAGSFSKL GGSPRGDEPT VWATLQAHYR APDGRSRICA FQHPTLSASP IDNALQLVDD
LPRGAELHLV THSRGGLIGD LLCLAGQDPS ALRGGLEQQY RRDEQWLAAT GRDLDDPGIA
ALAEQDRADR EALLQLLERL RDKGIRVLRY VRVACPAAGT SLASERLDRL LSLLGNLIRL
LPPANGPFFN GVIRPMIRAL VAQRFSPDRL PGLEAMLPHG PLVRWLNLAA PEADAELAVI
AGDAEGSGLI RRLGVLLLDR LIFPGRHDLV VDTRSMYEGL RRRQGVWQYF QSRPDISHFN
YFDNPDSQRA LVGWLLETDD RALFRSRDAE APSLAAETFS GVLGVSRGDP HRPLLFLLPG
IMGSQLRARE TRIWMVRRQI LLGRITRLAY GRSGVETDGV FSRYYQPLAE YLAETHEVHV
FPYDWRLSID DAARRLAAEV RRARGSGGRP VRFVAHSMGG LVTRAMMASD PTLWAEISEQ
PGSRLLMLGT PNRGSFAIVE ALLGHDRLVR MLALLDLPNS LNRILGLLAE YPGVMDLLPQ
DGELDLYDVA TWSRLTHHNG RFGHVPEAER LRGARLRRDR LPARVSGNDP IVYVAGQAPL
TPAGIRMDGS GDGRLRFVGT TEGDGRVTHA AGELPGVPRW FCDAVHGDLP ATRSAFPAYL
DLLETGRTDR LPTEPLSRRR GLGAREFLYD PSPATQPDEA TLIAEALGAS GDADEEAGRP
GLTVSLVYAD LAQARHPVLC GHAVGDVISG AEAALDEALG GALSRRYQLD RIPGRVGSTA
VILPDGSQRS PVPGAVVMGL GELGELSANV IAEASYAACL DYLLAVASPS DGAWVGLSAI
LVGGNSPAAL HEADIIRSII QGVVRANRAM VEQANGTGVA ITELEILEHD GLRFQEACQF
LRADTGQLSE ELGTDLVLRR EERHPEPRVE TGSLAARRAT RRWRRLIVTE QERPPLLNYE
LLGLRARIPG RAVDFDPVTV ARLLAEAMSD PRFDRETANA LYHLLIPNHL GDQIDHFDAL
MLIVDERTAR LPFEMLSNNR RPGARPWAVR QAMVRQFRTP EFRQTLKPVR AQSALVLGDP
VTPLPELPGA RAEAQAVATA LLGQGYDVRL LLQPRGADVL RALFAKPWRI LHVAGHGIHE
AHGEAPGALR NGILLDDGAL LGRPQISELP ETPDFVFLNC CHLGRMVQLT AEQRVRFQHF
AGTIARSFIN IGVRGLVVGG WEVNDRAAQR FAEVFYQALL NGEALGLAVL RARAACWAQF
PDYNTWGAYQ CYGDPDLRLV