Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1049 |
Symbol | |
ID | 4270522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1218945 |
End bp | 1223327 |
Gene Length | 4383 bp |
Protein Length | 1460 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638125801 |
Product | hypothetical protein |
Protein accession | YP_741892 |
Protein GI | 114320209 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.523001 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.173427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATTC ACCTGCACTT CGTCGGCCGC GAGGTCGAGT CGGACGATCC GGATGCCGCG GTCTTCGAGG TGGGCTCGCC CTCCCGGAGC TACGATCTCA GTTGGGTCCG CAGGGACGTC GAATGCGACG CCACTGCGCT GATCGAGATA GAGGATGACG ACGAGTGCAT CCTCTGGACC ACGGCGGAAG ATCTGGCCGA GGACGAGGCC GCCCGCCGGG GGCACCTGCG CGGAGAGGGC GACGCCGATA TTGCGCTGGA TGCATTGCTG GCGGGCGCGT CCACGAGCTC GCGGGGCGTG CTGCGCCGGT TTGGCCGGGT GGTCCTGCGG ATCTTCGGGC GCCGGGTCGC CGACGAATTT GCCGGCAGGT CCGCCAAGGC CATCGCCGAG TACTGGGAGG GCCAGGTCAC ACCGGGCCAC GGGCTGTACC GCTGCGACCG CCCCGAGGGC CCGGACGCCG TCGACGAGGC GCAACGTGTG CGCCAGGGCG AGGGCCTGCC CGGTGACGGG CCACTGCTGC TGCTCCTCCA CGGCACGGCG TCCAGTATCG CGGGCAGTTT CTCCAAGCTC GGTGGGTCCC CCAGAGGTGA TGAGCCCACC GTATGGGCCA CGCTACAGGC CCATTACCGG GCACCGGACG GCCGCTCGCG CATCTGCGCC TTCCAGCACC CCACCCTCAG CGCCAGTCCC ATCGACAATG CCCTGCAGTT GGTTGACGAC CTGCCCCGGG GGGCTGAACT CCACCTGGTC ACCCACTCCC GTGGCGGGTT GATCGGCGAT CTGCTCTGCC TTGCCGGGCA GGATCCGTCT GCGCTCCGGG GTGGCCTGGA GCAGCAGTAC CGGCGCGACG AGCAGTGGCT GGCGGCCACC GGTCGTGATC TGGACGATCC CGGGATCGCT GCCCTCGCGG AACAGGACCG GGCCGACCGG GAGGCGCTGC TGCAACTGCT GGAACGGCTC CGGGACAAAG GGATCCGGGT GCTCCGCTAC GTGCGGGTGG CCTGCCCGGC GGCCGGCACC TCTTTGGCCT CCGAGCGCCT GGACCGACTG CTGAGCCTGC TGGGCAACCT CATCCGGTTG CTGCCGCCGG CCAACGGGCC CTTCTTCAAC GGCGTGATCC GGCCGATGAT CCGGGCCCTG GTCGCGCAGC GATTCAGCCC GGACCGGTTG CCGGGGCTTG AGGCGATGCT GCCCCACGGG CCCTTGGTGC GCTGGCTGAA CCTGGCGGCC CCCGAGGCGG ATGCCGAGCT GGCGGTGATC GCCGGCGACG CCGAGGGCAG CGGGTTGATC CGCCGGCTGG GGGTGCTGTT GCTGGACCGG CTGATCTTTC CCGGCCGGCA CGATCTGGTG GTGGACACCC GCTCCATGTA CGAAGGCCTG CGGCGCCGGC AAGGGGTGTG GCAATACTTC CAGAGCCGGC CCGACATCTC CCACTTCAAC TATTTCGACA ACCCTGACTC ACAGCGGGCG CTGGTCGGTT GGCTGCTGGA GACCGACGAC CGGGCGCTGT TCCGAAGCCG CGATGCCGAG GCACCCTCCC TGGCGGCCGA GACCTTCAGC GGTGTGCTGG GCGTGAGCCG GGGTGACCCC CACCGGCCGC TGCTCTTCCT GCTCCCGGGG ATCATGGGCT CGCAACTGAG GGCGCGGGAA ACCCGCATCT GGATGGTCCG CCGGCAGATC CTGCTGGGCC GCATCACCCG GCTGGCCTAC GGCCGCAGTG GGGTGGAGAC CGATGGGGTC TTCAGTCGGT ATTACCAGCC GCTGGCCGAG TATCTGGCCG AGACCCATGA GGTGCATGTC TTCCCCTACG ACTGGCGGCT CTCCATCGAC GACGCGGCCC GGCGCCTGGC CGCAGAGGTG CGCCGGGCCC GCGGGAGCGG GGGGCGCCCG GTGCGGTTCG TGGCCCACTC CATGGGCGGG TTGGTGACCC GGGCCATGAT GGCCAGCGAC CCGACCTTGT GGGCGGAGAT CTCGGAGCAG CCGGGCTCGC GGCTGCTGAT GCTGGGCACG CCGAACCGCG GTTCGTTTGC CATCGTCGAG GCACTCCTCG GGCACGACCG GCTGGTGCGA ATGCTGGCGC TGCTGGATCT GCCGAACAGC CTGAACCGGA TCCTCGGCCT TTTGGCCGAG TACCCGGGGG TCATGGATCT GCTGCCCCAG GACGGCGAAC TGGACCTCTA CGACGTGGCC ACCTGGAGCC GCCTGACCCA CCACAACGGC CGCTTCGGGC ACGTGCCCGA AGCAGAGCGG TTGAGGGGTG CGCGGCTGCG CCGCGACCGG CTGCCGGCGC GGGTCTCGGG GAATGACCCC ATCGTCTACG TGGCCGGTCA GGCGCCGCTG ACGCCGGCCG GTATCCGTAT GGACGGCAGC GGCGACGGCC GGCTGCGCTT TGTCGGGACC ACCGAGGGGG ATGGCCGGGT CACCCACGCC GCCGGCGAGT TGCCGGGGGT GCCGCGTTGG TTCTGCGACG CGGTGCACGG CGATCTCCCC GCCACCCGCT CAGCCTTCCC CGCCTATCTC GACCTGCTGG AGACCGGGCG GACAGACCGT TTGCCCACGG AACCGCTCAG CCGCCGGCGT GGCCTGGGGG CAAGGGAGTT CCTCTACGAC CCTTCGCCGG CCACCCAGCC GGATGAGGCG ACACTGATCG CGGAGGCCCT GGGGGCCTCC GGGGACGCAG ACGAGGAGGC GGGCCGCCCC GGCCTGACGG TCTCGCTGGT ATATGCCGAT CTGGCCCAGG CGCGCCACCC GGTGCTCTGC GGGCATGCCG TGGGCGATGT GATCTCCGGG GCCGAGGCGG CCCTAGACGA GGCCCTGGGC GGGGCACTGT CCCGGCGCTA CCAACTGGAC CGCATCCCGG GCCGGGTCGG CAGCACGGCG GTGATTCTGC CCGACGGATC GCAGCGCTCG CCGGTGCCCG GCGCCGTGGT GATGGGCCTC GGTGAGCTGG GCGAGCTCAG CGCCAACGTG ATCGCCGAGG CCAGCTACGC CGCCTGCCTG GACTATCTGT TGGCGGTGGC CAGCCCGAGC GACGGCGCCT GGGTCGGGCT CAGCGCGATC CTGGTGGGCG GCAATAGCCC GGCCGCCCTG CACGAGGCGG ATATCATTCG ATCGATCATC CAGGGCGTGG TCCGGGCCAA CCGCGCCATG GTCGAGCAGG CGAATGGCAC CGGTGTGGCG ATCACGGAGT TGGAGATCCT GGAGCACGAC GGCCTTCGTT TCCAGGAGGC CTGCCAGTTC CTGCGCGCCG ATACCGGCCA ACTCAGCGAG GAGCTGGGCA CGGACCTCGT GCTGCGCCGG GAGGAAAGGC ATCCCGAGCC GCGGGTGGAG ACCGGCAGCC TGGCAGCGCG GCGGGCGACC CGCCGTTGGC GCCGGCTGAT CGTCACCGAG CAGGAGCGGC CCCCGCTGTT GAACTACGAA CTGCTGGGCC TGCGTGCGCG AATCCCCGGT CGGGCGGTGG ACTTCGATCC GGTCACGGTG GCGCGGCTAC TGGCGGAGGC CATGAGCGAT CCGCGCTTCG ACCGCGAGAC CGCCAATGCC CTCTACCATC TGCTGATCCC CAACCACCTG GGCGATCAGA TCGATCATTT CGATGCCCTG ATGCTGATCG TGGACGAGCG GACCGCGCGC CTGCCCTTCG AGATGCTGAG CAACAACCGC CGCCCCGGGG CGCGGCCCTG GGCGGTGCGC CAGGCCATGG TGCGCCAGTT CCGGACGCCC GAGTTCCGTC AGACCCTGAA GCCGGTGCGG GCGCAGAGCG CGCTGGTCCT GGGTGACCCG GTCACCCCCT TGCCGGAGTT GCCCGGCGCC CGGGCGGAGG CGCAGGCCGT GGCCACCGCG CTGCTGGGCC AGGGCTACGA TGTGCGGCTT CTGTTACAGC CGCGCGGGGC CGATGTGCTG CGTGCGCTGT TCGCCAAGCC CTGGCGGATA CTGCATGTGG CCGGGCACGG CATCCACGAG GCCCACGGGG AGGCCCCCGG GGCGTTGCGC AACGGCATCC TGCTGGACGA CGGCGCCCTG CTGGGCCGCC CGCAGATCAG CGAGCTGCCG GAGACCCCGG ACTTCGTGTT CCTGAACTGT TGCCACCTGG GTCGCATGGT CCAGCTCACC GCAGAGCAGC GGGTACGGTT TCAGCACTTC GCCGGTACCA TCGCCCGCAG CTTCATCAAC ATCGGCGTGC GCGGCCTGGT GGTCGGGGGC TGGGAGGTCA ACGACCGGGC CGCCCAGCGC TTCGCGGAGG TCTTCTACCA GGCCCTGCTC AACGGCGAAG CATTGGGCCT GGCCGTGTTG CGTGCCCGCG CGGCCTGCTG GGCTCAGTTT CCCGATTACA ACACCTGGGG GGCCTACCAG TGCTACGGGG ACCCGGACCT GCGGCTGGTG TGA
|
Protein sequence | MAIHLHFVGR EVESDDPDAA VFEVGSPSRS YDLSWVRRDV ECDATALIEI EDDDECILWT TAEDLAEDEA ARRGHLRGEG DADIALDALL AGASTSSRGV LRRFGRVVLR IFGRRVADEF AGRSAKAIAE YWEGQVTPGH GLYRCDRPEG PDAVDEAQRV RQGEGLPGDG PLLLLLHGTA SSIAGSFSKL GGSPRGDEPT VWATLQAHYR APDGRSRICA FQHPTLSASP IDNALQLVDD LPRGAELHLV THSRGGLIGD LLCLAGQDPS ALRGGLEQQY RRDEQWLAAT GRDLDDPGIA ALAEQDRADR EALLQLLERL RDKGIRVLRY VRVACPAAGT SLASERLDRL LSLLGNLIRL LPPANGPFFN GVIRPMIRAL VAQRFSPDRL PGLEAMLPHG PLVRWLNLAA PEADAELAVI AGDAEGSGLI RRLGVLLLDR LIFPGRHDLV VDTRSMYEGL RRRQGVWQYF QSRPDISHFN YFDNPDSQRA LVGWLLETDD RALFRSRDAE APSLAAETFS GVLGVSRGDP HRPLLFLLPG IMGSQLRARE TRIWMVRRQI LLGRITRLAY GRSGVETDGV FSRYYQPLAE YLAETHEVHV FPYDWRLSID DAARRLAAEV RRARGSGGRP VRFVAHSMGG LVTRAMMASD PTLWAEISEQ PGSRLLMLGT PNRGSFAIVE ALLGHDRLVR MLALLDLPNS LNRILGLLAE YPGVMDLLPQ DGELDLYDVA TWSRLTHHNG RFGHVPEAER LRGARLRRDR LPARVSGNDP IVYVAGQAPL TPAGIRMDGS GDGRLRFVGT TEGDGRVTHA AGELPGVPRW FCDAVHGDLP ATRSAFPAYL DLLETGRTDR LPTEPLSRRR GLGAREFLYD PSPATQPDEA TLIAEALGAS GDADEEAGRP GLTVSLVYAD LAQARHPVLC GHAVGDVISG AEAALDEALG GALSRRYQLD RIPGRVGSTA VILPDGSQRS PVPGAVVMGL GELGELSANV IAEASYAACL DYLLAVASPS DGAWVGLSAI LVGGNSPAAL HEADIIRSII QGVVRANRAM VEQANGTGVA ITELEILEHD GLRFQEACQF LRADTGQLSE ELGTDLVLRR EERHPEPRVE TGSLAARRAT RRWRRLIVTE QERPPLLNYE LLGLRARIPG RAVDFDPVTV ARLLAEAMSD PRFDRETANA LYHLLIPNHL GDQIDHFDAL MLIVDERTAR LPFEMLSNNR RPGARPWAVR QAMVRQFRTP EFRQTLKPVR AQSALVLGDP VTPLPELPGA RAEAQAVATA LLGQGYDVRL LLQPRGADVL RALFAKPWRI LHVAGHGIHE AHGEAPGALR NGILLDDGAL LGRPQISELP ETPDFVFLNC CHLGRMVQLT AEQRVRFQHF AGTIARSFIN IGVRGLVVGG WEVNDRAAQR FAEVFYQALL NGEALGLAVL RARAACWAQF PDYNTWGAYQ CYGDPDLRLV
|
| |