Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0827 |
Symbol | |
ID | 4268252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 933247 |
End bp | 936660 |
Gene Length | 3414 bp |
Protein Length | 1137 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 638125578 |
Product | hypothetical protein |
Protein accession | YP_741671 |
Protein GI | 114319988 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCG AGAGCCTGCA CATTCGGCAA CTGCCCGGCA TCCACCCCGG ATTCGACCTG GAGGGGCTGG ACCCGCGGAT CAACCTGCTG CTGGGGCCCA ACGCCTCCGG CAAGAGCAGC GTGGTGCGGG CCCTGCGCCA TCTGATGGAG ACCCGCACCG ACGACCCCGC CAACCTGGTG CTGTCGGCCA CCTTCAGCGA GGGAGAGCGG CAATGGCAGG TGGATCGGCT GGGCCGCGAC ATCCACTGGC GCTGCGACGG CGAACCCTGC GCGCCCCCAC CCCTGCCCGG CGCCGAGGGG CTCGGCTATT ACTGGATCGG GCTCGACCGC CTGCTCGAAA CCACCCGCGA GGACCAGGCC GCCGAGGCCA CCCTGCGCCG CGAGATGCAG GGCGGCTACG ACCTGCAGGC CCTGCGGGAG AGTCAGCCGT TCAGCCTGTC CGCCCAGCGG GGGCGCATCG CCGCCCGGGA ACTGCAACAG CGCCAGCAGG CCCTGCGCCG GGTGGAGCGC GAACACCAGG CGCTGGCCAG CGACGAGGCC CGACTGCCGG AACTCAGCGA GGCCGTGGCC GAGGCGCAAC GGGCCCGGGA ACGCCGGCAG GCCTGTGAGC AGGCCTTGCT GGCCCTCGAC GCCGCGCGGG AACTGCGCCA CCAGGAGGAT CGCCTGGCCC GCTATCCCGA TCCCATGCCC GTCGGCCTGA CCCGCGACCG GCTGGAGACC CTGGAGCAGC AGGAGGCGGA ACTGGCACGC GCCCTGGCCA GCGCCCGCGA CGACCAGGCC CGGGCGGAGG CCGACCGGCA GGCGACCGGT CTGGCGGATG GCGGCCCGCC ACCGGCGGAG CTGCAAGCCG CCGGCGAGCA GGCCCGTCGG CTCACCCGGC TGGAGGAACA GGTCCGCCAT ACCCACGAGC AACTGGCTGC CACCCGCCGG GCCGCGGCCC AGGCGGCCCG CGCCCTGGGC CGCGGCCCCG ACGACGACCG GGCGCAACCG GCGCTCTCCC CGGAGGAGCT GGCCGGCCTG GAGCGGCTGG CCCACGAGGC CCACGCGGCA CGGGAACGCC TCCAGGCGCT GGACGCCCGC CTGGCAACCC TGGAGCCCAA CGCTGAGCCC TTCGACCCCG CGCCACTGGA ACGGGGCTGC CATGAACTGC GCCGCTGGCT GCGCCAGCCT GCCCCGCGGC CGCTGCACTG GCTGGGCCTC GGGCTGACCG GGCTGGGCGG CGCGGGTACC GCGGCACTGG GGCTCACCCT TGGCCACTGG CCCACCGTGG CCAGCAGCCT GGTGGTGCTC GCCGGCCTCG GTGCCAGTGC CGTGGCAATG GGGCGCCGCC GGGACCGGCG CGAGAGCGAG ACCCGGTTTA CTGAACTGCC GCTCCCCGCC CCGGAGGTCT GGGAGGAGGC CGCCGTCCGG CAACGCCTGG ATGAGCTGGA ACAGGCCTGG CACCAGGCCC GAAGCCTTCA GCAGCGGCGC CAGGAGGCCG ACCAACTCCA CGCGCAAAGG GGCCGGGCGC GCACAGAACA GGCTGAGGCG GAGCAGGCCC TCCACCGGCA TGCCGCCGCC CTGGGCCTGG ACGCGGAACT GCCGCTGGCC TTCGACCGCG CCGTCCGCCT GCTGAGCCGC CATCAGGAGG CCCGCGAACA ACAGGCCGCC CAGGAGGCCG TACTGGACCG CCAGGCCCAG GAACTCGACA CCCTGCGCGA CACCCTGCAC CACTTCCTGG CGACCTGGCA CACCGCCCCG CGCGAGGAGC GCAGCGAGGC ACTGGGCGCC GCCCTGGACG ACCTGCGGCA ACGCTGCGAG GCCGCCGAGC GTGCCCGCCA GCAGGCGGAC AACGCCACTC GACTGATCCG GGAATTGGAG CGCCAGCTTC GGGATACCCG CCGGCAGCTC GACCAGCTCT ACCACGACGC CGGCCTGCAA CCCGACCAGC GGGCCACCCT GCTGGAACGC ATCGACCAGC TCCCTGAGTG GCAGGCCTGC CGGCAGGCCC TGGAGGCCGC ACGGAGCAAT TACCGGCTGC GCCGTGAGGA CCTGGAGGGG CAGGCCGACC CCGACATCAT CGAGTGGCTG GAGGCCGGCG ATGAGCCCGC CCTGCGCCGC GCCGCTGAGG ACGCCGCCGA GGCCGCCGGA CGGCTGGAGC CGCTGCGCGA GGAGCGCGCC GGCATCCGCA CCCGCCTGGA ACAGACCCGC CAGCGCCACG ACCTGGAGGA GGCCCTGGCC CAGCGCGAGG AGGCGCGCGA GGCGCTGGCC GGAGAGCGCG AACAGGCCCT ATCCGCGGCC GCGGCCCGGT TCCTGCTGGA GCGCGTGGCC CGCCGGCACG AACAGGCCCA CCGCCCCGAG GCCCTGGCCC GGGCGGACCG GCTCTTTGCC CGCTTCACCC ACCAGCGCTA TGGGTTACGC CTGGGCCCGG ACCAGCGACT GCAGGCAATG GACCACCATA GCGAACAGCC CCAGCCGCTG GAGCGCCTCT CCACCGGCAC CCGCATGCAA CTGCTGATCG CCCTGCGAGT GGCGTGGCTG GAGCAACTGG AGCGCCAGAC CCGGCCGCTA CCCCTGATCC TCGACGAGGC CCTCACCACC ACCGACCCGG AACGCTTCCA GGCGGTGGCC GGCAGCCTCG CAGCGCTACT GGAGACCGGC CGCCAGATCT TCTACCTGAG CGCCCAGCCG GAGGACGCCC GGCGCTGGGA GCTCGCCCTG GGCCAGCGGC CCCACTGCAT CGAACTGGCC CAACTGCGCG GTACCGGTTC CGCACTGTCC GACCAAGCCC TGCAACTGCC GGAGGCGGAG CCGGTCCCGG CCCCGGACGG GCACACCGCC GAAAGCTACG CCCGCGCCCT GCAGGTGCCC GGCATCGACC CCTGGCGTCC GGCCGGCGAG ATCCACCTCT TTCACCTGTT GCGCGACCGG CTCGACACCC TCCACCGGCT GCTGCGCGAC TACCGCGTCC ACCACAGCGG CGAGTTGCAG CGGCTGCTGG AGGACCCGGC CCTTAAACAC CACCTGCCAG CGGAACTGCG CGAGCAGCTC TCGCGCCGCG TGCAACTGGC CCACCACTGG CTGCAGGCCT GGCGCCAGGG TCGCGGCCGG CCGGTCACCC GGGCGGTGCT GGAGGCCAGC GGGGCGGTGA GCGACACCTT TATGCCACGG GTCGCCGAAC TCAACGACGC CCACCACGGC GATGCCCGGG CCCTGCTGGA GGCCCTGGGT GCCGGCGAGG TCTCAGGCTT CCGCCGGGCC AAGCTCGAGG AGCTGGAGAG CTACCTGGAG GCGGAGGGCC ACCTGGACCC GCAGCCCCGA TTGGAGCGCG ACGAGCGCTA TCGCGCCGCG CTGGCCAAGG TGGAGCTGCC CCTGGAGGCG ATCGGGCGCG ACGCTGAAAT CATCGACTGG CTGGAGGCGG CGCTACTGGC CTGA
|
Protein sequence | MKLESLHIRQ LPGIHPGFDL EGLDPRINLL LGPNASGKSS VVRALRHLME TRTDDPANLV LSATFSEGER QWQVDRLGRD IHWRCDGEPC APPPLPGAEG LGYYWIGLDR LLETTREDQA AEATLRREMQ GGYDLQALRE SQPFSLSAQR GRIAARELQQ RQQALRRVER EHQALASDEA RLPELSEAVA EAQRARERRQ ACEQALLALD AARELRHQED RLARYPDPMP VGLTRDRLET LEQQEAELAR ALASARDDQA RAEADRQATG LADGGPPPAE LQAAGEQARR LTRLEEQVRH THEQLAATRR AAAQAARALG RGPDDDRAQP ALSPEELAGL ERLAHEAHAA RERLQALDAR LATLEPNAEP FDPAPLERGC HELRRWLRQP APRPLHWLGL GLTGLGGAGT AALGLTLGHW PTVASSLVVL AGLGASAVAM GRRRDRRESE TRFTELPLPA PEVWEEAAVR QRLDELEQAW HQARSLQQRR QEADQLHAQR GRARTEQAEA EQALHRHAAA LGLDAELPLA FDRAVRLLSR HQEAREQQAA QEAVLDRQAQ ELDTLRDTLH HFLATWHTAP REERSEALGA ALDDLRQRCE AAERARQQAD NATRLIRELE RQLRDTRRQL DQLYHDAGLQ PDQRATLLER IDQLPEWQAC RQALEAARSN YRLRREDLEG QADPDIIEWL EAGDEPALRR AAEDAAEAAG RLEPLREERA GIRTRLEQTR QRHDLEEALA QREEAREALA GEREQALSAA AARFLLERVA RRHEQAHRPE ALARADRLFA RFTHQRYGLR LGPDQRLQAM DHHSEQPQPL ERLSTGTRMQ LLIALRVAWL EQLERQTRPL PLILDEALTT TDPERFQAVA GSLAALLETG RQIFYLSAQP EDARRWELAL GQRPHCIELA QLRGTGSALS DQALQLPEAE PVPAPDGHTA ESYARALQVP GIDPWRPAGE IHLFHLLRDR LDTLHRLLRD YRVHHSGELQ RLLEDPALKH HLPAELREQL SRRVQLAHHW LQAWRQGRGR PVTRAVLEAS GAVSDTFMPR VAELNDAHHG DARALLEALG AGEVSGFRRA KLEELESYLE AEGHLDPQPR LERDERYRAA LAKVELPLEA IGRDAEIIDW LEAALLA
|
| |