Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0800 |
Symbol | |
ID | 4270564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 893186 |
End bp | 895351 |
Gene Length | 2166 bp |
Protein Length | 721 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638125551 |
Product | hypothetical protein |
Protein accession | YP_741644 |
Protein GI | 114319961 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCAAT ACGAACACCG CCCGACGTCT TCGGACCGTG ATGACGAGAT CGAGATCGGC CGTCTGGTCG AGATCCTGCT GAACGGCAAA TGGATCATTG CCGGTCTCAG TGCATTGGGG CTGGCCCTGG GCGGGTTCTA TGCCTACACC AGCACACCGG TCTACAGCGG CGACATGCTG CTGCAGGTGG AGAGTAAGCA GGCGACGCTA CCGGGGCTGT CGGAGATGCT GGGCGGCGAA CAGCAGGTGC CCACCAGTGC GGAGGTGGAG CTGCTGCGAT CGCGCATGGT GGTGGGTTCG GTGGTGGACC AACTGGACCT GGCCGTGCAG GCTCGGCCCT TGCCGACGGA CTGGCGCGAG GCGGTGCTGG GTGTGAGCAA CCCCCGGCGC CACATTGAGG TGGGCCGCTT CGAGGTCTCG CCGGCCCTGG AGGGCCAGAC GTTCATCGTC CGTGTGCTGG GCGAGGGGCA GTTCGCCCTG CTTGAGGACG ACGGCGGGGC GGAACTGGCG CGCGGGCGGG TGGACGAGAC CCTGGTGCTG GACAACCCGG ACGGCAGCTC GCTTCAGCTC TTTGTCCACG TGCTCACCGG CGAGCCGGGC GACACCTTCG AGCTGGTGCG CCAGCCGCGG CTGCGCGCCA TTCAGAACCT GCGCAACGGC TTGAACGTCA GTCAGCGCGG TGAGTCGGGG ATCCTGGAGG TCACTTACGA GCACCCGGAC CGCCAGCTTA TCGAGGAGGT CCTCAACACC CTGGGTATGG TCTACGTGCG CCAGAACGTG GAGCGTCGGA CCGAGGAGGC GGCCCGCAGC CTGGAGTTCC TGGAGGAGCA GCTGCCACAG TTGCAGGACC GACTGCAGCA GGCCGAGGAT GCCTTCAACG CCTTCCGCCG CGAGCACCAG GCGGTGGATA TGGATGAGAA CACCCGGGTG ATGCTCAGCC AGTTGGTCGA GGTGGAGAGC GAACTCCAGG CCTTGCGCCT GGAGGAGAGC GAGAAGAGCC TGCGCTTTGG TCGCGAGCAC CCGCAAATGC AGTCGCTGCG CCAACGGCGC CAATCGCTGG AGACCCTGCG TGCCGAGCTG GAGGAGGAGC TGGGGGAACT GCCGGAGCGC CAGCAGCAGT TGGTGCGCCT GCGCCGCGAG GTGGAGGTGA ACACCCAGCT CTACACCAAC CTGCTCAACA CCGCCCAGGA GTTGCGGATC TCCCAGGCCG GCACGGTTGG CAATGCCCGC GTGGTGGATG ACGCGGCCGT GGGCTTCAAT CCAGTGGCGC CGCAGACCAC CCTGATTATG GCGCTGAGCC TGCTCCTGGG CGGCATGCTG GGGGTGGGCA CCGTGTTCGG CCGCGAGATG CTCCGTCGCG GGGTCGAAGA CCCGGACGCC CTGGAACTGG AGGTGGGCAT CCCGGTCTAC GCCGTGGCGC CCCACAGCCC GGCGGCCCTG CAGCTCGAGA AGAAGGGCCG TCGACAGCGG ACCCAGGTGC CGCTGCTGGC GGAGAAAAAC AGCCAGGACC CGCTGGTGGA GAGCCTGCGC AGTCTGCGCA CCAGCCTGAA CTTCGCTCTG CTTAACAACG CCCGTAACGT CCTGGCCCTG ACCAGTACCG GCCCGGGGGA GGGCAAGACC ACCCTGTCCG TCAACTTGGC GGCGGTCCTG GCCCAGAGCG GCCAGCGGGT GCTGCTCATC GATACCGACA TGCGCCGGGG CCATCTGCAT ATCTTTCTGC AGAACCGGCG GCGCGAGCCG GGCCTGTCCG GGGTGCTGGC CGGGCAGGCC ACACTGGAGG AGGCGGTCTC ACGCATCAGG GAGAATCTCG ACGTGCTGCC CGCCGGCACC TTCCCGCCGA ACCCCTCGGA GCTGCTGATG CAGGAGGGAT TCGGCCGGCT GATCGAGGAG CAGCGCCAGC GCTATGACCT GGTGATCCTG GACACCGCGC CAGTGATGCC GGTCACCGAC GGCGTCCTGG CGGCCGCCCA CGCCGGGCCG GTGTTCCTGG TGGCCCGGGC GGGCTATGTG ACCACCCGCG CGGTGCAAAG CACCATCTGG CGGCTGGAGA AGAACCAGAT CGACACCACC GGGCTCGTGG TCAACGACCT CAACCCGAAA CGGAGTGGGC GCTCGAGCGA TTACTACTAT TATCAGTACC AGTACAAGGC GCGGGCCAAG GACTGA
|
Protein sequence | MSQYEHRPTS SDRDDEIEIG RLVEILLNGK WIIAGLSALG LALGGFYAYT STPVYSGDML LQVESKQATL PGLSEMLGGE QQVPTSAEVE LLRSRMVVGS VVDQLDLAVQ ARPLPTDWRE AVLGVSNPRR HIEVGRFEVS PALEGQTFIV RVLGEGQFAL LEDDGGAELA RGRVDETLVL DNPDGSSLQL FVHVLTGEPG DTFELVRQPR LRAIQNLRNG LNVSQRGESG ILEVTYEHPD RQLIEEVLNT LGMVYVRQNV ERRTEEAARS LEFLEEQLPQ LQDRLQQAED AFNAFRREHQ AVDMDENTRV MLSQLVEVES ELQALRLEES EKSLRFGREH PQMQSLRQRR QSLETLRAEL EEELGELPER QQQLVRLRRE VEVNTQLYTN LLNTAQELRI SQAGTVGNAR VVDDAAVGFN PVAPQTTLIM ALSLLLGGML GVGTVFGREM LRRGVEDPDA LELEVGIPVY AVAPHSPAAL QLEKKGRRQR TQVPLLAEKN SQDPLVESLR SLRTSLNFAL LNNARNVLAL TSTGPGEGKT TLSVNLAAVL AQSGQRVLLI DTDMRRGHLH IFLQNRRREP GLSGVLAGQA TLEEAVSRIR ENLDVLPAGT FPPNPSELLM QEGFGRLIEE QRQRYDLVIL DTAPVMPVTD GVLAAAHAGP VFLVARAGYV TTRAVQSTIW RLEKNQIDTT GLVVNDLNPK RSGRSSDYYY YQYQYKARAK D
|
| |