Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2084 |
Symbol | |
ID | 4269403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2361885 |
End bp | 2362844 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 638126840 |
Product | hypothetical protein |
Protein accession | YP_742916 |
Protein GI | 114321233 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase [COG1051] ADP-ribose pyrophosphatase |
TIGRFAM ID | [TIGR00586] mutator mutT protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.498942 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTC TGCACGTGGC CGTGGGTGTC ATCCTCGACG ACCGGCAGCG GGTACTGGTG GCGCGCCGGG CCGCCCACCG CCACCAGGGC GGGCGGTGGG AGTTTCCCGG CGGCAAGGTG GAGCCGGGCG AGACGGTGGT GCAGGCGCTC TGCCGCGAAC TCGAGGAGGA GTTGGCGATC AGCCCCACCC GCACCTCGCC GATGATGCGC ATCGAACACG ACTACCCGGA CCGCCGCGTC AGCCTGGATG TGCACCGGGT GAGCGCCTGG CGGGGCGAGC CACGCGGGCT CGAGGGCCAG CCGCTGGCCT GGCTGAGGGC CACGGAGTTG GCCCGCCGGC CTTTTCCGCA GGCCAATCTC CCCATCATCC GACGGCTGGC CCTGCCGCCC TTTCTGATCA TCACCGAGCC GCTGGCCCCC GGTGACCTGG CGGGCCTGGC GCGCCGGCTC CAGTCGCTGG CCGTGCCGGC TCGCGGGGCC TGGCTGCAGC TGCGTCTGCC GGACTGGGAT GATCGGGCCT ATGGCCGGGC GCTGGCGTTG GCCATCAGGA CCCTGGGGCC CCGGGGGGTG GACGTGACCG CGAACCGCTC ACCCGCGGTG GCACGCCGCG CCGGTGGTCA CGCCCTGCAC CTGAACGCCC GCGCGCTGAT GGCCTGCGAG GCGCGTCCCG AGGGCTTTGT GCGGGTGGGG GCCTCCTGCC ACAGCCCTGA GGAACTGGCC CGGGCCGAGG CCCTGGGGCT GGACTATGCG CTGCTCTCTC CGGTCGCCGC CACGGCCTCG CACCCCCGGC AGGTGCCGTT GGGCTGGGAG CGATTCCGGG ACTGGCTGGG CCGGGTGGAC CTGCCGGTCT ACGCCTTGGG TGGCTTGGGG CCGGAGGCGC TGGAGTTGGC CTGGGCCCAT GGGGCGCACG GGGTGGCGGG GATCCGCGGC TTCTGGCCGC CGCGCGGATC GCCGCCATAG
|
Protein sequence | MARLHVAVGV ILDDRQRVLV ARRAAHRHQG GRWEFPGGKV EPGETVVQAL CRELEEELAI SPTRTSPMMR IEHDYPDRRV SLDVHRVSAW RGEPRGLEGQ PLAWLRATEL ARRPFPQANL PIIRRLALPP FLIITEPLAP GDLAGLARRL QSLAVPARGA WLQLRLPDWD DRAYGRALAL AIRTLGPRGV DVTANRSPAV ARRAGGHALH LNARALMACE ARPEGFVRVG ASCHSPEELA RAEALGLDYA LLSPVAATAS HPRQVPLGWE RFRDWLGRVD LPVYALGGLG PEALELAWAH GAHGVAGIRG FWPPRGSPP
|
| |