Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2175 |
Symbol | |
ID | 4270954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2472727 |
End bp | 2473713 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126931 |
Product | protein of unknown function DUF900, hydrolase family protein |
Protein accession | YP_743007 |
Protein GI | 114321324 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.334078 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.109087 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCACG ACTTTGTGGT CTGCGTCATC AACACCCGTG TCCGCAGCGG CAAGCGGGTC TTCGGCCGCG CGCCGGGGCC CACCCGGTTC CTGCTCGTCC CCGATGGCGA GGTGCAGCAA CCCGCGCACA CCGTGCCCCG CGCCGAGTGG GTGGAGGCGG TCATGGCCGC CGGCACCACC GGCAGGGACC CCATGTCCGA CAACCCCACC GGCAACGTCC TGGTCTTCAT CCACGGCTAC AACAACAGCC AGGAGATCGT CATCAAGCGC CACCGCAAGC TCAAGGCGAC GCTGCACGCG GCCGGCTACC GGGGCACCGT GGTCAGCTTC GACTGGCCCA GCGCCGAGGC CACGCTGCTC TACATGCGCG ACCGCCGCTA CGCCAAGCAC ACCGCGGAGC GGCTCACCGA CGACTGCATC AGCCTGTTCT CGACACGCCA GGCGCGCGGC TGTGACCTGA ATGTCCACCT GCTGGGCCAC TCCACCGGCG CCTACGTCAT CCGCCACGCC TTCGCCGACG CCGACGAGGT CGCCGAGATC AAGAACCGCC CCTGGAAGGT CAGCCAGATC GCTCTCATCG GTGCCGATGT CTCCAGCAGT TCCCTGGCCG CCGATGACTC GCGCTTCGTC TCCGTCTACC GCCACTGCTC CCGGCTCACC AACTACCAGA GCGGCCACGA TGGCGTGCTG CGCGTCTCCA ACGCCAAGCG CATCGGCCTG CGCGCCCGCG CCGGCCGGGT CGGCCTGCCC GACAACGCCC ACCGCAAGGC GGTGAACGTG GACTGCAGCC CCTACTTCGC CGGCATCGAC CCGGACAGCC GCACCCCCGG CGAGGACTAC TTCGGCAACT TCGCCCACTC CTGGCACATC GGCGACCCGC TGTTCGCCCG CGACCTCTGC CACACCCTGC ACGGCGAACT GGACCGCCAC TCCATCCCCA CCCGGCGCGA GGAGGACGAC CGGCTGTACC TGCACGACCC CGGCTGA
|
Protein sequence | MSHDFVVCVI NTRVRSGKRV FGRAPGPTRF LLVPDGEVQQ PAHTVPRAEW VEAVMAAGTT GRDPMSDNPT GNVLVFIHGY NNSQEIVIKR HRKLKATLHA AGYRGTVVSF DWPSAEATLL YMRDRRYAKH TAERLTDDCI SLFSTRQARG CDLNVHLLGH STGAYVIRHA FADADEVAEI KNRPWKVSQI ALIGADVSSS SLAADDSRFV SVYRHCSRLT NYQSGHDGVL RVSNAKRIGL RARAGRVGLP DNAHRKAVNV DCSPYFAGID PDSRTPGEDY FGNFAHSWHI GDPLFARDLC HTLHGELDRH SIPTRREEDD RLYLHDPG
|
| |