Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1142 |
Symbol | |
ID | 4269637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1336220 |
End bp | 1337359 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125891 |
Product | hypothetical protein |
Protein accession | YP_741981 |
Protein GI | 114320298 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCCACG GCCATGCCGC CGTCTCCGTG GCGCTGGGCG GCGGCGTCGG GCTCGACACC GCCGGGGGCC GTCTCGCCGT CAACGGCCTC GCCCCCGTGG AGCGCGACGG CGTCGATGCC CGACTGGAGG CCTTCGCCGG CACCGGACTG GCCGGCCACA ACCACTGCCG TCTGCTCTGG CAGCCGCCGG CGAACCTGCT CGCGCGCCTG CCGCGCTACC AGGCCATGGC CGAGATCGAC CGCGCGGGCT ACGCCCGGGA CGAGGCCCGC CAGTGGAAAA CCCTGACCGC CGCCGAGATC AACCCCGAGG TGCGCGTCGG CGTCGGCGGC GAGGCCGCCT TCCGGCTCGG CCTGCACAAC GGCCGCTTCG TGCTGCACGC CTCCCTGCGC CTGGTGCTCG GCGTCGGCGG CGGGGGCAGC GTGCGCCTGG CGCTTGACCC CCGCCACCTC GACCTCTGGC TCGCCATGAT GCACCAGGCG CTGGTGGAGG TCGGCTACGA GCGCGTCGAC TGGATCGACG AAGACGCCTT CGAGGAGATG AGCCGCCTGG CCTATCTCGC CGCCATCACC CTGGTCGAAC CCGCCCTGCT CCTGCTGCGC GGCACCCACC GACTGCGCCA GATGATCGAG TGGTTTACCC GGGACCGGGA CATGGCCAGC CGGATCGCCT ACACCATCGT CAACGACCCG CAACGGGACG CCATTGCCGC CTGGGTGCGC CAGCTACCGC CCGAGGCCCT GGGGCCGCTG TTATACACCC TGACCAGTCG GCCGCAGGCG TTCGAGGTGG AAGGCGACCA ATACAACGCG GAACAAGCAC GAGAATTCCA CCAACGGGCC ATCCTCAATT GCCTGCAATG GATCGTCTCT GGCGCCCAGA ATGGCGTTTA CGGGCCGCGG CGCGAGTTCT CGGCCGAGCA GCCCAACCCG GCGCAGAAAT TGTTCGAGAA GGCCGTGGTG CGCATGGGCC GAGATGGACA GCCTACCGAC GAATCCAGGG CCGATGCTTA TGCGGAAAAC CGCAAGCGTC TGGATGACTT TATGCCTGGA GCCTCACCGA CCAGCCGCGT CGGCGCGATC GGCGAACGGT ATGAGCGTGC GGTAAGAACC CTCTCCCGAC ACATTACCCC GCTGAACTGA
|
Protein sequence | MLHGHAAVSV ALGGGVGLDT AGGRLAVNGL APVERDGVDA RLEAFAGTGL AGHNHCRLLW QPPANLLARL PRYQAMAEID RAGYARDEAR QWKTLTAAEI NPEVRVGVGG EAAFRLGLHN GRFVLHASLR LVLGVGGGGS VRLALDPRHL DLWLAMMHQA LVEVGYERVD WIDEDAFEEM SRLAYLAAIT LVEPALLLLR GTHRLRQMIE WFTRDRDMAS RIAYTIVNDP QRDAIAAWVR QLPPEALGPL LYTLTSRPQA FEVEGDQYNA EQAREFHQRA ILNCLQWIVS GAQNGVYGPR REFSAEQPNP AQKLFEKAVV RMGRDGQPTD ESRADAYAEN RKRLDDFMPG ASPTSRVGAI GERYERAVRT LSRHITPLN
|
| |