Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0647 |
Symbol | |
ID | 4270837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 698391 |
End bp | 699566 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125396 |
Product | hypothetical protein |
Protein accession | YP_741491 |
Protein GI | 114319808 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.268839 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0829547 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGC CCGCCCCCAC CAACGAAGCC CCCGACCTCT CCGGACACCC CCGGGCCGGG GACGTGCGCC GCTTCGCCAC CGGCGAGGCG CTGGTCCTCG CGCCGCTGCC ATTACCGCTC CCGGTGCGCC CCATGGACCC GAAGCAGTTC CTCGTCCACA TCGACCAGAC CTACCTGGAT CTGGACTCCA GCCGCCACAC CGCCCAAGCC CACCAGGTCA TGATCGGCGT GCCCTTCTTC ATCGGCGTGA TCTTTATCGG GCTCGGTTTT CCGTTGCTTA TTGGCACGAC CGCGTTCGGG ACCGACCACA CCTTTTGGGC AAGCGCCCTG CATGCCGCCA TTGTCTCCAT CCCCTACGGC CTTTTCGGCG GCACCCTGTT GTTTCTCATT GCCCTCCACG GCTTTTTCCA CCGCATGAAG CAGGCCCGGC GGCATCCGCC GGTGCGCTTC CATCGCCAGC GCCGGGAGGT CGCCTGCTTC GACCCCGACA CCGGCCAGAC CCTCGTCGCC CCGTTCGAGC GCGTCACCGC CTGGATGGCC ACCAGCAGCG GCGCCACCCC CTACGGCGCC ATGACCCACT ACAACTTCGG CCTCACCGTC GAGGACGCGG AAACCGGACA GTCCTATACC GCCCTCTTCC CCGCCTCGCT CCCCGAGGAG GCCCTGGGCC TGTGGGAGGC CATCCGCCGC TACATGGATC ACGGGCCGGG CACGCTCGAA CGGCCCACGA AAACCTTCTC CGGCTTGCCC ATCGACCCCA GGGAGCACCT CCCCTACGAC GGCGTCCACA CCCTCGAGAT CGCCCGCAAG AAACTCCACG AAGACCTTCG TGATGGCTTC ACCAGCCGGG TCTTCGTCTT CTTCTGGTAC CTCTACCACC TGATCACCTT CTGGAAGCTG CCCTTCCGGC TGGCCACCTG GGAATACCAC CGGAGCCGCG CGCCCATTCC CCCCGAGATC CAGGCCTGGT CCGAACCCAT CCCGGAGCAC GACTGGGCCA CGCCCAGCCC CGAACTGGAG GCCGCCGCCC GGCGCATGGT GCAGGCCGGC GAGCAAGCCC CCGACATCAA ACTCCCCGAG CTGCTCGCCG CCGGCATCGC CGACTGGCAC CCGGACCACG ACGGCGGCAA CGGCAACGGC AACGGCAACA ACGAGCGGAC CCCACGGAAA CCATGA
|
Protein sequence | MSKPAPTNEA PDLSGHPRAG DVRRFATGEA LVLAPLPLPL PVRPMDPKQF LVHIDQTYLD LDSSRHTAQA HQVMIGVPFF IGVIFIGLGF PLLIGTTAFG TDHTFWASAL HAAIVSIPYG LFGGTLLFLI ALHGFFHRMK QARRHPPVRF HRQRREVACF DPDTGQTLVA PFERVTAWMA TSSGATPYGA MTHYNFGLTV EDAETGQSYT ALFPASLPEE ALGLWEAIRR YMDHGPGTLE RPTKTFSGLP IDPREHLPYD GVHTLEIARK KLHEDLRDGF TSRVFVFFWY LYHLITFWKL PFRLATWEYH RSRAPIPPEI QAWSEPIPEH DWATPSPELE AAARRMVQAG EQAPDIKLPE LLAAGIADWH PDHDGGNGNG NGNNERTPRK P
|
| |