Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0646 |
Symbol | |
ID | 4270836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 697261 |
End bp | 698394 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638125395 |
Product | hypothetical protein |
Protein accession | YP_741490 |
Protein GI | 114319807 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.436257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.112784 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGC CCGCCCCCAC CAACGAAGCC CCCGACCTCT CCGGACATCC CCGGGCCGGG GACGTGCGCC GCTTCGCCAC CGGCGAGGCG CTGGTCCTCG CGCCGCTGCC ATTACCGCTC CCGGTGCGCC CCATGGACCC GAAGCAGTTC CTCGTCCACA TCGACCAGAC CTACCTGGAT CTGGACTCCA GCCGCCACAC CGCCCAAGCC CACCAGGTCA TGATCGGCAT GCCCTTCTTC ATCGGCGTGA TCTTTATCGG GCTCGGTGCA CCGATGCTTA TCGGCGCGGC CGGACTCGTG TATGGCGAAC CATTCTGGGC AAATGCCCTC TATGCCGCCA TAGTCTCCAT CCCCTACGGC CTTTTCGGCG GCACCCTGTT GTTTCTGATC CTCCTCTACG GCTTCGTCGA CCGCATGAAG CAGGCCCGGC GGCATCCCCC GGTGCGCTTC CATCGCCAGC GCCGGGAGGT CGCCTGCTTC GACCCCGAAA CCGGCCAAAC CCTCGTCGCC CCGTTCGAGC GCGTCACCGC CTGGATGGCC ACCAGCAGCG GCGCCACCCC CTACGGCGCC ATGACCCACT ACAACTTCGG CCTCACCGTC GAGGACGCGG AAACCGGACA GTCCTATACC GCCCTCTTCC CCGCCTCGCT CCCCGAGGAG GCCCTGGGCC TGTGGGAGGC CATCCGCCGC TACATGGATC ACGGGCCGGG CACGCTCGAA CGGCCCACGA AAACCTTCTC CGGCTTGCCC ATCGACCCCA GGGAGCACCT CCCCTACGAC GGCGTCCACA CCCTCGAGAT CGCCCGCAAG AAACTCCACG AAGACCTTCG TGATGGCTTC ACCAGCCGGG TCTTCGTCTT CTTCTGGTAC CTCTACCACC TGATCACCTT CTGGAAGCTG CCCTTCCGGC TGGCCACCTG GGAATACCAC CAGAGCCGCG CACCCATTCC CCCCGAGATC CAGGCCTGGT CCGAACCCAT CCCGGAGCAC GACTGGGCCA CGCCCAGCCC CGAACTGGAG GCCGCCGCCC GGCGCATGGT GCAGGCCGGC GAGCAAGACC CCGACATCAA ACTCCCCGAG CTGCTCGCCG CCGGCAGATC CTGCAAACTT CAAAAGGTGC GTATGCATCC ATGA
|
Protein sequence | MSKPAPTNEA PDLSGHPRAG DVRRFATGEA LVLAPLPLPL PVRPMDPKQF LVHIDQTYLD LDSSRHTAQA HQVMIGMPFF IGVIFIGLGA PMLIGAAGLV YGEPFWANAL YAAIVSIPYG LFGGTLLFLI LLYGFVDRMK QARRHPPVRF HRQRREVACF DPETGQTLVA PFERVTAWMA TSSGATPYGA MTHYNFGLTV EDAETGQSYT ALFPASLPEE ALGLWEAIRR YMDHGPGTLE RPTKTFSGLP IDPREHLPYD GVHTLEIARK KLHEDLRDGF TSRVFVFFWY LYHLITFWKL PFRLATWEYH QSRAPIPPEI QAWSEPIPEH DWATPSPELE AAARRMVQAG EQDPDIKLPE LLAAGRSCKL QKVRMHP
|
| |