Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0500 |
Symbol | |
ID | 4268436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 548323 |
End bp | 549564 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638125241 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_741344 |
Protein GI | 114319661 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.565018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATC TCAGTTGGCC AGATGTCAGT CTCGGGAATA TATTCACCAT CAACACTAGT GCGGTGATCC CGAATGCGGC CCCCAATACC GAGTTCTATC ACCATAGCCT GCCCGCTTGG GACGCCACAG GGGGACCTAC GGTGGAAAAA GGCTCTTCGA TCGAAAGCAA TAAAGTCAAT ATCACCAAGC CTTGCGTGCT GGTTTCAAAG TTGAACCCAA GGAAACCTCG GGTATCTGTG CTTGAATCGG TGGGAAAAGA CGAGCGTCAT TGCGCCTCAA CAGAATTCGT TTGTCTTGAA CCAAAAGCCA AAGAGCACCT CAGGTTCTGG GGCCATCTTT TTTCGAACAA GAGGTTTGCA GGCCACCTTG ATCGCATGGC TATTGGTTCG ACCAATAGCC ACAAACGGTT TAGCCCCGGG GTGCTCTTGT CCTTAAGGAT TGAGCTGCCC TCAGAGCCGG AAAGGAGGCT GATTGCCCGA ATCCTCGACA CCCTCGACAC CCAGATCCAG AAAACCGAGG CGCTTATTGC CAAGCTGGAG AAGGTCAAGG AAGGCCTGCT CCACGACCTG CTGACCCGCG GCATCGACGA CAACGGCCAG CTACGCCCCA GCCCCGAGCA GGCACCGGAG CTCTACAAGG AATCGCCGCT GGGGTTGATT CCGAGGGAGT GGAACGCAGT CAGGCTTTAT GAAATGGCGG AAAATCATGA TGGACAGCGA ATTCCGCTCA AAAAGTCGGA GCGCAAACAT GGCACATATC CATACTATGG AGCCTCCGGG ATAATTGATT GGGTTGAGGG ATATCTATTT GAAGGAAGCT ATGTTCTTCT TGGGGAGGAT GGTGAGAACG TTGTATCTAG GAACTTGCCG TTGGCATTCC CTGTTACTGG AAGGTTCTGG GTGAATAATC ATGCGCACAT ATACTCTCCA AAAGATGACT GCGACACCCG GTTCTTGGTT GAGGTGCTTG AGCAAAAGGA TTATTCGCGC TGGGTAAATG GTTCGGCCCA GCCGAAAATA ACGCAAGCAT CATTGAGAAT GATGTGGTTC TGCAAGCCAC CAACGGCTGA GCAGAAGGCT ATCTCAAATA GCCTTGAGGC AATCAATCAG CAGATTGACG AAGAGAAGAT CAAGATTGCC AAAGTGAGAA CGCAAAAAGC AGGGGTCATG GACGACCTGC TAACCGGCCG CGTCCGCGTC ACCCCCCTTC TCGACAAGGC CCAGGCCACG ACGCCAGCAT AA
|
Protein sequence | MSDLSWPDVS LGNIFTINTS AVIPNAAPNT EFYHHSLPAW DATGGPTVEK GSSIESNKVN ITKPCVLVSK LNPRKPRVSV LESVGKDERH CASTEFVCLE PKAKEHLRFW GHLFSNKRFA GHLDRMAIGS TNSHKRFSPG VLLSLRIELP SEPERRLIAR ILDTLDTQIQ KTEALIAKLE KVKEGLLHDL LTRGIDDNGQ LRPSPEQAPE LYKESPLGLI PREWNAVRLY EMAENHDGQR IPLKKSERKH GTYPYYGASG IIDWVEGYLF EGSYVLLGED GENVVSRNLP LAFPVTGRFW VNNHAHIYSP KDDCDTRFLV EVLEQKDYSR WVNGSAQPKI TQASLRMMWF CKPPTAEQKA ISNSLEAINQ QIDEEKIKIA KVRTQKAGVM DDLLTGRVRV TPLLDKAQAT TPA
|
| |