Gene Information |
Locus tag | Mlg_0503 |
Symbol | |
ID | 4268439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 550095 |
End bp | 551183 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638125244 |
Product | appr-1-p processing domain-containing protein |
Protein accession | YP_741347 |
Protein GI | 114319664 |
COG category | [R] General function prediction only |
COG ID | [COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 |
TIGRFAM ID | |
| |
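The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is an untranslated stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record for locus tag Mlg_0503.
gene_bp, protein_aa = cds_lengths(550095, 551183)
print(gene_bp, protein_aa)
```

For this record the result agrees with the listed Gene Length (1089 bp) and Protein Length (362 aa).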
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGAGT ACACCCGAGG CAACCTTCTG GACGCCGATG TGGAGGCGTT GGTCAATACC GTCAACACCG TGGGTGTGAT GGGCAAGGGG GTCGCCCTGA TGTTCAAAGA GGCCTTCCCC GAGAACTTTC GTGCCTACCA GGCGGCGTGC AAGAACCGGG AGGTGGTGCC GGGTCGTATG TTCGTGCACG AACGGAGTGC ACTGCTCGGG CCGCGTTGGA TCATCAATTT CCCCACTAAG CAGCATTGGC GCGGTAAGAC GCGGATGGAG TGGATCGACT CCGGCCTGCG GGACCTCGAA CGGGTGATCC GCGTGAATGG GATTCGGTCC ATCGCGCTTC CGCCGCTGGG GTGCGGCAAT GGTGGGCTGC CATGGGCACA GGTGCGGCCC CGGATCGAGT CGGCGCTGCG CGACCTGCAG GACGTGCGGG TGGTGGTCTT CGAGCCGACC CGTCAATACC AAAATGTGGC CAAGCGCTCC GGGGTGGAGA AGCTGACACC GGCCCGTGCC TTGATTGCCG AGTTGGTGCG TCGGTATTGG GTGCTGGGGA TGGAGTGCTC CTTGCTGGAG GTGCAGAAGC TGGCCTGGCT AATCGAGCGG CGCATCATCG ACCACGGCCT GGAGAACCCA CTGGATCTGC AATTCAAGGC TCTTCGCTAT GGGCCGTATT CGGATCGGTT GCGCCACCTG CTCAATGGGC TTGATGGCAG CTATCTGCGC AGTGACAAAC GCATCAACGA CGCGGGCCCT GAAGAAGTGG TCTGGTTCGA TGAGGCGCGG CGCGACAAGC TGGGCATTTA CCTGCGCAGC GCGGAGGTCC GCCCCTATCT TGGAGTGCTG CAGGAGGTCG ACGACCTTAT CGATGGCTTT CAGTCCCCCC TCGGTCTCGA GGTGCTGGCC ACCCTGGATT GGCTTATCTG GCAGGAGGGT GTCGCGCCCA CAATAGCGGA CGTTAAAGAA GGGCTGCGGC GCTGGCCAGA CGACATTGCC GGTCAGCGCA AGCTACGGCT GTTTTCGGAT CAGCTCATTG AGTTGGCGTT GGCGCGTTTG ACCAGCCGGA CTCCGGATCT TCAGGTCATC GCCACGTGA
|
Protein sequence | MIEYTRGNLL DADVEALVNT VNTVGVMGKG VALMFKEAFP ENFRAYQAAC KNREVVPGRM FVHERSALLG PRWIINFPTK QHWRGKTRME WIDSGLRDLE RVIRVNGIRS IALPPLGCGN GGLPWAQVRP RIESALRDLQ DVRVVVFEPT RQYQNVAKRS GVEKLTPARA LIAELVRRYW VLGMECSLLE VQKLAWLIER RIIDHGLENP LDLQFKALRY GPYSDRLRHL LNGLDGSYLR SDKRINDAGP EEVVWFDEAR RDKLGIYLRS AEVRPYLGVL QEVDDLIDGF QSPLGLEVLA TLDWLIWQEG VAPTIADVKE GLRRWPDDIA GQRKLRLFSD QLIELALARL TSRTPDLQVI AT
|
| |
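The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11 (the bacterial, archaeal and plant plastid code), which are the same as those of the standard code; the two tables differ in which codons may act as start codons, and that is not modelled here. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so that the space-separated blocks of ten bases
    used in the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGATTGAGT ACACCCGAGG CAACCTTCTG"))
```

Passing the full Gene sequence field to `translate` is expected to reproduce the Protein sequence field, with the terminal TGA read as the stop codon.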