Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2552 |
Symbol | |
ID | 4270940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2894913 |
End bp | 2895854 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638127311 |
Product | ribosomal large subunit pseudouridine synthase D |
Protein accession | YP_743382 |
Protein GI | 114321699 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0564] Pseudouridylate synthases, 23S RNA-specific |
TIGRFAM ID | [TIGR00005] pseudouridine synthase, RluA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0920144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0397584 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCACTC GCATAGAACA CGACATCCTC ATCGACGAGC AGCAGACCGG TCAGCGGCTG GACCAGGCCT TGGCCGCCCT GCTGCCGGAC TACTCCCGCA GCCGTATCCA GCAGTGGATC CGCGAGGGGG CGGTCCGGCT GGAGGGCACC GCCCCCCGGC CGCGGGACAA AGTCGCTGCC GGGCAACAGG TCACGATACG GGCCGAACTG GAGGAAGAGC AACGGGTCAG TGCCGAGCCG ATCCCCCTGC GCATCCAGTA CGAGGACCGC CACCTGTTGG TCATAGACAA GCCCGCGGGC CTGGTGGTTC ACCCCGGGGC CGGCAACCGC GAGGGCACCC TGCAGAACGC CCTGCTCCAC CACGACCCGC AACTGGCCGA GCTGCCGCGG TCCGGCATCG TACACCGGCT CGACAAGGAC ACCTCCGGGC TGATGGTGGT GGCGCGCAGC CTGGCCGCCC ACACCGCCCT GGTGGCCCAG CTGCAGGCCC GCAGCGTCCG GCGCGAATAC CTGGCGCTGG TGAACGGCTG TCCGGTGGCC GGCGGTACCG TGGAGGCCCC CATCGGTCGC CACCCGCGGG ACCGCAAACG CATGGCGGTG GTCGAGCGCG GGCGCCCGGC CACCACCCAC TACCGGGTGG AGGAGCGCCT GGCCGCCCAC ACCCTCCTGC GCTGCTTTCT CGAGACCGGA CGCACGCACC AGATCCGGGT GCACATGGCC CATGCCGGCT ACCCGCTGGT GGGCGATCCC GTCTACGGCG GGCGGCTGCG GCTGCCGCCG CGGGCCACCG AGGCGCAGCG CCAGGCCCTG CGCGCCTTCC AGCGCCAGGC CCTGCACGCC GCCCGACTGG CCCTGGACCA CCCAGAGAGC GGCGAGCGCC TGAGCTGGGA GGCCCCCCTG CCCGAAGACA TGGCCGCGCT GCTGGCCTGT CTGCGGTCCT GA
|
Protein sequence | MGTRIEHDIL IDEQQTGQRL DQALAALLPD YSRSRIQQWI REGAVRLEGT APRPRDKVAA GQQVTIRAEL EEEQRVSAEP IPLRIQYEDR HLLVIDKPAG LVVHPGAGNR EGTLQNALLH HDPQLAELPR SGIVHRLDKD TSGLMVVARS LAAHTALVAQ LQARSVRREY LALVNGCPVA GGTVEAPIGR HPRDRKRMAV VERGRPATTH YRVEERLAAH TLLRCFLETG RTHQIRVHMA HAGYPLVGDP VYGGRLRLPP RATEAQRQAL RAFQRQALHA ARLALDHPES GERLSWEAPL PEDMAALLAC LRS
|
| |