Gene Mlg_2552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2552 
Symbol 
ID4270940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2894913 
End bp2895854 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content73% 
IMG OID638127311 
Productribosomal large subunit pseudouridine synthase D 
Protein accessionYP_743382 
Protein GI114321699 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0920144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0397584 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACTC GCATAGAACA CGACATCCTC ATCGACGAGC AGCAGACCGG TCAGCGGCTG 
GACCAGGCCT TGGCCGCCCT GCTGCCGGAC TACTCCCGCA GCCGTATCCA GCAGTGGATC
CGCGAGGGGG CGGTCCGGCT GGAGGGCACC GCCCCCCGGC CGCGGGACAA AGTCGCTGCC
GGGCAACAGG TCACGATACG GGCCGAACTG GAGGAAGAGC AACGGGTCAG TGCCGAGCCG
ATCCCCCTGC GCATCCAGTA CGAGGACCGC CACCTGTTGG TCATAGACAA GCCCGCGGGC
CTGGTGGTTC ACCCCGGGGC CGGCAACCGC GAGGGCACCC TGCAGAACGC CCTGCTCCAC
CACGACCCGC AACTGGCCGA GCTGCCGCGG TCCGGCATCG TACACCGGCT CGACAAGGAC
ACCTCCGGGC TGATGGTGGT GGCGCGCAGC CTGGCCGCCC ACACCGCCCT GGTGGCCCAG
CTGCAGGCCC GCAGCGTCCG GCGCGAATAC CTGGCGCTGG TGAACGGCTG TCCGGTGGCC
GGCGGTACCG TGGAGGCCCC CATCGGTCGC CACCCGCGGG ACCGCAAACG CATGGCGGTG
GTCGAGCGCG GGCGCCCGGC CACCACCCAC TACCGGGTGG AGGAGCGCCT GGCCGCCCAC
ACCCTCCTGC GCTGCTTTCT CGAGACCGGA CGCACGCACC AGATCCGGGT GCACATGGCC
CATGCCGGCT ACCCGCTGGT GGGCGATCCC GTCTACGGCG GGCGGCTGCG GCTGCCGCCG
CGGGCCACCG AGGCGCAGCG CCAGGCCCTG CGCGCCTTCC AGCGCCAGGC CCTGCACGCC
GCCCGACTGG CCCTGGACCA CCCAGAGAGC GGCGAGCGCC TGAGCTGGGA GGCCCCCCTG
CCCGAAGACA TGGCCGCGCT GCTGGCCTGT CTGCGGTCCT GA
 
Protein sequence
MGTRIEHDIL IDEQQTGQRL DQALAALLPD YSRSRIQQWI REGAVRLEGT APRPRDKVAA 
GQQVTIRAEL EEEQRVSAEP IPLRIQYEDR HLLVIDKPAG LVVHPGAGNR EGTLQNALLH
HDPQLAELPR SGIVHRLDKD TSGLMVVARS LAAHTALVAQ LQARSVRREY LALVNGCPVA
GGTVEAPIGR HPRDRKRMAV VERGRPATTH YRVEERLAAH TLLRCFLETG RTHQIRVHMA
HAGYPLVGDP VYGGRLRLPP RATEAQRQAL RAFQRQALHA ARLALDHPES GERLSWEAPL
PEDMAALLAC LRS