Gene Mlg_1835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1835 
SymboltruD 
ID4268190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2094075 
End bp2095106 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content74% 
IMG OID638126591 
ProducttRNA pseudouridine synthase D 
Protein accessionYP_742669 
Protein GI114320986 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.151087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCACTG AGCAAGCCGC GTCCCTGCCC CGCGCCTGGG GCCCCCCGCT GGGCACGGCG 
CGCCTCAAGG CCACCCCGGA GGACTTCCTG GTCGAGGAGC AGACCGGGCT CTGCCCTTGC
GGCGACGGGG AGCACCTGTG GCTGTGGGTG GAAAAGCGGG GTCTGAACAC CGCGCAGGTG
GCCCGGGCGC TGGCCGAGGC CGCCGGCATC CACCCGCGGG CGGTCTCCTT TGCCGGCCTG
AAGGACAAAC ACGCCCTGAC CCGCCAGTGG TTCAGCCTGC AGTCGCCCGG TCGGTCGCTG
CCCCTGGGCG TGGGGGAGGG GCCGATCCCG GGCGTGCGCA TCCTGATCGC CCGGCGCCAC
CATCGAAAGC TGCGCACCGG GGCCCTCAAG GGCAACCGGT TCGTGCTGAC CCTGCGGGAC
TGCGACGCCG ATCCGGCGGC GGTGGCACAG CGCCTCTACC GCATCAGCAC CCAGGGCGTG
CCCAACTACT TCGGCCACCA GCGCTTCGGG CGTGGCGGCG GCAATCTGGC CCAGGCCTCG
GCCTGGTTCG CCGGCGGGCG TCCGCCCCGC GACCGCAAGC TGCGCGGCCT GCTGCTCTCC
AGCGTGCGGT CCGAGCTGTT CAATCGGGTG CTGGCGCGGC GGGTCGGGGA GGGCAGTTGG
AACCGACTGT TGCCGGGCGA GGTGGCCATG CTCGATGGGC GCGGAGCGGT GTTCGAGACC
GATCCGGCCG ACCCCGCTCT GCCCGGGCGG TGTGCCCGTC TGGAGATCCA CCCCACGGGG
CCACTGGCGG GCGAGCGCGG GGTGCAGCCC GGCGGCGAGG TCGCGGCCCT GGAGCGGTCG
GTATTGGCGG CCGAACCCCT CTGGCACCAG GGCCTGGCGC GGGCAAGGAT GGAGGCGGCG
CGCCGTGCGC TGCGCCTGCG GGTGGTCGAT CTCGCCTGGC ATTGGCCGGC GCCGGGCCGG
CTCCAACTTA GTTTCCGGCT GCCTGCCGGG GCCTATGCCA CCGTGGTGGT ACGGGAGGTG
TTGGAGTGTT GA
 
Protein sequence
MSTEQAASLP RAWGPPLGTA RLKATPEDFL VEEQTGLCPC GDGEHLWLWV EKRGLNTAQV 
ARALAEAAGI HPRAVSFAGL KDKHALTRQW FSLQSPGRSL PLGVGEGPIP GVRILIARRH
HRKLRTGALK GNRFVLTLRD CDADPAAVAQ RLYRISTQGV PNYFGHQRFG RGGGNLAQAS
AWFAGGRPPR DRKLRGLLLS SVRSELFNRV LARRVGEGSW NRLLPGEVAM LDGRGAVFET
DPADPALPGR CARLEIHPTG PLAGERGVQP GGEVAALERS VLAAEPLWHQ GLARARMEAA
RRALRLRVVD LAWHWPAPGR LQLSFRLPAG AYATVVVREV LEC