Gene Emin_0375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0375 
Symbol 
ID6262592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp399925 
End bp400824 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content43% 
IMG OID642610841 
ProductRluA family pseudouridine synthase 
Protein accessionYP_001875269 
Protein GI187250787 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000429003 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000105454 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAAATA AAGAAACTTT AACTTTTAGC GGAACAAGCG CCAGACTTGA CCTTTTTTTA 
AGCGAAAACA AGCCTGATTA TTCGCGCGGA TTAATACAAA ACCTTATTAA GCAGGGGAAA
GTTACGGTTA ACGGCAAAGA ACGTAAACCC GCCTGGCCGC TTGCAGAAGG CGACAACGTT
GAAATAGAGT GGCCTTCGGT TGAAAATAAA ACTGGTTTAA AAGATTTAAT AATTTTTGAA
GATAAAAATA TGTTTGTAAT AAATAAACCC AGCGGCATGC TTGTACACCC GCAAAGCCCC
GTTTGGGAAG AAAACCCCGC CGCCGCTTTT ATCGGGGAAG AAACTTTAGT TTCCCTTATT
TTGGCAAACC CGCCTAAAAA TTTTGAAAAA GGCATTACCC GCGCCGGGCT TGTGCACAGG
CTTGATAAAG ACACAAGCGG CGTTATGATA ATTGCTAAAA ACTCAAAAAC CCAGGACGCT
ATGGTTGAAA TGTTTGCCAA CAGGGAAATG CATAAAACCT ATGAGGCTAT TGTCTGTGGC
GTTGTGCCTG ACGATAAAGG CATAATAAAC GTGCCTATAG GGCGTGTTAC GGGCGGTAAA
ATAAAAGCAA GCGAACTTGG GCGCGAAGCT GTTACCGAAT ACAGCGTTTT ACAAAGAAAA
GAAACCGTTT CTTTAATGAA ACTTCACCCC GTAACGGGTA GAACAAACCA GTTACGCGTG
CATATGAGCT GGCTCGGCTA CCCTGTTTTG GGCGACTGGC TTTATAAAGG CGCCACGGCG
CCACGGCTTA TGTTGCACTC TAAAAGCGCG GAATTTGAAC ATCCTTTTAC TTCCAAACCC
GTAAAATTTA CGGTAGCGCC GCCTAAAGAT TTTAAAGACT CTTGGAAAAA CGCCAAATAA
 
Protein sequence
MSNKETLTFS GTSARLDLFL SENKPDYSRG LIQNLIKQGK VTVNGKERKP AWPLAEGDNV 
EIEWPSVENK TGLKDLIIFE DKNMFVINKP SGMLVHPQSP VWEENPAAAF IGEETLVSLI
LANPPKNFEK GITRAGLVHR LDKDTSGVMI IAKNSKTQDA MVEMFANREM HKTYEAIVCG
VVPDDKGIIN VPIGRVTGGK IKASELGREA VTEYSVLQRK ETVSLMKLHP VTGRTNQLRV
HMSWLGYPVL GDWLYKGATA PRLMLHSKSA EFEHPFTSKP VKFTVAPPKD FKDSWKNAK