Gene Mext_1005 details


Gene Information

Locus tag: Mext_1005
Symbol:
ID: 5833661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1083687
End bp: 1084718
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 71%
IMG OID: 641366787
Product: RluA family pseudouridine synthase
Protein accession: YP_001638481
Protein GI: 163850438
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
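
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (which matches the reported 1032 bp) and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS that ends in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(1083687, 1084718)
print(length_bp)                  # 1032
print(protein_length(length_bp))  # 343
```

Both values agree with the Gene Length and Protein Length fields of this record.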


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGAA TCCTCACGGT GTCCTCCACG GTGCTTTCCA TGATCCCCGG CGCGGACGAG 
GCGCGCAGCC TTGTGCTCGA CGAGGGAACC GTTCCCGAGC GGCTCGACCG CGTCCTTGCC
CGCGTCTTCG ACGATCTCTC CCGTGCCCGG CTCCAGGGCT TGGTGCGCGA GGGCCTCGTG
CGCTGCGACG GGATCGTGGT GCGCGATCCC GCGCGCAAGG TCGGGGCTGG CTGCCGGATC
GATCTCAGCG TTCCGGCGCC GCTTCCCGCC GAGCCGCTCG GCGAGGCTTT GCCGCTCGCC
GTCGTCCATG AGGACGAGGA TCTCATCGTC ATCGACAAGC CGGCGGGGCT CGTCGTGCAC
CCGGCGGCGG GGCACGAGGA CGGCACCCTC GTCAATCGGC TGATCGCGCA TTGCGGGGCG
AGCCTGTCCG GAATCGGCGG CGTGCGCCGG CCCGGCATCG TCCACCGCCT CGACAAGGAC
ACGAGCGGCC TGCTCGTCGT CGCCAAGAAC GACCTCGCCC ATCAGGGCCT CTCGGCCCAG
TTCGCCGACC ATGGCCGCAG CGGCGCCCTG GAGCGGGCCT ATCTCGCCCT GGTCTGGAAT
GTGCCGGAGC CGCGGGCCGG CACGATCCGC GCGAACCTCG CGCGCTCGCG CCACAATCGC
GAAAAGATCG CCGTGGTCCG CGACGGCGAG GGGCGGGAGG CGATCACCCA CTACCGGGTC
GAGGACGTGC ACGGGGAAGG CGGCGTTACG GCCCTGCTGC GCTGCCATTT GGAAACCGGG
CGCACCCACC AGATCCGGGT GCATCTGAGC CATCGCGGCC ATCCGCTGCT GGGGGACGCG
GTCTATGGCG GCGCCTTCAA GACCAAGGCG GCCCGGCTCA GCGAACCCGC TCGCGCCGCC
CTGGACGCTC TCGGACGACA GGCCTTGCAC GCGGTCGAAC TCGGATTTCT CCACCCGCGC
TCCGGCGAGC GCCTGCGCTT CGAGAGCCCG CTGCCGGAGG ATTTTTCGCG GCTGCTCGCC
GCCCTCGGCT GA
 
Protein sequence
MTRILTVSST VLSMIPGADE ARSLVLDEGT VPERLDRVLA RVFDDLSRAR LQGLVREGLV 
RCDGIVVRDP ARKVGAGCRI DLSVPAPLPA EPLGEALPLA VVHEDEDLIV IDKPAGLVVH
PAAGHEDGTL VNRLIAHCGA SLSGIGGVRR PGIVHRLDKD TSGLLVVAKN DLAHQGLSAQ
FADHGRSGAL ERAYLALVWN VPEPRAGTIR ANLARSRHNR EKIAVVRDGE GREAITHYRV
EDVHGEGGVT ALLRCHLETG RTHQIRVHLS HRGHPLLGDA VYGGAFKTKA ARLSEPARAA
LDALGRQALH AVELGFLHPR SGERLRFESP LPEDFSRLLA ALG
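
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace removed before any computation. The sketch below shows one way to do that, compute a GC percentage, and translate a fragment of the gene. It builds the codon table from the standard NCBI ordering; translation table 11 uses the same codon assignments as the standard table and differs in its permitted start codons, which does not matter here because this gene begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGACGAGAA TCCTCACGGT GTCCTCCACG"))  # MTRILTVSST
print(translate("GCCCTCGGCT GA"))                     # ALG
```

The two fragments translate to the first ten and last three residues of the protein sequence shown in this record. Passing the full gene sequence to `gc_percent` gives a value to compare against the reported GC content.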