Gene Saro_2837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2837 
Symbol 
ID3915476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3063296 
End bp3064606 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID640445616 
Productribosomal large subunit pseudouridine synthase C 
Protein accessionYP_498107 
Protein GI87200850 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0449259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGGG CCAAGATGCC GGACGATCAG GTCCGCCAGT TCACCATCGG CAAGGACGAT 
GCGGGGGCCC GCCTCGACCG CTGGTTCAAG CGCCACCTGC CGCAGGTCGG CTTTGCCACG
GTATCGCGCT GGGCGCGGAC CGGGCAGATC CGGATCGACG GCAAGCGGGC CGATGTCGAC
ACCCGGCTCG AGGCCGGCCA GATCCTGCGC GTACCGCCAG GCAACGCGAC CCCCGTCGGC
ACCCCCGGCA AGGGCGAGCG CGCGCGCAAG CCGCTGACCG AGGAACAGAT CCAGCTCGCC
GAATCGATGG TGCTGGAAAA GGACCGCGCC GCCATCGTGC TCAACAAGCC GCCGGGCCTT
GCGACGCAAG GCGGCAGTGG CACGCACGAG CACGTCGACG GCCTGCTCGA CGCCTATGTC
AACGAGGGCG AAGCGCGCCC ACGGCTGGTG CACCGGCTCG ACAAGGACAC CTCTGGCGTC
CTGCTCATCG CGCGCACGCC GGGCAGCGCG GCATTCTTTT CCAAGCGCTT CTCCTGCCGG
TCGGCGCGCA AGATCTACTG GGCGCTGGTC GTCGGCGTGC CCGACGTCAA GGACGGCCTG
ATCGAACTGC CGCTGGCCAA GCAGCCGGGC ACCGGCGGCG AGAAGATGAT GGTCGACGAA
AGCGGCCAGG GCCAGTCTGC CCGCAGCCGC TACCGGGTAA TCAGCCGCGC GGGCAATGCC
GCCGCGTGGG TCGAGCTACA GCCGCTGACC GGGCGCACCC ACCAGCTTCG CGTGCACATG
GCTGCCATCG GGCACCCGAT CGTGGGCGAC GGCAAGTATG GCGGGCAGGC GGCGTTCCTG
ACCGGATCGA TCAGCCGCAA GATGCACCTC CACGCCCGGC GCCTCAGGAT CGAGCATCCC
GAAGGCGACA TGATCGACGT GACCGCGCCG CTGCCCGCGC ACTTCGCGGC GAGCATGGCG
AGCCTCGGCT TCCACGAGGA AGAAGGCGAC CTGCCGCTCG AGCCGGTCAA GCTGGTCGAC
GAGAAGACCG AGCAGAAGCG CGCGGCCAAG GCGCATGCCA AGGAGTACCG CAAGGAACGC
CGGGGCGAGC GTCGCAAGCG CGTCGACGGC GGCACCTCGC GCAAGCCGAC GGGCAAGCGG
GCCTCTCCCA AGGCCGGGGC CAAGCCTGCT GCCGGAAAGC CTGCTGCTGC GAAGTCCGGC
GCGAAGCCGG GGGCGAAGAA GCCGGCAGCC AGGACGACGG CCGGGGGCAA GCCCGCGCGC
GCCGCCGCGC CGCGCAAGCC CGCTGGTCCG CCCAAGGCCA AGGGGGGCTG A
 
Protein sequence
MSRAKMPDDQ VRQFTIGKDD AGARLDRWFK RHLPQVGFAT VSRWARTGQI RIDGKRADVD 
TRLEAGQILR VPPGNATPVG TPGKGERARK PLTEEQIQLA ESMVLEKDRA AIVLNKPPGL
ATQGGSGTHE HVDGLLDAYV NEGEARPRLV HRLDKDTSGV LLIARTPGSA AFFSKRFSCR
SARKIYWALV VGVPDVKDGL IELPLAKQPG TGGEKMMVDE SGQGQSARSR YRVISRAGNA
AAWVELQPLT GRTHQLRVHM AAIGHPIVGD GKYGGQAAFL TGSISRKMHL HARRLRIEHP
EGDMIDVTAP LPAHFAASMA SLGFHEEEGD LPLEPVKLVD EKTEQKRAAK AHAKEYRKER
RGERRKRVDG GTSRKPTGKR ASPKAGAKPA AGKPAAAKSG AKPGAKKPAA RTTAGGKPAR
AAAPRKPAGP PKAKGG