Gene Saro_3046 details


Gene Information

Locus tag: Saro_3046
Symbol:
ID: 3916658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3261197
End bp: 3262195
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 67%
IMG OID: 640445826
Product: ribosomal large subunit pseudouridine synthase D
Protein accession: YP_498315
Protein GI: 87201058
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0564] Pseudouridylate synthases, 23S RNA-specific
TIGRFAM ID: [TIGR00005] pseudouridine synthase, RluA family
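
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3261197, 3262195)
print(length)                  # 999
print(protein_length(length))  # 332
```

Both values agree with the Gene Length and Protein Length fields listed in this record.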


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTACCCA TATGGCCGGG AATGGGGGGC ACCGACAGCA TCATTTCCGG ACAGGTCCCC 
GCAGGTGGCC AGAGACTCGA CAAGGCCCTG GCCGACGCCA GCGGCCTCTC GCGCGAGCGG
GTCAAGGCTC TGCTTGGTGA AGGGCGTGTC CGGCTGGGTG GGGCGGTAGT CTCGCAGGCC
TCGCTCAAGC CCGCCGAGGG CACACCGTTC GAAATCCACG TACCCGAGGC GGCCCCCGCC
GAAGCCATCG CGCAGGCAAT CCCGCTGGTG GTCGTCCACG AGGACGACGC CCTGATCGTG
GTCGACAAGC CTGCGGGACT TGTGGTCCAT CCGGCTGCGG GCAACCCCGA CGGGACGCTG
GTCAACGCCC TGCTGCATCA CTGCCGTGGG CAGCTTTCCG GCATTGGCGG GGTGGCCCGG
CCGGGGATCG TCCATCGCAT CGACAAGGAT ACTTCGGGCT TGCTGGTCGT GGCGAAGACC
GACGCCGCCC ACGAGGGACT GGCACGGCAG TTCGCCGATC ATTCGATCAC GCGTGCGTAC
AAGTGCGTGA CCGCAGGCGT GCCGATGCCG CCTTCTGGCA CGGTGCGCGG GGCGATCGCG
CGATCGAGCC ATGATCGCAA GAAGATGGCG CTGGTCGATG ACGGGCGCGG GAAGCATGCG
GTCACCCATT TCCGAACGCT CGCAGCGCTT CAGGGCGCCG CGCTTGTCGA GTGCCGGCTG
GAGACGGGGC GAACCCACCA GGTGCGCGTT CACCTTGCGT CAATCGGCCA TCCGCTATTG
GGTGATCCGG TCTATGGACG CACACCTTCA CGCCTCAGGC CGCTGCTCCA GCGGCTCGGG
TTTCACCGTC AGGCGCTTCA CGCGGCGGAG CTGGGATTCA TCCACCCCGT CACCGGCGCA
CCGCTCCACT TCGCCAGTCC GACGCCCGTC GACATGCGGG AACTCATCGT CGAACTGTGC
GCTGAAGGTC AGGATGCAAA GCTCATGGCG ATGGTGTAG
 
Protein sequence
MLPIWPGMGG TDSIISGQVP AGGQRLDKAL ADASGLSRER VKALLGEGRV RLGGAVVSQA 
SLKPAEGTPF EIHVPEAAPA EAIAQAIPLV VVHEDDALIV VDKPAGLVVH PAAGNPDGTL
VNALLHHCRG QLSGIGGVAR PGIVHRIDKD TSGLLVVAKT DAAHEGLARQ FADHSITRAY
KCVTAGVPMP PSGTVRGAIA RSSHDRKKMA LVDDGRGKHA VTHFRTLAAL QGAALVECRL
ETGRTHQVRV HLASIGHPLL GDPVYGRTPS RLRPLLQRLG FHRQALHAAE LGFIHPVTGA
PLHFASPTPV DMRELIVELC AEGQDAKLMA MV
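
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tool used to produce this record: it assumes the amino acid assignments and start codons of NCBI translation table 11, reads a permitted start codon in the first position as Met, and stops at the first stop codon. The helper names (`clean`, `gc_content`, `translate_cds`) are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in the NCBI base order T, C, A, G.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons assumed for table 11; each is read as Met at position one.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # alternative starts such as GTG still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGCTACCCA TATGGCCGGG AATGGGGGGC"))  # MLPIWPGMGG
```

The gene begins with GTG rather than ATG, which is why the first-codon rule matters here: without it, the first residue would be read as V instead of the M shown in the protein sequence.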