Gene Arth_1534 details


Gene Information

Locus tag: Arth_1534
Symbol:
ID: 4445934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1709141
End bp: 1710361
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 68%
IMG OID: 639689348
Product: pseudouridine synthase, Rsu
Protein accession: YP_831028
Protein GI: 116670095
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1187] 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases
TIGRFAM ID: [TIGR00093] pseudouridine synthase
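
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch such as the one below. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1709141, 1710361)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Both results agree with the Gene Length and Protein Length fields of this record.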


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0171678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACAGG CGGGACGCCA GGGTTCACCA CGTAACAGTT CGGGACGCAA CAGCGCCGGA 
CAGGCCAAAG ATGGCCAAAA CAGGGCCGGA CGCAACCCGG CCCGGGGCGG CAGCAGCCGT
GCTGCCGGGG CCGGCGGCGC CGGCTTCAAG GGCGGCGGCG ACCGTCCCTT CAAGCACCCC
AAGCCTCGCG AAGAGGCATT CGTCGATCCG GACCTGCAGG GCCCCGGCGG CGCGCCGGCG
GACCGCCCGG CAGCCGGCGA CTGGAAGCCG GGCAAACCTG CTGCCCGGAA GCCCGGTTCG
CGCAAGCCCG GCGCCGGCAA GGTGCCCGGC ACTCCCGGCG CGCTGAAGCC CAAGCCGCGG
TCCGCCGCAG CCAAGACGTT CGGTACGCGC GCCTTCGGCG GCGAACGGTT CGGCCAGAAC
CTCGGTGCCG TCCGCAAGCC TTCGCGCAAA CGCGGACCCC GCGGGGACGT GCCGCAGTCC
GAAATGCACG ACGCCGACGG CATCCGCCTG CAGAAAGTCA TGGCCTCAGC CGGTGTCGCT
TCACGGCGCG TCTGCGAGGA AATGATCTCA GAAGGCCGTG TGGAGGTGGA CGGCAAGGTC
GTCACCGAGC TCGGCGTCCG AGTCGACCCG AAAACGGCAG TGATCCACGT CGACGGCCTG
CGCATCCAGC TGGACGAAAA CATGGTCTAC ATGGTGTTCA ACAAGCCCAA GGGCGTTGTA
TCCACCATGG AGGATCCGGA CGGCCGTCCG TGCATCAGCG ACTTCGTTCG CAATACCCAC
GGCGAACGCC TCTTCCACGT GGGACGTCTC GACGTCGCCA CCGAAGGGCT GTTGTTGCTG
ACCAACGACG GCGAACTGGC CAACCGTTTG ACGCACCCGT CCTACGAGGT ACCCAAGACG
TACCTGGTCC AGGTCCGCGG CCCGTTCCCG CAGGGAATCG GCGCGCAGCT GAAGGCGGGC
GTCGAGCTTG AGGACGGAAT GGCCTCGGTT GACTCCTTCA AGCTGGTGGA CTCCACCCCG
GGCCACGTGC TGATCGAGGT AGTGCTGCAC TCCGGCAAAA ACCGGATCGT GCGCCGCCTC
TTTGACGCCG TCGGTTTCCC GGTACTCCGG CTCGTGCGCG TCAAGGTGGG ACCCATCGGC
CTGGGAGACC AGCGCCAGGG GAGCATCCGC AACCTCGGCA AGCAGGAAGT CGGCCACCTG
CTGGCATCCG TAGGGCTGTA G
 
Protein sequence
MTQAGRQGSP RNSSGRNSAG QAKDGQNRAG RNPARGGSSR AAGAGGAGFK GGGDRPFKHP 
KPREEAFVDP DLQGPGGAPA DRPAAGDWKP GKPAARKPGS RKPGAGKVPG TPGALKPKPR
SAAAKTFGTR AFGGERFGQN LGAVRKPSRK RGPRGDVPQS EMHDADGIRL QKVMASAGVA
SRRVCEEMIS EGRVEVDGKV VTELGVRVDP KTAVIHVDGL RIQLDENMVY MVFNKPKGVV
STMEDPDGRP CISDFVRNTH GERLFHVGRL DVATEGLLLL TNDGELANRL THPSYEVPKT
YLVQVRGPFP QGIGAQLKAG VELEDGMASV DSFKLVDSTP GHVLIEVVLH SGKNRIVRRL
FDAVGFPVLR LVRVKVGPIG LGDQRQGSIR NLGKQEVGHL LASVGL
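
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a rough sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The helper names are hypothetical. The codon table is built from the standard codon assignments, which translation table 11 shares for internal codons; table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon assignments in TCAG order (standard assignments, shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "ATGACACAGG CGGGACGCCA GGGTTCACCA CGTAACAGTT CGGGACGCAA CAGCGCCGGA"
print(translate(first_row))  # MTQAGRQGSPRNSSGRNSAG
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.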