Gene RoseRS_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3894 
Symbol 
ID5210877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4874821 
End bp4876044 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content61% 
IMG OID640597490 
Product(Uracil-5)-methyltransferase 
Protein accessionYP_001278197 
Protein GI148657992 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATAT CGAACAAACT TTTTCGGCGA CAGATGATTG AAGCGGTACG GTCGGGAGCG 
CCAGTTTTGC CGCGTTGCTT GCACGCCCCG CCGACCGGTC AGTGTGGCGG ATGCGCATTC
CAGGATCGTG AGTATCCGGT GCAGGTGGCA GCAAAGCGCA CGGCGCTTCG TCATCTCTGG
GAAGGCGATC TACCGGAAAC GCTCCTCACC ACGCTTGATG TCGTCGCTTC GCCCAATCCC
TTCGCTTATC GCACGCGCAT GGACTTTGTG GCGAGCAAGG AACGTTTTGG TCTGCGACGC
AGCGGCAGGT TCAACTATAT TATCGACCTG AAGGAATGCC ATCTCATACC GGCGCGCGCA
TTCGCTGCCG CGCGCGCAAT GTACGATCAC GCGATGCGTC TGGGTTTGCC CGACTACGAC
CTGCGCGCGC ATGCCGGTTT TCTGCGGTAC GTGGCAGTGC GGCGCAGCCC TGACGACGAA
GTGCTGCTGG CGCTGATCAC TGCCGCTCCC GACGAGGCGG GGACGTATGC CGGGAAGGTG
GAACAGGTGG CGCAGGCGGC GCTCGAGTAC GATGGCGTGG TCGGCGTTCA CTGGTTGATC
AACCCCACCC GTACTGATAT ATCGTTTGGG GAAACGGCGC GTTTCTGGGG GCGCGCAACG
TTGCCGATGC GTGTGGGGGC GCATACGCTC GACATCGGAC CGAACACCTT CTTTCAGAAT
AATGTCTGGT TGCTTATGCC CCTGCTCGAG GCGGTGCGTG ATGCGGTTGC CGGGGAAGGC
GAACGTGCGC GTGCGCTTGC CGATCTGTAT GGCGGGGTTG GCACAATAGC GCTGTTTGTC
GCCGATCTTG CCGATCAGAT TGTCTGTGTC GAATCGGTTG AGGAAAGTGT GCGTCTGGCG
CGAGAAAACA TTGCGCGCGC CGGCTTTGAG CATATCGCCA TTGTTAAGGC GGATGTCGCC
GATGCGCTCC GCGAACGAAC ACGGGGCGCG TTCGATATCG TTATCGCCGA TCCGCCGCGC
ACCGGTTTGG GTCCCGACGT ATGCCGCGAA TTGCTGCGGC TGCGCCCGCA ACGCATTGTG
TATATCTCGT GCAACCCGCT CACCCAGCGC GACGATGTGC GTATGCTGAC GGAGGCGTAT
CGCCTGACAT CGCTGCGAGG GTACGATATG TTTCCCCATA CGCCGCATCT CGAGTCGCTG
GCAGTACTGG ACGTTATCCG CTGA
 
Protein sequence
MSISNKLFRR QMIEAVRSGA PVLPRCLHAP PTGQCGGCAF QDREYPVQVA AKRTALRHLW 
EGDLPETLLT TLDVVASPNP FAYRTRMDFV ASKERFGLRR SGRFNYIIDL KECHLIPARA
FAAARAMYDH AMRLGLPDYD LRAHAGFLRY VAVRRSPDDE VLLALITAAP DEAGTYAGKV
EQVAQAALEY DGVVGVHWLI NPTRTDISFG ETARFWGRAT LPMRVGAHTL DIGPNTFFQN
NVWLLMPLLE AVRDAVAGEG ERARALADLY GGVGTIALFV ADLADQIVCV ESVEESVRLA
RENIARAGFE HIAIVKADVA DALRERTRGA FDIVIADPPR TGLGPDVCRE LLRLRPQRIV
YISCNPLTQR DDVRMLTEAY RLTSLRGYDM FPHTPHLESL AVLDVIR