Gene RoseRS_3233 details

Gene Information

Locus tag: RoseRS_3233
Symbol:
ID: 5210208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4071891
End bp: 4073507
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 60%
IMG OID: 640596829
Product: single-stranded nucleic acid binding R3H domain-containing protein
Protein accession: YP_001277544
Protein GI: 148657339
COG category: [S] Function unknown
COG ID: [COG3854] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
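
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 1617 bp) and that the CDS ends with a single stop codon. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: they are 1-based and inclusive.
START_BP = 4071891
END_BP = 4073507


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1617
print(protein_length(length_bp))  # 538
```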


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.384079
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTGC AACAGGATAT TACCAGCAAT ATCGAATTAC TTCTGGCAAC GCTTCCGGCG 
CCTATTCGTC AGGCGATTGA AACGGGTCCC GATGACCGGG AGGACCTGCT GGAAGTGATC
ATGGATCTTG GTCGACTCCC CGAAGCGCGC TATCGGAATC ACGAACGTTT TCTCAGTGAT
CGCGAAGTCA CGCAGGAAGA TATCGACTAT GTCATTGGTC GAATCGGCGC CTTCGGCGAA
GATAACCGTG CGGGCATCCC GCGCACGCTG CACCGGATCT CGGCGATCCG CAATCGTACC
GGTCGCGTGA TCGGGTTGAC CTGTCGTGTC GGGCGCGCCG TCTTCGGTAC GATCACGATC
CTGCGCGATC TGATCGAGTC GGGGAAGAGC GTGCTGCTGC TGGGACGTCC CGGCGTCGGC
AAGACGACGA TGCTGCGCGA AACGGCGCGG GTGCTGGCGG AAGATCTGCG GAAGCGCGTT
GTGATTGTAG ATACGTCGAA CGAAATTGCC GGTGACGGCG ATATTCCCCA CCCCGGCATC
GGGCGGGCGC GCCGCATGCA GGTGCCGCGA CCGTCGGAAC AGCACGCGGT GATGATCGAA
GCCGTCGAAA ATCATATGCC GGAAGTGATC GTGATCGATG AGATCGGGAC GGAACTCGAG
GCGCTGGCTG CGCGTACCAT CGCCGAACGC GGCGTTCAGT TGATCGGCAC GGCGCACGGG
CAGACGCTCG AGAACCTGTT GTCGAACCCG ACATTGTCCG ATCTGATCGG CGGCATTCAG
GCGGTGACGC TTGGCGATGA AGAGGCGCGC CGGCGGGGAA CACAGAAGAC GGTGCTTGAA
CGAAAGGCGC CGCCAACTTT CGACATTCTG GTCGAAATTC AGAACTGGGA TCAGGTGACA
GTCTATCCCG ACGTTGCCAG CGCGGTGGAT AGTCTGCTGC GCTCCGAGCC GCCGCAGGCG
GAGGTGCGCC GACGCACTGC TGACGGTGAG ATCGAAGTCT TTCCGGCTTT TACCGGTGAG
CATGCTGTTC CGTCGATCCC GGGCATTCGT CGTGGCGGCG GGCGTGAACG TGGGGAGCGA
AGCCCGCGTG TATCGTCGGC GCCTGCCGCT TCCAGCGTCG GAATGACACC GCAACGTATC
TACCCGTTCG GTGTCAGCCG TGATCGTCTG GAGCGGGCGA TTGCCAGCCT GCACGTTCCG
GCGACCATCG TGCGCGACAT GGGTGAGGCG ACGATGGTGA TGACCCTGAA GAATTATTAT
CGTCAGGGAT CGCAGCGTGT GCGTCAGGCG GAAGAGCGTG GCGTGCCGGT TTATGTGTTG
CGGAACAATA CCCTGGCGCA GATGGAGCGT CAACTCGCCG ATGTGTTCAA TATCAGTCTG
AGCGACAATG GATTGCAACG CCGGGCGGAT CGTGATGATG ATGCTACCAT GACCGAGGCG
CTGCTGGAGG TCGAAACAGC GATTACGCAG GTTCTGAACG GCGAACGTTC GTCGGTTGAA
CTGCAACCGC AGAGCAGTTA TGTGCGCCGG TTGCAGCATC AGATGGCGGA GCGCTACAAC
TTGCAATCGG AGAGTCGCGG ACGCGAACCG AACCGACGGG TCAAAATCTT TCGTTAG
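
The gene sequence above is displayed in space-separated groups of 10 bases, so the spacing has to be removed before any computation. As a minimal sketch, the helpers below strip the whitespace and compute a GC percentage. The function names are hypothetical, and only the first displayed line is used as input; pasting the full block would give the figure to compare against the GC content listed in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, exactly as displayed.
first_line = "ATGGCAGTGC AACAGGATAT TACCAGCAAT ATCGAATTAC TTCTGGCAAC GCTTCCGGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```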
 
Protein sequence
MAVQQDITSN IELLLATLPA PIRQAIETGP DDREDLLEVI MDLGRLPEAR YRNHERFLSD 
REVTQEDIDY VIGRIGAFGE DNRAGIPRTL HRISAIRNRT GRVIGLTCRV GRAVFGTITI
LRDLIESGKS VLLLGRPGVG KTTMLRETAR VLAEDLRKRV VIVDTSNEIA GDGDIPHPGI
GRARRMQVPR PSEQHAVMIE AVENHMPEVI VIDEIGTELE ALAARTIAER GVQLIGTAHG
QTLENLLSNP TLSDLIGGIQ AVTLGDEEAR RRGTQKTVLE RKAPPTFDIL VEIQNWDQVT
VYPDVASAVD SLLRSEPPQA EVRRRTADGE IEVFPAFTGE HAVPSIPGIR RGGGRERGER
SPRVSSAPAA SSVGMTPQRI YPFGVSRDRL ERAIASLHVP ATIVRDMGEA TMVMTLKNYY
RQGSQRVRQA EERGVPVYVL RNNTLAQMER QLADVFNISL SDNGLQRRAD RDDDATMTEA
LLEVETAITQ VLNGERSSVE LQPQSSYVRR LQHQMAERYN LQSESRGREP NRRVKIFR
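
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that under stated assumptions: it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in which codons may act as starts, and this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, and a sequence containing characters other than A, C, G, T would raise a `KeyError`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGCAGTGC AACAGGATAT TACCAGCAAT"))  # MAVQQDITSN
print(translate("TTTCGTTAG"))                         # FR
```

The two outputs match the first ten and last two residues of the protein sequence above.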