Gene RoseRS_3729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3729 
Symbol 
ID5210710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4665908 
End bp4667089 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content65% 
IMG OID640597324 
Productradical SAM domain-containing protein 
Protein accessionYP_001278033 
Protein GI148657828 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0105628 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTTG ACGACAAACT TGCCATCCTG TCGCCAGCGG CGCGTTTTGA TGCGTGTGAT 
CGCTTCCTGG GGAAGCGGCG CGCCACGCCG CGCACGGTGC TGTGGGACGA TCGTGCAGTT
GCTGCCGATA CCGACAACGC GGGGCGTGCG CTGCCAGTCT TCCGCCTGTT GCTGAGCAAT
CGCTGCGAAT GGAATTGCGC CTACTGTCCG CTGCGCTCCG GAAACGACAT ACCACGCGCG
GCGCTGACGC CCGATGAACT GGCGCGCGTC TTTCTTCCCC GCGTCGAACG TGGTGCAGTG
CAGGGGTTGC TCCTCTCCAC CGGCGTCGAT GGCAATCCGG CGGTTGCGAC CGGTCGGATG
CTCGATGCTG TCGAGATGCT GCGCACGCGC TACGGCTACA GCGGGTATGT GCATCTGAAA
CTGCTGCCCG GCGCGCCAGC CGCTGAAATC GAGCGTGCTG CGCGCCTTGC CAACCGCATC
AGTCTCAACC TGGAAGCGCC GACTGCGGCG CATCTGGCGC GTATTTCGCC CGAACGCGAC
TGGCTGCGCG ACCTGATCGC ACCGCTGGCG CTGGCGCGTG ACTGGAGTCG AACGGGCGCG
ATCGGTGCAG GGCTTGCGAC GCAGTTCGTT GTGGGTGCAG CCGGAGAGAG CGACCGCGAT
CTGCTGGTGA CGACCACCTG GCTCTACCGC GACCTGGCGC TGCGTCGGGT CTATTTTGGC
GCATTCCGAC CGGTGACCGG CACACCGCTG GAGAGTCGTC CGCCGACGCC GTTTGTGCGC
GAGCAGCGTC TCCGTGAAGC AGACTGGTTG CTGCGACGCT ATGGCTTCGA GCAGCAGGAA
CTCCCGTATG ATCCTGATGG CAACCTGCCG TTACACCTCG ACCCGAAACT GGCGTGGGCG
CTGGCGCATC CGGAACGCTT CCCGGTCGAA CTGAACCGTG CCGACCGGGA CGAACTTCTG
CGCGTACCGG GGTTGGGTCC GGTAAGCGTG GCGCGCATTC TCCGCCTGCG CCGCGAAGGG
CGTTTCCGCT ATCCGGATCA TCTTGCAGCG CTGGGTGGGG CGCTCGCGCG CGCGCGTGAC
TTTATTACGC TCGATGGGCG CTTTTTCGGA CGCAATGAAC GTGATCGCCT GCGACACTAT
GCGCGCCGGT CGCCGATCGC CGAACAGTTA ACGCTGTGGT AG
 
Protein sequence
MDLDDKLAIL SPAARFDACD RFLGKRRATP RTVLWDDRAV AADTDNAGRA LPVFRLLLSN 
RCEWNCAYCP LRSGNDIPRA ALTPDELARV FLPRVERGAV QGLLLSTGVD GNPAVATGRM
LDAVEMLRTR YGYSGYVHLK LLPGAPAAEI ERAARLANRI SLNLEAPTAA HLARISPERD
WLRDLIAPLA LARDWSRTGA IGAGLATQFV VGAAGESDRD LLVTTTWLYR DLALRRVYFG
AFRPVTGTPL ESRPPTPFVR EQRLREADWL LRRYGFEQQE LPYDPDGNLP LHLDPKLAWA
LAHPERFPVE LNRADRDELL RVPGLGPVSV ARILRLRREG RFRYPDHLAA LGGALARARD
FITLDGRFFG RNERDRLRHY ARRSPIAEQL TLW