Gene RoseRS_1303 details

Gene Information

Locus tag: RoseRS_1303
Symbol:
ID: 5208255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1605836
End bp: 1607482
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 66%
IMG OID: 640594918
Product: hypothetical protein
Protein accession: YP_001275657
Protein GI: 148655452
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
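
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1605836
end_bp = 1607482
gene_length_bp = 1647
protein_length_aa = 548

# Assumption: coordinates are 1-based and inclusive, so both end points count.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and is not translated.
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three checks agree with the listed values: 1647 bp spans 549 codons, of which 548 encode amino acids.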


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.480323
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGTC ATAGCCAACA GCAGATAGGG CTTATGGAGG CGCTCGATGT CGTTCTTGAT 
CGCCTGTTGC GCGGCGCCGA CATCGATGAG TGCCTGAGTC TGTACCCGCA TCTGGCGGTT
GAGCTCGAAC CGCTGTTGCG TGTTGCGGGG ATGGTACGCG CAGAGGTGAC GCAACCGCTG
CCGCCTGAAA TGGAACGCTG GCTGGCGACC GGGGCGCAGG AGTTTGCCGC AATTGCCGAT
CAGATGCTTG CGCGTCGGCA TGCCCGGCGC AATCTGCTCA AACCGCTGCG CAAGGCTGCC
GTTCAACGTG TCCTGGTCGG CGCCCTGGCA GTGACGGTTC TGCTTGCATC GGTTGACACG
GCGTCGGCGC AAAGTCTGCC GGGCGACCCG CTCTATGTCT GGAAAGTGGC ACGGGAAGAT
CTGACGCTTT CGATGACGTC CGATCCAGTC CAGCGGAGTA AACTGCACGT CACCTATGCC
CGCCGCCGCC TTCTGGAAAT TAATGAGATG CTCGCCAGTG ATGCGGCAAT CGATCCACAG
GCGCTCAGGG AGCCGCTTGC CCTTCTCAGC AGTCACATCC GCGGCGCTGT CATCGAAAGC
CGCGACATGG ATGTTGTCGA TGTGTCGGTC GATATCACTG CACTCCTCGG TGAGGTGAGG
ACTGCTCTGT CGCGGCTTGC GTCGAAAGTT CCAGATGCCT CTCCGCTGCT CGAAAACGTC
CAGGAGCAGA TCGACTCGGT GATTGAACCG ACAGCGTCGC CGGTTCCGAT TGCGACCGCG
TCGCCAGCAC CCTCATTGCC GCCGACAACG CCGGTCGAAG CGCCGACAAC TGTGGAAATC
ACGGCTCCGG AAGCTGCACC TCAGACCGGG CGCGAGCCTG AACGGGTCGA TGCCGCCACC
CCTATCGCAA CACCGCCGCC GACCAGTCGC CCGCCTCGCC CGTCGCCGAC GCCCGGTCAG
ATTGAGCCAA CCGCAAGCAA CGCCCCGGCG CCGACTGCAA CGCCTGCGCC CCGATCACCA
ACGGCGACGC CGCCCCCGCC ACCAACAATG ACGCCGACGG CGCCGCCAAC TGAGGCGTCG
CCGACAAATA CGCCGCTGCC AACCAACACG CCATCACCAA CGGCAACGCC ACCGCCGACG
GCGACTCGTG TTCCACCGAC CGAACCGCCG TCAGCCAGTT CGACGCCACA ACCGCCGCCA
ACAGCGCGTC CGCCGCGTCC AACGGCGACA CCACGCCCAA CGATAACCCC GACGTCGGCG
CCGACCGCAA CACCGACTGA TCCCCCGGCG CCGACTGCAA CACCGACTGA TCCCCCGGCG
CCGACCGCAA CGCCGACGCC AGCGCCGACC GCAACGCCGA CGCCAGCGCC GACCGCAACG
CCGACGCCAG CGCCGACCGC AACGCCGACG CCAGCGCCGA CCGCAACACC GACTGATCCC
CCGGCGCCGA CCGCAACGCC GACTGAGACG CTGCCACCCA CCCCGTCGAT CACACCAACT
GATGAGGCGG GACAACCAAC GGTGACGCCA GCGGGTTCAG AGCCGTCAGG TATGCCGACT
CTAACGCCAA CACCAGCGGG TTCAGAGCCG TCAGGTATGC CGACCCTAAC GCCAGATGAC
GCAGGCGCAC ATGGCGCCAA TCTGTAA
 
Protein sequence
MIRHSQQQIG LMEALDVVLD RLLRGADIDE CLSLYPHLAV ELEPLLRVAG MVRAEVTQPL 
PPEMERWLAT GAQEFAAIAD QMLARRHARR NLLKPLRKAA VQRVLVGALA VTVLLASVDT
ASAQSLPGDP LYVWKVARED LTLSMTSDPV QRSKLHVTYA RRRLLEINEM LASDAAIDPQ
ALREPLALLS SHIRGAVIES RDMDVVDVSV DITALLGEVR TALSRLASKV PDASPLLENV
QEQIDSVIEP TASPVPIATA SPAPSLPPTT PVEAPTTVEI TAPEAAPQTG REPERVDAAT
PIATPPPTSR PPRPSPTPGQ IEPTASNAPA PTATPAPRSP TATPPPPPTM TPTAPPTEAS
PTNTPLPTNT PSPTATPPPT ATRVPPTEPP SASSTPQPPP TARPPRPTAT PRPTITPTSA
PTATPTDPPA PTATPTDPPA PTATPTPAPT ATPTPAPTAT PTPAPTATPT PAPTATPTDP
PAPTATPTET LPPTPSITPT DEAGQPTVTP AGSEPSGMPT LTPTPAGSEP SGMPTLTPDD
AGAHGANL