Gene RoseRS_1917 details


Gene Information

Locus tag: RoseRS_1917
Symbol:
ID: 5208878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2383213
End bp: 2384382
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 55%
IMG OID: 640595526
Product: arsenical-resistance protein
Protein accession: YP_001276256
Protein GI: 148656051
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
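
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption that agrees with the listed gene length) and that the CDS ends in a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2383213
end_bp = 2384382

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons; the final codon is
# assumed to be a stop codon, which does not contribute a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1170 0 389
```

Both results match the Gene Length (1170 bp) and Protein Length (389 aa) fields.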


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00090668
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCACACA CTTCCACAAC CGTGCCGCAC TCCACGCCGA GCGTCACGCG CCAGCTTTCC 
GCCCTGGATC GCTACCTCAC CCTCTGGATT TTCCTGGCAA TGGCTCTGGG AGTCGGTCTG
GGCTATTTCT TGCCCGGCGT CGAGCAGTTC ATCAATCGCT TCCAGGTCGG TACAACCAAT
ATTCCGATTG CGATTGGCCT GGTATTGATG ATGTATCCGC CGTTCGCCAA AGTGAAATAT
GAGGAACTGG GCGAGGTTTT TCGTAACACA AAGGTGCTTG GCTTATCACT GATCCAGAAC
TGGGTCGTCG GACCGATCCT CATGTTTGGG CTGGCAATCA TCTTCCTGCG TGATTATCCC
GAGTACATGG TCGGCCTGAT TCTGATTGGT CTGGCACGCT GTATCGCCAT GGTGATCGTT
TGGAACGAAC TGGCAAAAGG CGATACCGAA TATGCCGCCG GGATTGTAGC GTTCAACAGC
CTGTTCCAGG TCTTCTTCTA CAGCATTTAC GCCTGGGTGT TCATTACTGT ATTGCCGCCG
TTGTTTGGCA TGCAGGGCAG TATCGTTCGC ATCGGCATTG TTCAGATTGC TGAAAGCGTG
TTTATCTACC TGGGCATCCC GATGATCGCT GGCTTCCTGA CCCGCTTTAT CCTGCTGCGC
GCCAGGGGGC GGGAGTGGTA TGAACGCGTC TTCGTGCCAC GTATCAGTCC ACTGACGCTG
ATTGCGCTAC TGTTTACAAT CGTGTTGATG TTCAGCCTGA AGGGTGAATT GATCGTCCAG
ATCCCGCTTG ATGTGGTGCG CATCGCCATT CCGCTGCTGC TCTATTTTGT GCTCATGTTC
CTGGTCAGCT TCTGGATAGG CTATCGTCTC GGCGCAGATT ACCGCAAAAC GACGACGCTC
TCGTTCACTG CGGCGAGTAA CAACTTCGAA CTGGCGATCG CTGTTGCAGT CGCCGTCTTC
GGCATCGGTT CCGGCGCAGC CTTTGCAGCG GTGATTGGTC CGCTGATTGA GGTGCCGGTG
ATGATCGGCC TGGTGAATGT CGCCTTCTGG TTCCAGCGGC GCTATTTCGC TCATGAAGCC
CAACCAGCCG ATGTCTTGTT GCAAACGACC GCCGAGGCTG GCTGTGGGTC GCCGGTGCGT
GATCCAGGAC AACGAACAGC GAACCGATGA
 
Protein sequence
MSHTSTTVPH STPSVTRQLS ALDRYLTLWI FLAMALGVGL GYFLPGVEQF INRFQVGTTN 
IPIAIGLVLM MYPPFAKVKY EELGEVFRNT KVLGLSLIQN WVVGPILMFG LAIIFLRDYP
EYMVGLILIG LARCIAMVIV WNELAKGDTE YAAGIVAFNS LFQVFFYSIY AWVFITVLPP
LFGMQGSIVR IGIVQIAESV FIYLGIPMIA GFLTRFILLR ARGREWYERV FVPRISPLTL
IALLFTIVLM FSLKGELIVQ IPLDVVRIAI PLLLYFVLMF LVSFWIGYRL GADYRKTTTL
SFTAASNNFE LAIAVAVAVF GIGSGAAFAA VIGPLIEVPV MIGLVNVAFW FQRRYFAHEA
QPADVLLQTT AEAGCGSPVR DPGQRTANR
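
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, not the method used to produce this record. It assumes the codon-to-residue assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may serve as start codons, which does not matter here because the gene begins with ATG). The helper `translate` and the constants are hypothetical names introduced for illustration, and only the first and last 30 bases of the gene are used.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
# Assumption: table 11 shares these codon-to-residue assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


# First and last 30 bases of the gene sequence, copied from the record.
first_30 = "ATGTCACACA CTTCCACAAC CGTGCCGCAC"
last_30 = "GATCCAGGAC AACGAACAGC GAACCGATGA"

print(translate(first_30))  # MSHTSTTVPH
print(translate(last_30))   # DPGQRTANR
```

The two outputs agree with the first ten and last nine residues of the protein sequence; the final codon, TGA, is a stop codon and yields no residue.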