Gene RoseRS_2104 details

Gene Information

Locus tag: RoseRS_2104
Symbol:
ID: 5209066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2597304
End bp: 2598929
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 64%
IMG OID: 640595706
Product: secretion protein HlyD family protein
Protein accession: YP_001276435
Protein GI: 148656230
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID:
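
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2597304, 2598929)
print(length_bp)                  # 1626
print(protein_length(length_bp))  # 541
```

Both values agree with the Gene Length and Protein Length fields of this record.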


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCCAC GTGCACGAAT AATCATTCTG CTCGTCCTGG TCGCCGTGCT CATCGGCGGT 
GGCTACTGGT TGTTGCAGGA ATCGGCGCAG GCGAGGAACG GAACCCTGAC CGGCACGGGA
ACGATTGAAG CGGAAGAAGT GCTGGTGACG GCCGAGATCG CCGGACGTGT GCAGCATCTC
TTCGTCGATG AAGGGTCGGA GGTGCGGATC GGGGAGGATC TGGCGCAGAT CGACACTGCA
TTGCTCGAGG CGCAACTGGA TCAGGCGAAG GCGGCGGTTG CAGCAGCCGA GGCGAACCTG
GCGCAGATCC GTGCCGGAAC CCGCAGCGAG GAGATTGCCA TCGCTGAGGC GCAGGTCAAA
CAGGCGGAGG CGGCAGCAAG AGGGGCGGAG CAGGCGTATG AGCATGCACT GGAGATTCTG
AACAATCCGC AGGAACTGGA ATTGCAACTG GTGCAGGCGC GCGCGCAACG TGATGCGGCG
CTGCGAACGC TGGAAAAACT GCGCGCTGGC GTCCGCCAGG AAGATATCGA TGCTGCGCGC
GCATCGATGC ACCTGGCACA GGAAGGGTTA CAATCCGCCC GCGACCGCCT CTCCGCTGCG
AAAACGCTGG CGGAGGCGCA GGTGGAACAG GCTGCGGCAG CGTTGACCCA GGCGCAGGCG
CGGTATGCGC AGGCGAAGAC CAACTGGGAG TATGTGCGCG ATACGGGTCA GGATGCGTTG
ATGCCGACCG TGCTGGTCAC ACTGCCGACC GGTGTGCAAC GTCTGCCGAA TACCGTAAGT
GACGGGGTGC GCGAACAGTA CTATGCGCAA TTCGTTCAGG CGGAAGCCGC GCTGCGCCAG
GCGGAGACGG CGCTCGAACA GGCGTTGGTG CAGGCGGAAG AGGCGCGCCA GGCAGAGGCG
GTCGGCGTGC GCCAGGCGGA GTTGCAGGTG AGTATCGCAC AGGCGACGCT GGCAAAAGCG
GAAGCAGGAC CGACCCGTCA GGAGATTGCT GCTGCCGAGA CTGCACTCGC AAATGCGCAG
CGGTTGCTCG ACGTGGTCGA AGCAATGCGC GCCAATCCGC AGCAATTGCG CGCTGCCGTT
GATGCTGCGC GCACGCAGCA AGCGATTGCA GAAGCGCAAC TGGCGCAGGC GCGGGCGCGG
CTTGATCTGG CGCGCAATGG CGCGCGTCCA GAGCAGATCC AGGCGGCGGA AGCGCAACTG
GCGCAGGCGC GGGCATCCCT GCATCAGGTG GAGGTGATGA TCGACAAGGC AAGGTTGCGC
GCACCGCGCA CCGGCATCGT CCTGAGCCGC CCGATCCACG AAGGAGAGCA GGTGACCCCC
GGCACGCCGT TAATGACCAT CGGCGCCCTC GATCCCGTGC GCCTGACGGT GTACATCAGC
GAGGCGGATA TCGGGCGTGT GCGTCTCGGT CAGGCGGCAG AGGTCACAGT GGACAGTTTT
CCAGGTCGAG TCTTTCATGG TACAGTAACA TTCATTGCCC AAAGAGCGGA ATTTACGCCG
CGGAACGTGC AGACGCGCGA TGAGCGCGCA ACGACGGTGT TCGCGGTACG GATTGAGTTG
CCGAATGCTG ACTACGCCCT CAAGCCGGGG ATGCCAGCGG ATGTTGTGCT GGTGGAGAAG
GACTGA
 
Protein sequence
MHPRARIIIL LVLVAVLIGG GYWLLQESAQ ARNGTLTGTG TIEAEEVLVT AEIAGRVQHL 
FVDEGSEVRI GEDLAQIDTA LLEAQLDQAK AAVAAAEANL AQIRAGTRSE EIAIAEAQVK
QAEAAARGAE QAYEHALEIL NNPQELELQL VQARAQRDAA LRTLEKLRAG VRQEDIDAAR
ASMHLAQEGL QSARDRLSAA KTLAEAQVEQ AAAALTQAQA RYAQAKTNWE YVRDTGQDAL
MPTVLVTLPT GVQRLPNTVS DGVREQYYAQ FVQAEAALRQ AETALEQALV QAEEARQAEA
VGVRQAELQV SIAQATLAKA EAGPTRQEIA AAETALANAQ RLLDVVEAMR ANPQQLRAAV
DAARTQQAIA EAQLAQARAR LDLARNGARP EQIQAAEAQL AQARASLHQV EVMIDKARLR
APRTGIVLSR PIHEGEQVTP GTPLMTIGAL DPVRLTVYIS EADIGRVRLG QAAEVTVDSF
PGRVFHGTVT FIAQRAEFTP RNVQTRDERA TTVFAVRIEL PNADYALKPG MPADVVLVEK
D
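
The relationship between the two sequences can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the start codons it permits. The sketch does not handle alternative start codons, which is harmless here because this gene begins with ATG. The helper names are illustrative, and only short fragments of the gene sequence are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCACCCAC GTGCACGAAT AATCATTCTG"))  # MHPRARIIIL
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GTGGAGAAGGACTGA"))                   # VEKD
```

The two outputs match the first ten and last four residues of the protein sequence above.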