Gene RoseRS_3910 details

Gene Information

Locus tag: RoseRS_3910
Symbol:
ID: 5210893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4894318
End bp: 4897776
Gene Length: 3459 bp
Protein Length: 1152 aa
Translation table: 11
GC content: 64%
IMG OID: 640597506
Product: WD-40 repeat-containing protein
Protein accession: YP_001278213
Protein GI: 148658008
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
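
The start, end, and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4894318
END_BP = 4897776


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3459
print(protein_length(length_bp))  # 1152
```

Both results agree with the Gene Length and Protein Length fields reported for this record.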


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTATGA CCGATCATCC GCCGCCAGAA CCGTCGCGCG ACCACGAAGT GCTGGCATTT 
CCTTCATTTG CAGCGCTGCG TGCAGCGCAC GCCGAGATGC TGCGTCGCCT CCACGAAGAG
GGATTGACGA CTGCACTGCT GAGTCGGGCG GCTGAGATGA TTGCGCGCGG CAGCGCCACC
GGCGTGATCC TCGACGATGA AGACGAACGC ACCGCTGCGC AGAGTTTGCT CGATTATTGG
GCGACAATCC TTATCCGTAA GGGATATGAG CCTCCCGACG CCACTCTCGA TGAGTTCGAT
CCTTCCCTCG CGCCGGAATT GGACGACTCA CAATGCCCTT ACGTCGGGCT GGATCCATTC
CGTGAAGAGG ATGCAGCGCG CTTCTTCGGC AGACGCCGCC TGGTCGAGTT CCTCGTCGCT
CGCCTGACCG AACACCGCAT GCTGGCGCTG GTCGGTCCTT CTGGTTCCGG CAAATCATCC
GTCGTGCGCG CCGGTCTCAT TCCGGCGCTG AAGCAGAATG CTATTCCTGG CAGCAGCGCC
TGGCGATACA TACCCATCAT TACGCCTGGC GAACAACCCC TGTTCCACCT GGCGCATGCG
TGGCGCCGGT TGAATGGTCA ACCCGACGAC CAGGCGCAGA GTGAAGAACT TGCGGAACGG
TTCCGCGATC CGCAGCAGGT GGTCGCTGCA TTCGACGGGA CTGAGGCGGT AGTGCTGGTG
GTCGATCAAA TGGAGGAATT GTTCACCCTC ACCGACGACG AATCGGAGCG TCGCGCATTC
GTGGCGGCGA TGGTGGCGCT GGTCGAGCAC CCATCACCCC GGCATACGGT CATCACGACG
CTGCGCAGCG ACTTCGAGAG TTTTATTGCG CTCGAACCGG CGTTGCAACA GTGGATGGAA
CAGGCGCGCG TTCAGTTGCT GCCGCTCAGC GCCGGTGAAA TGCGTGAAGC GATCGAACGC
CCGGCGGAAC TGGTCGGTCT CAAGTTCGAA CCGGGCGTCG TCGAAGCGCT CCTGCACGAT
ACGCTCGGCG AACCGGCAGC GTTGCCGCTG CTCCAATTTT CGTTGCTCAA ACTGTGGGAC
GCGCGCGACC GCAATCGGGT GACCTGGCAG GCGTACCAGC AGATCGGCGG CGGGCGACAG
GCGCTGGCGC GCAGCGCCGA TGCGCTCTAC AACAGCCTGA TCCCGGAAGA TCAGGTGACG
GCGCGCCGCA TTCTGCTCCG CATGGTGCGT CCAGGCGAGG GGTTGGAGAT CACCAGCAAT
CGCATTCGAC GCGATCAACT CTATCGCGCA GGCGAGGCGC GTGATCGTGT TGATCGGGTG
CTTGAACGCC TGGTTGCTGC ACGCCTGGTG CGGCTTACCC CCGGCGATAC TCCCGGCAGC
GCACAGGTGG AAATCGCCCA TGAAGCGCTT GTGCGCAACT GGCCCACGCT GGTTGGCTGG
CTCGAAGACG AACGCGCGGC CATCGCAACA CGTCGGCGCC TGGAAGCAAA AGCGGCGGAA
TGGGTACGGC TTGGCGGCGG CGCAGCGGGT CTGCTTGACG AGGTGCAACT GGCAGAAGCC
GAACGCTGGT TGAACAGCGC CGAGGCTGCC TACCTGGGAT TCGACGAAGC GCTGCTCGAT
CTGGTGATCG CCAGCCGCAC AGCCATCGAT CAGGCGCGCC ACGCACAGGA GGATCAGCGT
CGTCGCGAAC TCGAACAGGC GCAGGCGCTG GCGGAGGCGC GTCGCCTTCA GGCAGAACAG
AACGAAAAAC TGGCAATCGA GCAGGGAAAG CGCGCCGAAG CCGAGGCGCG CTCGGCGCGT
CGGCTGCGTC TGATTGTGGC GACGCTGACA GTCGCCCTGA TCGGCGCGAT CATGTCGGGC
ATTCTGCTCC TTCAGCAGCA ATCACAGATC CAGGAACGCA CCTTTGAACT GGCGACGCAG
GTAAGTATTT CCGACCAGGC GGCTGCAACC GCTCGAATTG CAGCGGCGAA CGCGCAGGCA
GCCGCTGCAA CCTCGCAGGC TGCCGAAAGC CTGGCGCTGT TGCGTGGCAC CCAGGCGGCG
CTGGCAGCAG CGCAGGAATC TCAGGCGCGC CAGACTGCCG AGGCGCTGGT TCCGGCGCTC
GAACGGCAGG CACGCCAGGC GCGCGCCGGT CAACTCGCGG CACTGGCGCA GGCGAAACGC
GACGAACGAC CGATCCTTAG CCTCCTCCTG GCGATTGAGG CGGTGCATAT CACCCTTCAG
AAAGGCGAAG CGCCGGTTCC GGTGGCTGAC GCAGCGTTGC GCGACTCACT CACGCGCATC
GGCGGCATCG GGCTATTGAT CGGTGAGCCG GTGGTCGCTG CGGCAACCAG CGACGACGGC
AGGGTTGCTG TAGCAATGAC CGCAAATGGA ACCCTCAGCA TCTGGAATCT GACCGATCCT
GGCTTGCCGC CACGGGTCGT CAACACGCCC GGATCACCAA CGCTGCTCGT CCTCTCACGC
GATGGCGCGC GCCTGGCAAC CACTGGCGCC GATCCGACGA GCGTCCGTCT GTGGGATCTT
TCGGGAACAA TGCCGGTCGC ACGCGAACTG GCTGCGCCAG GCGGCATCAA TACTGCGCTC
GCCATCAGCG ACAATGGTCG TCAACTGGCG ATTGGCGATG ATCAGGGCGT TGTCCTTGTG
TACGACCTGA CCAACCCGTC AGCGACGCCG CAGCGCCTCA GCGGTCTCGG CGGGCGCAGC
GCCGTTCGCT CGCTGGCATT CAGCCCCGAT GGTCAACTGC TGGCAACCGG CAACGCGGAT
AGTCAGGCGC GGTTGTACAA TCTGGCGTCC GGCTCTGGAT CGTACACGCG CGAACGCACA
CGCGGCGCAC TGACCTCGAT CACCTTCAGT AACAACGGTC GCTGGATTGT CTATGGCAGC
GCCGATGGAC AGGTGCGCCT CTGGCGCTTA AGCGGAGCAC AGTTTGACTC AGCCTATGTC
CTGCTCGGAT TGACAGCCGA AGTGACCGAC GTGCGCATTG CCCCCGGCGA CAGGCTCATC
ATTGCGGGCA GCGCCGATGG GACGACATGT ATCTGGGATC TTGAAGCGCG GAGCGACAAC
CGCGCCCGTG TGGTGCTGCG CGGTCAAACT GCCAGGATCA CCGGACTGGC ATTGAACGGC
AACGCCAGTC GCCTGGCAAC CGCTGGCGCC GATGGGCGGA TTGCGCTGTG GAACCTGACC
GCAGCCGATC CTGGTGTGAA CCCGCTGATC ATGCGCGGTC ATGATGGCCC GGTGAACGAT
GTGGCGATAC CGGCGCGCAC CTCCCTGATG TTGACGGTCG GCGCCGACGG CATGGCGCGT
GTCTGGAATC TCGATGCGCC GCTGTATGCG CTGAAAACAC TGCCGCAAGA GAGCGTTGAA
CTGCTGGAAG TCGCCTGCCG CGCTGCCGGG CGAACACTGT CGGAATCCGA ATGGAGCGAA
TACGTCGAAG GGCTGCCGTA CAACCCGTTC TGCAAGTAA
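
The gene sequence is laid out in blocks of ten bases, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that clean-up together with a GC-content calculation. The helper names are hypothetical, and GC content is assumed to be the percentage of G and C bases over the full sequence length. Passing the complete gene sequence to `gc_content` allows a comparison with the 64% reported in the table.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Last line of the gene sequence, copied as displayed.
last_line = "TACGTCGAAG GGCTGCCGTA CAACCCGTTC TGCAAGTAA"
fragment = clean_sequence(last_line)
print(len(fragment))  # 39
print(round(gc_content(fragment), 1))
```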
 
Protein sequence
MSMTDHPPPE PSRDHEVLAF PSFAALRAAH AEMLRRLHEE GLTTALLSRA AEMIARGSAT 
GVILDDEDER TAAQSLLDYW ATILIRKGYE PPDATLDEFD PSLAPELDDS QCPYVGLDPF
REEDAARFFG RRRLVEFLVA RLTEHRMLAL VGPSGSGKSS VVRAGLIPAL KQNAIPGSSA
WRYIPIITPG EQPLFHLAHA WRRLNGQPDD QAQSEELAER FRDPQQVVAA FDGTEAVVLV
VDQMEELFTL TDDESERRAF VAAMVALVEH PSPRHTVITT LRSDFESFIA LEPALQQWME
QARVQLLPLS AGEMREAIER PAELVGLKFE PGVVEALLHD TLGEPAALPL LQFSLLKLWD
ARDRNRVTWQ AYQQIGGGRQ ALARSADALY NSLIPEDQVT ARRILLRMVR PGEGLEITSN
RIRRDQLYRA GEARDRVDRV LERLVAARLV RLTPGDTPGS AQVEIAHEAL VRNWPTLVGW
LEDERAAIAT RRRLEAKAAE WVRLGGGAAG LLDEVQLAEA ERWLNSAEAA YLGFDEALLD
LVIASRTAID QARHAQEDQR RRELEQAQAL AEARRLQAEQ NEKLAIEQGK RAEAEARSAR
RLRLIVATLT VALIGAIMSG ILLLQQQSQI QERTFELATQ VSISDQAAAT ARIAAANAQA
AAATSQAAES LALLRGTQAA LAAAQESQAR QTAEALVPAL ERQARQARAG QLAALAQAKR
DERPILSLLL AIEAVHITLQ KGEAPVPVAD AALRDSLTRI GGIGLLIGEP VVAAATSDDG
RVAVAMTANG TLSIWNLTDP GLPPRVVNTP GSPTLLVLSR DGARLATTGA DPTSVRLWDL
SGTMPVAREL AAPGGINTAL AISDNGRQLA IGDDQGVVLV YDLTNPSATP QRLSGLGGRS
AVRSLAFSPD GQLLATGNAD SQARLYNLAS GSGSYTRERT RGALTSITFS NNGRWIVYGS
ADGQVRLWRL SGAQFDSAYV LLGLTAEVTD VRIAPGDRLI IAGSADGTTC IWDLEARSDN
RARVVLRGQT ARITGLALNG NASRLATAGA DGRIALWNLT AADPGVNPLI MRGHDGPVND
VAIPARTSLM LTVGADGMAR VWNLDAPLYA LKTLPQESVE LLEVACRAAG RTLSESEWSE
YVEGLPYNPF CK
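
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses only the Python standard library and builds the codon table from the standard amino-acid string in TCAG order. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation; since this gene starts with ATG, alternative start codons are not handled. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGAGTATGA CCGATCATCC GCCGCCAGAA"))  # MSMTDHPPPE
print(translate("TGCAAGTAA"))                         # CK
```

The two results match the first ten and last two residues of the protein sequence listed above.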