Gene RoseRS_2154 details

Gene Information

Locus tag: RoseRS_2154
Symbol:
ID: 5209116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2649741
End bp: 2652866
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table: 11
GC content: 62%
IMG OID: 640595755
Product: WD-40 repeat-containing protein
Protein accession: YP_001276484
Protein GI: 148656279
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
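
The length fields in this record follow from the coordinates. The short sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is an untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the Gene Information fields.
start_bp = 2649741
end_bp = 2652866

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the final codon is the stop codon
# and does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 3126 0 1041
```

Both results agree with the Gene Length (3126 bp) and Protein Length (1041 aa) fields.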


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCAG GCGAGAAGGT TATCCATGAT CGCTACCGCA CGATCTACAC GATCGACGAG 
CGACCCGGCG TGAAAACATA TCGGTGTCGC GATGAACAGA GCGGCGAGTT GACGCTGGTT
GCGGAGTTCA CAGTTGATGA TGAGGCACGG AGCGATCTGG CAATTCTCGC AAAGCAGATC
GCGGCGGTGA GTCACGAAGC GCTGCTGCCG CTGCGCGATC ATTTCGCCGG GGACTCACAC
TACTTCATGG TGTGCGCCGA TCCCGGCGGT CAGGACCTGG AGCGGTCGAT CCGGGCGCGT
GGCGGTCCAC TGCCGGAAGC CGATGTTCTG GCGCAGGCGA ACCGTTTACT CCTGCTTCTG
GAACATCTGC ACAGTCAGCG TCCGCCCCTG TTCCTCGGCG ACCTGGCAGT GACTGATGTC
TGGATCACTG ATCGGGGCGC CTGGATGGTG ACCCCGTTCA CGCTGGCGAT ACCGATTGGG
CAGGGTCCAT CGCCCTACCG TGCGCCTGAA CTGGCACGCG CCGATGCGGA ACCGACGACA
GTGAGCGATG TCTACACCAT CGGCGCGCTG ATGTACCATG CATTGACGGG TTGGGCGCCG
CCGACGCCGG AGCAGCAGAA CGCAGGCATG CCGCTGCCGG GACCACGGAC GCTCAACCCG
CAGATCTCTC CGCTGGTTGA ACAGGTATTG TTGCGCGCAC TGCAACTGAA ACCGGTCAAT
CGGTTTCAGC AGGCGCGTGA AATGCGGATC GCGCTGGAAA CGGCGCAGAT GATGGGCGGA
CGTTCGCTTG GTCTCGGTCC CGATGTGCTC ACCCAGGTGA CCACTGCCGA GGCTCAGGCA
GCGCCAGAGC CGGTCGCCGC TCCGCCCGAA ACATCGGTAA CATCGGTTGC ACCACCGAAC
ATTGCACCCC CGCCAGCCGC TCCACCGGTT GCACCCGCTC CTGCGCCTGT CGCACCTCCA
CCATACCCGC AGCCAGTCGC CTATCCCCCT GGTTATGCCC CCGTCCCGCA GCGACAGGGG
TTGAGCACCG GGTGCCTGAT CACCTCGGCA GTCCTGCTGA CTGTCGCGGC GATCGGTGTG
TGCCTGGCAA TTGCGGTGTT TCTGCCGGGA AGTCCGTTGC GCCAGATGCT CGGCATTGCC
GGGTCTGCCC CCGCCTCGAC TGCGGCGCCC GTTGATGCGA CTCCCGCAGC GACAGCCACG
CCAACCGAAG AGAGTGGCGC AAGCGCCGAA CCAACTGTTC CGCCTCCCAC ACCGATCCCG
GTTGACGGAT CGACGACGCT CAGTCCGAAC GCGATTTCAC CCCGGAACGT CGCTGCAATT
ACTGCAACCC GTCAATTATC GATGTCGGTG CTCGGTCCAG TCGCCTATTC GCCCGATGGT
CGTTTGCTGG CGGTGGGGGT GAGTGAAGCG GTGAGCCTGC ACGATGCGAC CACCCTCGAT
GATCGCGGCA CGTGGTTCGA CCATACGGGG AAGATCACGT CGCTCGCCTG GTCTGCCGAT
AGCACGCTCC TCGCATCGGG CGCCAGCGAT GATAACGATA TCCGAATCTG GGATGTATCA
ACCGGTACGG TCATCCGACG TCTGAGCGGG CACACCGGCT GGATCCGCAG TCTCGCCTTC
GCCCCCGACG GAACATTGCT GGCTTCAGGG AGTACCGATC AGACGGTGCG CATATGGGAT
GCCGCTACCG GTCAACTGCT GGCGACACTT CGTGGTCACA CCGGTTTTAT CGGCGGCGTG
GCGTTCTCAC CCGACAGCGC GACGCTGGCA TCTGCCTCGC GTGATGGAAG CGTCCGTCTG
TGGGATGTGG CATCGGGGAA AGAAATCAGC GGCTTCAGTT TCCGCACGGC TCTTGACCCA
ACCACTAATC TGCGTTACTG GGCGACGGGC GTCACATTCT CGCCCGACGG CAAGACCCTG
GCGGTCGGTT CGACTGAAGG GGTGGTCTAC CTGATCGATG CCACGAGTGG TCAGATCATC
CATCAGCTAC GCGGTCATAC CAACTGGATC GTCATCCGTG GACTGGCATT TTCGCCGGAT
GGGAAAACGC TCTATTCGGC GGGGCTGGAT GCCACCGTGC GCATCTGGGA TGTGGAGCGT
GGCGTGCAGA CCGCAATGCT CGATGTGCAT CGGCTTGATA TTTTCAGTAT TGCCATCAGC
CCCGACGGTG AGCGCCTGGC TTCGGTCAGC GACCAGGAAG GGCGCGTCAT TGTCTGGGAT
CTGATGCAGC AGCGTCCCGA CCTGAACCTG CGAATCGGTC TTGGATTGAT CACGTCGCTC
GTTTTTTCGC CAGATAGCGA GGTGCTTGGC TCGGTCGGCT ACAACGGGAT CATTCAGTTG
CGACTGCTGG CGAATGACCA GGTGCGTCAA TTCGCTGGCT CAGCCACATC GGTTCAATCA
CTGGCTTTCC TGCCCAATGG CCGCCTGGCG ACGCTCACCG AGCAGGATAC GGTGAGACTG
ATCGATTTCG TTCAGGAAAC GACCAGTGAT CTGCGGGGGA TGACAGGAAC ACCGCTCTGC
ATCGCAGCCG ATCCCGGCGG ATCGGTTGTG GCAGTCGGTG CAAGCGATGG GACGGTGGTG
CTGTGGGAAG GTTCGACCGG TCGATTGGTG CGTTCGCTCA AGACCGATCT GCCAGCCGTG
TTTCTGGTGG CGCTCAGTTC CGATGGTGAG TTTGTGGCAG CAGCCGGGAC GCCGAACGAT
CCACGGATCG AGATCTGGCG TGTTTCGGAT GGTCAGCGGG TGCAAACGCT GAGCGGCATG
CAGAATGCGA TTACCAGCAT CGCGTTTCAG CCGGGCGGAA CGTTGTTTGC CGCCACAGGA
ACCGATGGGG TGTTGCGGAT GTGGAACTAT CGGACAGGCG TATCCGAACG CAATATTCGC
GCCGCGCCGG AGGATGGCTG GTTTACTGCG CTGGCATTCT CGCCGGATGG TGCGCTTCTC
GCAACCGGCA CCCCTACCGG CGTCATGCAA TTCTGGAACC CGGCAAGCGG CGCGGAGATG
GGGCGCGTTG AGCAGCAGTT CGGCATTCTG GCGCTGGCGT TCAGTCCCGA CGGGGTGCAA
CTGGCGGCAG CCGGTCGGGA CGCCGGTGTC ACCATTTATC GCGCCGTGCG CGCCGGGTCG
TCATGA
 
Protein sequence
MSSGEKVIHD RYRTIYTIDE RPGVKTYRCR DEQSGELTLV AEFTVDDEAR SDLAILAKQI 
AAVSHEALLP LRDHFAGDSH YFMVCADPGG QDLERSIRAR GGPLPEADVL AQANRLLLLL
EHLHSQRPPL FLGDLAVTDV WITDRGAWMV TPFTLAIPIG QGPSPYRAPE LARADAEPTT
VSDVYTIGAL MYHALTGWAP PTPEQQNAGM PLPGPRTLNP QISPLVEQVL LRALQLKPVN
RFQQAREMRI ALETAQMMGG RSLGLGPDVL TQVTTAEAQA APEPVAAPPE TSVTSVAPPN
IAPPPAAPPV APAPAPVAPP PYPQPVAYPP GYAPVPQRQG LSTGCLITSA VLLTVAAIGV
CLAIAVFLPG SPLRQMLGIA GSAPASTAAP VDATPAATAT PTEESGASAE PTVPPPTPIP
VDGSTTLSPN AISPRNVAAI TATRQLSMSV LGPVAYSPDG RLLAVGVSEA VSLHDATTLD
DRGTWFDHTG KITSLAWSAD STLLASGASD DNDIRIWDVS TGTVIRRLSG HTGWIRSLAF
APDGTLLASG STDQTVRIWD AATGQLLATL RGHTGFIGGV AFSPDSATLA SASRDGSVRL
WDVASGKEIS GFSFRTALDP TTNLRYWATG VTFSPDGKTL AVGSTEGVVY LIDATSGQII
HQLRGHTNWI VIRGLAFSPD GKTLYSAGLD ATVRIWDVER GVQTAMLDVH RLDIFSIAIS
PDGERLASVS DQEGRVIVWD LMQQRPDLNL RIGLGLITSL VFSPDSEVLG SVGYNGIIQL
RLLANDQVRQ FAGSATSVQS LAFLPNGRLA TLTEQDTVRL IDFVQETTSD LRGMTGTPLC
IAADPGGSVV AVGASDGTVV LWEGSTGRLV RSLKTDLPAV FLVALSSDGE FVAAAGTPND
PRIEIWRVSD GQRVQTLSGM QNAITSIAFQ PGGTLFAATG TDGVLRMWNY RTGVSERNIR
AAPEDGWFTA LAFSPDGALL ATGTPTGVMQ FWNPASGAEM GRVEQQFGIL ALAFSPDGVQ
LAAAGRDAGV TIYRAVRAGS S
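
Both sequences are printed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch of one way to do that, with hypothetical helper names (`clean`, `gc_fraction`, `translate`) that are not taken from the source database. It assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in the start codons they permit).

```python
def clean(blocked: str) -> str:
    """Drop spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Codon table built from the conventional compact layout (base order T, C, A, G).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: _AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(_BASES)
    for j, b2 in enumerate(_BASES)
    for k, b3 in enumerate(_BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Trailing bases that do not form a full codon are ignored.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE[dna[pos:pos + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAGTTCAG GCGAGAAGGT TATCCATGAT"
print(translate(first_blocks))  # MSSGEKVIHD
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same kind of check against the GC content field and the complete protein listing.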