Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_2154 |
Symbol | |
ID | 5209116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 2649741 |
End bp | 2652866 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640595755 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001276484 |
Protein GI | 148656279 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCAG GCGAGAAGGT TATCCATGAT CGCTACCGCA CGATCTACAC GATCGACGAG CGACCCGGCG TGAAAACATA TCGGTGTCGC GATGAACAGA GCGGCGAGTT GACGCTGGTT GCGGAGTTCA CAGTTGATGA TGAGGCACGG AGCGATCTGG CAATTCTCGC AAAGCAGATC GCGGCGGTGA GTCACGAAGC GCTGCTGCCG CTGCGCGATC ATTTCGCCGG GGACTCACAC TACTTCATGG TGTGCGCCGA TCCCGGCGGT CAGGACCTGG AGCGGTCGAT CCGGGCGCGT GGCGGTCCAC TGCCGGAAGC CGATGTTCTG GCGCAGGCGA ACCGTTTACT CCTGCTTCTG GAACATCTGC ACAGTCAGCG TCCGCCCCTG TTCCTCGGCG ACCTGGCAGT GACTGATGTC TGGATCACTG ATCGGGGCGC CTGGATGGTG ACCCCGTTCA CGCTGGCGAT ACCGATTGGG CAGGGTCCAT CGCCCTACCG TGCGCCTGAA CTGGCACGCG CCGATGCGGA ACCGACGACA GTGAGCGATG TCTACACCAT CGGCGCGCTG ATGTACCATG CATTGACGGG TTGGGCGCCG CCGACGCCGG AGCAGCAGAA CGCAGGCATG CCGCTGCCGG GACCACGGAC GCTCAACCCG CAGATCTCTC CGCTGGTTGA ACAGGTATTG TTGCGCGCAC TGCAACTGAA ACCGGTCAAT CGGTTTCAGC AGGCGCGTGA AATGCGGATC GCGCTGGAAA CGGCGCAGAT GATGGGCGGA CGTTCGCTTG GTCTCGGTCC CGATGTGCTC ACCCAGGTGA CCACTGCCGA GGCTCAGGCA GCGCCAGAGC CGGTCGCCGC TCCGCCCGAA ACATCGGTAA CATCGGTTGC ACCACCGAAC ATTGCACCCC CGCCAGCCGC TCCACCGGTT GCACCCGCTC CTGCGCCTGT CGCACCTCCA CCATACCCGC AGCCAGTCGC CTATCCCCCT GGTTATGCCC CCGTCCCGCA GCGACAGGGG TTGAGCACCG GGTGCCTGAT CACCTCGGCA GTCCTGCTGA CTGTCGCGGC GATCGGTGTG TGCCTGGCAA TTGCGGTGTT TCTGCCGGGA AGTCCGTTGC GCCAGATGCT CGGCATTGCC GGGTCTGCCC CCGCCTCGAC TGCGGCGCCC GTTGATGCGA CTCCCGCAGC GACAGCCACG CCAACCGAAG AGAGTGGCGC AAGCGCCGAA CCAACTGTTC CGCCTCCCAC ACCGATCCCG GTTGACGGAT CGACGACGCT CAGTCCGAAC GCGATTTCAC CCCGGAACGT CGCTGCAATT ACTGCAACCC GTCAATTATC GATGTCGGTG CTCGGTCCAG TCGCCTATTC GCCCGATGGT CGTTTGCTGG CGGTGGGGGT GAGTGAAGCG GTGAGCCTGC ACGATGCGAC CACCCTCGAT GATCGCGGCA CGTGGTTCGA CCATACGGGG AAGATCACGT CGCTCGCCTG GTCTGCCGAT AGCACGCTCC TCGCATCGGG CGCCAGCGAT GATAACGATA TCCGAATCTG GGATGTATCA ACCGGTACGG TCATCCGACG TCTGAGCGGG CACACCGGCT GGATCCGCAG TCTCGCCTTC GCCCCCGACG GAACATTGCT GGCTTCAGGG AGTACCGATC AGACGGTGCG CATATGGGAT GCCGCTACCG GTCAACTGCT GGCGACACTT CGTGGTCACA CCGGTTTTAT CGGCGGCGTG GCGTTCTCAC CCGACAGCGC GACGCTGGCA TCTGCCTCGC GTGATGGAAG CGTCCGTCTG TGGGATGTGG CATCGGGGAA AGAAATCAGC GGCTTCAGTT TCCGCACGGC TCTTGACCCA ACCACTAATC TGCGTTACTG GGCGACGGGC GTCACATTCT CGCCCGACGG CAAGACCCTG GCGGTCGGTT CGACTGAAGG GGTGGTCTAC CTGATCGATG CCACGAGTGG TCAGATCATC CATCAGCTAC GCGGTCATAC CAACTGGATC GTCATCCGTG GACTGGCATT TTCGCCGGAT GGGAAAACGC TCTATTCGGC GGGGCTGGAT GCCACCGTGC GCATCTGGGA TGTGGAGCGT GGCGTGCAGA CCGCAATGCT CGATGTGCAT CGGCTTGATA TTTTCAGTAT TGCCATCAGC CCCGACGGTG AGCGCCTGGC TTCGGTCAGC GACCAGGAAG GGCGCGTCAT TGTCTGGGAT CTGATGCAGC AGCGTCCCGA CCTGAACCTG CGAATCGGTC TTGGATTGAT CACGTCGCTC GTTTTTTCGC CAGATAGCGA GGTGCTTGGC TCGGTCGGCT ACAACGGGAT CATTCAGTTG CGACTGCTGG CGAATGACCA GGTGCGTCAA TTCGCTGGCT CAGCCACATC GGTTCAATCA CTGGCTTTCC TGCCCAATGG CCGCCTGGCG ACGCTCACCG AGCAGGATAC GGTGAGACTG ATCGATTTCG TTCAGGAAAC GACCAGTGAT CTGCGGGGGA TGACAGGAAC ACCGCTCTGC ATCGCAGCCG ATCCCGGCGG ATCGGTTGTG GCAGTCGGTG CAAGCGATGG GACGGTGGTG CTGTGGGAAG GTTCGACCGG TCGATTGGTG CGTTCGCTCA AGACCGATCT GCCAGCCGTG TTTCTGGTGG CGCTCAGTTC CGATGGTGAG TTTGTGGCAG CAGCCGGGAC GCCGAACGAT CCACGGATCG AGATCTGGCG TGTTTCGGAT GGTCAGCGGG TGCAAACGCT GAGCGGCATG CAGAATGCGA TTACCAGCAT CGCGTTTCAG CCGGGCGGAA CGTTGTTTGC CGCCACAGGA ACCGATGGGG TGTTGCGGAT GTGGAACTAT CGGACAGGCG TATCCGAACG CAATATTCGC GCCGCGCCGG AGGATGGCTG GTTTACTGCG CTGGCATTCT CGCCGGATGG TGCGCTTCTC GCAACCGGCA CCCCTACCGG CGTCATGCAA TTCTGGAACC CGGCAAGCGG CGCGGAGATG GGGCGCGTTG AGCAGCAGTT CGGCATTCTG GCGCTGGCGT TCAGTCCCGA CGGGGTGCAA CTGGCGGCAG CCGGTCGGGA CGCCGGTGTC ACCATTTATC GCGCCGTGCG CGCCGGGTCG TCATGA
|
Protein sequence | MSSGEKVIHD RYRTIYTIDE RPGVKTYRCR DEQSGELTLV AEFTVDDEAR SDLAILAKQI AAVSHEALLP LRDHFAGDSH YFMVCADPGG QDLERSIRAR GGPLPEADVL AQANRLLLLL EHLHSQRPPL FLGDLAVTDV WITDRGAWMV TPFTLAIPIG QGPSPYRAPE LARADAEPTT VSDVYTIGAL MYHALTGWAP PTPEQQNAGM PLPGPRTLNP QISPLVEQVL LRALQLKPVN RFQQAREMRI ALETAQMMGG RSLGLGPDVL TQVTTAEAQA APEPVAAPPE TSVTSVAPPN IAPPPAAPPV APAPAPVAPP PYPQPVAYPP GYAPVPQRQG LSTGCLITSA VLLTVAAIGV CLAIAVFLPG SPLRQMLGIA GSAPASTAAP VDATPAATAT PTEESGASAE PTVPPPTPIP VDGSTTLSPN AISPRNVAAI TATRQLSMSV LGPVAYSPDG RLLAVGVSEA VSLHDATTLD DRGTWFDHTG KITSLAWSAD STLLASGASD DNDIRIWDVS TGTVIRRLSG HTGWIRSLAF APDGTLLASG STDQTVRIWD AATGQLLATL RGHTGFIGGV AFSPDSATLA SASRDGSVRL WDVASGKEIS GFSFRTALDP TTNLRYWATG VTFSPDGKTL AVGSTEGVVY LIDATSGQII HQLRGHTNWI VIRGLAFSPD GKTLYSAGLD ATVRIWDVER GVQTAMLDVH RLDIFSIAIS PDGERLASVS DQEGRVIVWD LMQQRPDLNL RIGLGLITSL VFSPDSEVLG SVGYNGIIQL RLLANDQVRQ FAGSATSVQS LAFLPNGRLA TLTEQDTVRL IDFVQETTSD LRGMTGTPLC IAADPGGSVV AVGASDGTVV LWEGSTGRLV RSLKTDLPAV FLVALSSDGE FVAAAGTPND PRIEIWRVSD GQRVQTLSGM QNAITSIAFQ PGGTLFAATG TDGVLRMWNY RTGVSERNIR AAPEDGWFTA LAFSPDGALL ATGTPTGVMQ FWNPASGAEM GRVEQQFGIL ALAFSPDGVQ LAAAGRDAGV TIYRAVRAGS S
|
| |