Gene Information |
Locus tag | RoseRS_0355 |
Symbol | |
ID | 5207290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 454020 |
End bp | 457112 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640593981 |
Product | NHL repeat-containing protein |
Protein accession | YP_001274737 |
Protein GI | 148654532 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
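The coordinate and length fields above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions on the replicon (an assumption; the record does not state its coordinate convention) and the final codon is a stop codon. A minimal sketch of that arithmetic, with illustrative function names that are not part of any database API:

```python
# Values copied from the record above; function names are hypothetical.
START_BP = 454020
END_BP = 457112


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 3093 1030
```

Both results match the listed Gene Length (3093 bp) and Protein Length (1030 aa).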
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.64892 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0641589 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAA ACAGCCTGCG CGTCATCACA CTCCTCGCTA TCTCAATCCT GCTACTGAGC GGGCTGGCGA CGGCCCAGGG GCCGCAACCA CCGCTCCCCT CGTATCCACC CCAGGGACTG GGCCTGCCCG CAGAAGCCCG GCTCGGGACG CCGGAGTTCG GCAAGCCCTT CATTCGCCCA GGCGTTCCGA GCATTGGCAT CCAGAGCGCC GGGACCGCCT CCATCCCCCT CGGCCAGCCG GGGCTGAGCT TCCGCTACGT GCAGACGTTC GGGGTGACGG AGACTCCCTA CATCTCAACC ACTACCCACT TGAACTACCC TTACGGCATT GGGGTGGAGG GCAACAGCAT CTGGATTGGC GAAATGTGGG GAAACCGTTT CCTCAAATAT GCCAGCGATG GCAATTTTCA ACAATCCTTT GGCCATGCTG GCTTTGCTGA GGATTACACC GATACATCGT TTTGGGAAAT TGCTGACGTG GCCACTGATA GCGACGGTAA CATCTGGGTA GTGGACGCGG CCTCCAGCCG GGTGGTCAAA TTGAACTCCT CTGGCAAAGC ACTCCTAACA CTTGGAAAGC GGTGGGAAAG TGGAAGCGAC AATAACCGAT TTGCATATCC CATCAGCGTT GCCTTTGACG CCAGCGGAAA CATCTACGTG AGCGATGGCG CTCCCTGGTG GAACCGAGAG GGTGGGAACC ACCGCATCCA GGTTTTCCGG AGTGATGGCA CCTATCTCGC TACGCTCGGG CAGACCGGCG TGTGTGGCTC TGCCAACAAC CAATTCTGTG GGCCGCGCCA TATCGCCATC TACGGCAACG AACTCTATGT GCCCGATGCC AACAACAACC GCGTGCAAAT CTTCAATATC TCCAATCCTG CGTCACCATC CTATGTGGCA ACGATCGGGG GGCTGAACAA CCCTTCCGGC GTGGCGGTAG ACGATAACTT CATTTACATA GCTGATACCT GGAATAACCG CATACAGACA TACACACGCA TAGATCGGGT GTATATCGGC ACTATTGGAG GTGAATGGGG AAGTGGAAAC AACCAGTTCC GAAATCCAAC CGATGTTGTC GCCATGACGA TTGGGACGTA TCCTAACGCG GAGCTTCATC TCTTCGTTGC AGACTTTGTC AACACTCGCG TACAGCAATT CAAGATTACC AGCATTTCCC CCTTCGCTTT CCAGTATGTG CGCACTTATG GAACAACTGG CGTGCCCTAT GTCACTGACG GTTATCATTA TAACACTCCG TCAAGTGTTG CCGTGGCCCC TGATGGGAGC ATCTATCTGA CTGAGGATAA AGGCCATCGG TTGGTCAAAC TTCGCCCTGA CGGTACACCC ATCTGGATTG TAGGTGCGGC TGGAGTGAAA GGCGATTGGG ATGCCAGCAA CGATCGCTTG AATAATCCAG ACGACCTTGC GCTTGATGCG AATGGGCGGG TCTATGTTGC TGACCGCTGG CATGGCCGGG TGCAGATTTA CAACCCAGAC GGCTCATACT ACACCACGGT GAGCGGCTTG GACTGCCCGG GCGGGGTGGC CATTGGCCCC AACGGATACC TCTATGTGGC CGATACTTGC AATCACACGG TTAAAATCTA CAACACCAAC CTGGTATTGG TAGCAACTCT GGGGACACCA GGCGAATCAG GCACAGATAA TGCCCACTTC AACTCGCCGG AAGATGTTGC CGTGGATAGC AATGGCACCA TTTATGTATC CGATGGAGGC AACCACCGTA TCCAGGTCTT CAACGCCAAT CGTCAGTACG TGCGCACAAT GGGGGAGACT 
GGCATCTGGG GCAGCGATTT CGCCCACTTC AACGGGCCGA ACAACCTGTT TGTAGACAGT GCCAATCGTC TTTACGTAGG CGACGAGTGG AATCACCGCA TTCAAGTCTT TGACGCGAAT GGCGCCTATT TGACCACAAT CGGGGGAAGT GCAGGCCCCC GAACAGGGCA GTTCCGCGGT GCGCGGGGTG TGGCGGTGGA CAACGCTGGC AACATCTACG TGGCAGACAG GCTCAACCAC CGCATCCAGA AATTCGCCCC CGGCGTGCCG GGCTGGAAGC AGGTGAACAT CAACGGGTTT GGAAATCGAG ATACCACGTT TGTCAGTACG CTGGATGTCT TCGGGGGTTA TCTGTATGCC GGCACCTGGT CCAGCCAGAT GTGGCGCACT GCGGACGGAC AAACCTGGAG TCAGGTTGCC CCCTCTACAT GGCCTACTGA TACGGCTGTG TTTGATGCTG AGCCCTTTGG CTCCTATCTG TACGTAGGCA CGGCGTCCAA TAACGGCGGC GAGATCTGGC GAACCAACGG GATCACCTGG GAACAGGTGA TTACCAGCGG TTTTGGGATT ACCAATAACT ACGGCATCAA CACGCTGGCA GTATTCTCAA ATGCCATTTA CGCCGCAACC AGCGCTGAAG ATGGAGTGAT GCAGATTTAC CGCAGCGCCA GCGGCGATGC CGGGAGTTGG TCACCTGTCG TGACCGACGG CTTCGGTGGC GGCGGCGTGT GGCAAGATGT GACAATGGAT GTGTACGGCG GTTATCTCTA TTTGGGAATT GGCCGGGCTG GCGTGGCAGA ACTCTGGCGC ACGAACGATG GAGTTACCTG GTCGCCGGTT TTCACCGACG GTCTGGCCGC AAATAACACC CATGTCTCTG CAATGGCCGA GTTCAATGGC GCTTTCTACA TTGGCCTGCG CAATGTCACC ACAGGCGGCG AAGTCTGGCG GACCACCGAT GGCACAACCT TTACCCGTGT CTTTGATGGC GGACTGGGAG ATCCCAACAA CGGTCGTCCC TATGGGCTTC AGGTGTTCAA CGGCTACTTG TATCTGGTCT TCAGCAATCT GGTCACAGGG GCCGAAGTCT GGCGTACATC GGATGGAATG ACCTGGGAGC AGGTTGGCAA TGCCGGCTGG GGCGACAGCA ACAATGGCTA CGCCGACTAC TTTGACAAAG GCGCGGCCGT TTTCAACAAC CGCCTCTACA TCGGCACGAC CAACGATGCC AACGGCGGGG AGGTGTGGCT GTTCCTGCAC AATCGAGTCT ACTTGCCGTT AATCCGGCGT TAA
|
Protein sequence | MKANSLRVIT LLAISILLLS GLATAQGPQP PLPSYPPQGL GLPAEARLGT PEFGKPFIRP GVPSIGIQSA GTASIPLGQP GLSFRYVQTF GVTETPYIST TTHLNYPYGI GVEGNSIWIG EMWGNRFLKY ASDGNFQQSF GHAGFAEDYT DTSFWEIADV ATDSDGNIWV VDAASSRVVK LNSSGKALLT LGKRWESGSD NNRFAYPISV AFDASGNIYV SDGAPWWNRE GGNHRIQVFR SDGTYLATLG QTGVCGSANN QFCGPRHIAI YGNELYVPDA NNNRVQIFNI SNPASPSYVA TIGGLNNPSG VAVDDNFIYI ADTWNNRIQT YTRIDRVYIG TIGGEWGSGN NQFRNPTDVV AMTIGTYPNA ELHLFVADFV NTRVQQFKIT SISPFAFQYV RTYGTTGVPY VTDGYHYNTP SSVAVAPDGS IYLTEDKGHR LVKLRPDGTP IWIVGAAGVK GDWDASNDRL NNPDDLALDA NGRVYVADRW HGRVQIYNPD GSYYTTVSGL DCPGGVAIGP NGYLYVADTC NHTVKIYNTN LVLVATLGTP GESGTDNAHF NSPEDVAVDS NGTIYVSDGG NHRIQVFNAN RQYVRTMGET GIWGSDFAHF NGPNNLFVDS ANRLYVGDEW NHRIQVFDAN GAYLTTIGGS AGPRTGQFRG ARGVAVDNAG NIYVADRLNH RIQKFAPGVP GWKQVNINGF GNRDTTFVST LDVFGGYLYA GTWSSQMWRT ADGQTWSQVA PSTWPTDTAV FDAEPFGSYL YVGTASNNGG EIWRTNGITW EQVITSGFGI TNNYGINTLA VFSNAIYAAT SAEDGVMQIY RSASGDAGSW SPVVTDGFGG GGVWQDVTMD VYGGYLYLGI GRAGVAELWR TNDGVTWSPV FTDGLAANNT HVSAMAEFNG AFYIGLRNVT TGGEVWRTTD GTTFTRVFDG GLGDPNNGRP YGLQVFNGYL YLVFSNLVTG AEVWRTSDGM TWEQVGNAGW GDSNNGYADY FDKGAAVFNN RLYIGTTNDA NGGEVWLFLH NRVYLPLIRR
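The relationship between the two sequences above can be checked by translating the gene sequence codon by codon. The sketch below is a minimal, dependency-free illustration rather than the method the database itself uses. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the tables differ in permitted start codons), and it assumes the gene sequence is already given in coding orientation, which is consistent with it starting with ATG despite the "-" strand. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    A character other than A, C, G, or T raises KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAAGCAA ACAGCCTGCG CGTCATCACA"))  # prints: MKANSLRVIT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` would be the natural way to compare against the complete 1030-residue entry.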