Gene RoseRS_0355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0355 
Symbol 
ID5207290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp454020 
End bp457112 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content56% 
IMG OID640593981 
ProductNHL repeat-containing protein 
Protein accessionYP_001274737 
Protein GI148654532 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.64892 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0641589 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAA ACAGCCTGCG CGTCATCACA CTCCTCGCTA TCTCAATCCT GCTACTGAGC 
GGGCTGGCGA CGGCCCAGGG GCCGCAACCA CCGCTCCCCT CGTATCCACC CCAGGGACTG
GGCCTGCCCG CAGAAGCCCG GCTCGGGACG CCGGAGTTCG GCAAGCCCTT CATTCGCCCA
GGCGTTCCGA GCATTGGCAT CCAGAGCGCC GGGACCGCCT CCATCCCCCT CGGCCAGCCG
GGGCTGAGCT TCCGCTACGT GCAGACGTTC GGGGTGACGG AGACTCCCTA CATCTCAACC
ACTACCCACT TGAACTACCC TTACGGCATT GGGGTGGAGG GCAACAGCAT CTGGATTGGC
GAAATGTGGG GAAACCGTTT CCTCAAATAT GCCAGCGATG GCAATTTTCA ACAATCCTTT
GGCCATGCTG GCTTTGCTGA GGATTACACC GATACATCGT TTTGGGAAAT TGCTGACGTG
GCCACTGATA GCGACGGTAA CATCTGGGTA GTGGACGCGG CCTCCAGCCG GGTGGTCAAA
TTGAACTCCT CTGGCAAAGC ACTCCTAACA CTTGGAAAGC GGTGGGAAAG TGGAAGCGAC
AATAACCGAT TTGCATATCC CATCAGCGTT GCCTTTGACG CCAGCGGAAA CATCTACGTG
AGCGATGGCG CTCCCTGGTG GAACCGAGAG GGTGGGAACC ACCGCATCCA GGTTTTCCGG
AGTGATGGCA CCTATCTCGC TACGCTCGGG CAGACCGGCG TGTGTGGCTC TGCCAACAAC
CAATTCTGTG GGCCGCGCCA TATCGCCATC TACGGCAACG AACTCTATGT GCCCGATGCC
AACAACAACC GCGTGCAAAT CTTCAATATC TCCAATCCTG CGTCACCATC CTATGTGGCA
ACGATCGGGG GGCTGAACAA CCCTTCCGGC GTGGCGGTAG ACGATAACTT CATTTACATA
GCTGATACCT GGAATAACCG CATACAGACA TACACACGCA TAGATCGGGT GTATATCGGC
ACTATTGGAG GTGAATGGGG AAGTGGAAAC AACCAGTTCC GAAATCCAAC CGATGTTGTC
GCCATGACGA TTGGGACGTA TCCTAACGCG GAGCTTCATC TCTTCGTTGC AGACTTTGTC
AACACTCGCG TACAGCAATT CAAGATTACC AGCATTTCCC CCTTCGCTTT CCAGTATGTG
CGCACTTATG GAACAACTGG CGTGCCCTAT GTCACTGACG GTTATCATTA TAACACTCCG
TCAAGTGTTG CCGTGGCCCC TGATGGGAGC ATCTATCTGA CTGAGGATAA AGGCCATCGG
TTGGTCAAAC TTCGCCCTGA CGGTACACCC ATCTGGATTG TAGGTGCGGC TGGAGTGAAA
GGCGATTGGG ATGCCAGCAA CGATCGCTTG AATAATCCAG ACGACCTTGC GCTTGATGCG
AATGGGCGGG TCTATGTTGC TGACCGCTGG CATGGCCGGG TGCAGATTTA CAACCCAGAC
GGCTCATACT ACACCACGGT GAGCGGCTTG GACTGCCCGG GCGGGGTGGC CATTGGCCCC
AACGGATACC TCTATGTGGC CGATACTTGC AATCACACGG TTAAAATCTA CAACACCAAC
CTGGTATTGG TAGCAACTCT GGGGACACCA GGCGAATCAG GCACAGATAA TGCCCACTTC
AACTCGCCGG AAGATGTTGC CGTGGATAGC AATGGCACCA TTTATGTATC CGATGGAGGC
AACCACCGTA TCCAGGTCTT CAACGCCAAT CGTCAGTACG TGCGCACAAT GGGGGAGACT
GGCATCTGGG GCAGCGATTT CGCCCACTTC AACGGGCCGA ACAACCTGTT TGTAGACAGT
GCCAATCGTC TTTACGTAGG CGACGAGTGG AATCACCGCA TTCAAGTCTT TGACGCGAAT
GGCGCCTATT TGACCACAAT CGGGGGAAGT GCAGGCCCCC GAACAGGGCA GTTCCGCGGT
GCGCGGGGTG TGGCGGTGGA CAACGCTGGC AACATCTACG TGGCAGACAG GCTCAACCAC
CGCATCCAGA AATTCGCCCC CGGCGTGCCG GGCTGGAAGC AGGTGAACAT CAACGGGTTT
GGAAATCGAG ATACCACGTT TGTCAGTACG CTGGATGTCT TCGGGGGTTA TCTGTATGCC
GGCACCTGGT CCAGCCAGAT GTGGCGCACT GCGGACGGAC AAACCTGGAG TCAGGTTGCC
CCCTCTACAT GGCCTACTGA TACGGCTGTG TTTGATGCTG AGCCCTTTGG CTCCTATCTG
TACGTAGGCA CGGCGTCCAA TAACGGCGGC GAGATCTGGC GAACCAACGG GATCACCTGG
GAACAGGTGA TTACCAGCGG TTTTGGGATT ACCAATAACT ACGGCATCAA CACGCTGGCA
GTATTCTCAA ATGCCATTTA CGCCGCAACC AGCGCTGAAG ATGGAGTGAT GCAGATTTAC
CGCAGCGCCA GCGGCGATGC CGGGAGTTGG TCACCTGTCG TGACCGACGG CTTCGGTGGC
GGCGGCGTGT GGCAAGATGT GACAATGGAT GTGTACGGCG GTTATCTCTA TTTGGGAATT
GGCCGGGCTG GCGTGGCAGA ACTCTGGCGC ACGAACGATG GAGTTACCTG GTCGCCGGTT
TTCACCGACG GTCTGGCCGC AAATAACACC CATGTCTCTG CAATGGCCGA GTTCAATGGC
GCTTTCTACA TTGGCCTGCG CAATGTCACC ACAGGCGGCG AAGTCTGGCG GACCACCGAT
GGCACAACCT TTACCCGTGT CTTTGATGGC GGACTGGGAG ATCCCAACAA CGGTCGTCCC
TATGGGCTTC AGGTGTTCAA CGGCTACTTG TATCTGGTCT TCAGCAATCT GGTCACAGGG
GCCGAAGTCT GGCGTACATC GGATGGAATG ACCTGGGAGC AGGTTGGCAA TGCCGGCTGG
GGCGACAGCA ACAATGGCTA CGCCGACTAC TTTGACAAAG GCGCGGCCGT TTTCAACAAC
CGCCTCTACA TCGGCACGAC CAACGATGCC AACGGCGGGG AGGTGTGGCT GTTCCTGCAC
AATCGAGTCT ACTTGCCGTT AATCCGGCGT TAA
 
Protein sequence
MKANSLRVIT LLAISILLLS GLATAQGPQP PLPSYPPQGL GLPAEARLGT PEFGKPFIRP 
GVPSIGIQSA GTASIPLGQP GLSFRYVQTF GVTETPYIST TTHLNYPYGI GVEGNSIWIG
EMWGNRFLKY ASDGNFQQSF GHAGFAEDYT DTSFWEIADV ATDSDGNIWV VDAASSRVVK
LNSSGKALLT LGKRWESGSD NNRFAYPISV AFDASGNIYV SDGAPWWNRE GGNHRIQVFR
SDGTYLATLG QTGVCGSANN QFCGPRHIAI YGNELYVPDA NNNRVQIFNI SNPASPSYVA
TIGGLNNPSG VAVDDNFIYI ADTWNNRIQT YTRIDRVYIG TIGGEWGSGN NQFRNPTDVV
AMTIGTYPNA ELHLFVADFV NTRVQQFKIT SISPFAFQYV RTYGTTGVPY VTDGYHYNTP
SSVAVAPDGS IYLTEDKGHR LVKLRPDGTP IWIVGAAGVK GDWDASNDRL NNPDDLALDA
NGRVYVADRW HGRVQIYNPD GSYYTTVSGL DCPGGVAIGP NGYLYVADTC NHTVKIYNTN
LVLVATLGTP GESGTDNAHF NSPEDVAVDS NGTIYVSDGG NHRIQVFNAN RQYVRTMGET
GIWGSDFAHF NGPNNLFVDS ANRLYVGDEW NHRIQVFDAN GAYLTTIGGS AGPRTGQFRG
ARGVAVDNAG NIYVADRLNH RIQKFAPGVP GWKQVNINGF GNRDTTFVST LDVFGGYLYA
GTWSSQMWRT ADGQTWSQVA PSTWPTDTAV FDAEPFGSYL YVGTASNNGG EIWRTNGITW
EQVITSGFGI TNNYGINTLA VFSNAIYAAT SAEDGVMQIY RSASGDAGSW SPVVTDGFGG
GGVWQDVTMD VYGGYLYLGI GRAGVAELWR TNDGVTWSPV FTDGLAANNT HVSAMAEFNG
AFYIGLRNVT TGGEVWRTTD GTTFTRVFDG GLGDPNNGRP YGLQVFNGYL YLVFSNLVTG
AEVWRTSDGM TWEQVGNAGW GDSNNGYADY FDKGAAVFNN RLYIGTTNDA NGGEVWLFLH
NRVYLPLIRR