Gene RoseRS_0461 details


Gene Information

Locus tag: RoseRS_0461
Symbol:
ID: 5207397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 585367
End bp: 588327
Gene Length: 2961 bp
Protein Length: 986 aa
Translation table: 11
GC content: 67%
IMG OID: 640594081
Product: helicase domain-containing protein
Protein accession: YP_001274836
Protein GI: 148654631
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
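
The coordinate and length fields above agree with one another if the start and end positions are read as inclusive 1-based coordinates and the protein length excludes a single stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that check, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(585367, 588327)
print(length)                     # 2961
print(protein_length_aa(length))  # 986
```

The two printed values match the Gene Length and Protein Length fields of this record.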


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.838477
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000248811
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCCCCGGT TTGCAGTCGG TTCACTGGTA CGCTGTCGCG AACGCGAATG GGTGGTGCTC 
CCCTCGCAGG ACGATGACCT GCTGCTGTTG CGTCCGCTCG GCGGGCGCGC ATCCGGGGTA
TGCGGCGTCT TCCTGCCGCT CGAAGAGCAC GATGTGCGCC CGGCGTCGTT TGCGCCGCCC
GACCCGGCTG CGATGGGGGA TTTCGAGGGT GGGGCGCTGC TGCGCGATGC GGCGCGCCTG
AGCCTGCGCA GCGGCGCCGG TCCTTTCCGC TGTATGGGGC GGTTGGGGTT TCGTCCGCGT
CCCTATCAGA TTGTGCCGCT GTTGATGGCG CTGCGTCTCG ACCCGGTGCG CCTGTTGATC
GCCGATGATG TGGGGATCGG GAAGACGATT GAAGCGGCGC TGATCGCCCG TGAACTGTTC
GACCGCGGTG AGATTGCGCG GATGGCGGTG ATCTGCCCGC CCCATCTGTG CGATCAGTGG
CAGCGTGAGT TGCAGCATAA GTTTGCGCTC GAAGCCACGG TGGTGCGCGC CAGTACCGCC
GCTGCGCTTG AACGTCGCCT GCCGCGCGGC GATGTGAGCA TCTGGGAACA TTTCCCGTTC
ACGATTGTCA GTATCGATTA TGTCAAAAAT GAGCGCCGCC GGGATAGTTT CCTGCGCGCC
GCGCCCGAAC TGATCATTGT CGATGAGGCG CACGCCGCCA TCGGCATGGG GGACGGCGCG
CAACAGCAAC GCTATGCGCT GGTGCGCGAT CTTGCCGCGC GCTCCGACCG GCATCTGATC
CTGTTGACGG CGACGCCGCA CTCCGGTGTG GAGCACGCCT TTGCCCGCCT GATCGGGTTG
ATCGATCCGG AGTTCGCTGC GTTCGACCTC GACCAGTTGC GCGATGCGGA ACGCGACCGG
CTGGCGCGCC ACTTTGTGCA GCGCCGCCGC GCCGATGTGC AACGCTGGCT CGGCAGCAAT
GATGTGACTC CCTTCCCGCA ACGCGAGTCG ACCGAGGTGA CGTATGCTCT GCCGCGCGCT
TCGGCGTACC GCGCGCTGTT CGATGATGTG TTCGCGTTTA CCCGCGAACA GGTGCGCGCC
GACCATGCAT CACCGGCTGC CGGACGCGCT TCCGATGCGC AATTGCGCTG GCGACGCCGG
GCGCGCTATT GGGCGGCGCT CGCCTTGCTG CGGTGCGTGA TGAGCAGTCC GGCTGCCGCC
GAACGCGCAT TGCGCCTGCG CGCTGCGGAT GTCGCTGCGC CGCTGCGCCT GACCGATGCC
GACGACGAGC GGTTCGACGC CGATCTCATC CGCCCCTTCG TGATCGATCC GACCGACCAG
GAGCAGGCGC ACGACCTGGA GCCAGCACAG ACGGCAGACC TGCTCGCCGA AACGGATTCC
AGTCCATCCC GCAGCGAGCG CGCCCGCCTG GAACGCTTCG CGGCGCGCGC TGCCGCCCTG
CGCGGCGCTG AAGACCCCAA ACTCCAGCGG TTGATCCCGC TGATCCAGAC GCTGATCGAT
GATGGGTTCA ATCCGATCAT CTACTGCCGC TATATCGCCA CTGCCGATTA TGTTGCGGCG
GAACTGGCGC GGCAGTTCGA GCGCATTCCC GCTGTGCGGG TGATGTCGGT CACCGGCGAA
CGCTCCGAGG AGGAGCGCGA CATGATGATT GCCGAACTGG AACGCAGTCC CCGGCGCATC
CTGGTGGCGA CCGATTGCCT CAGCGAGGGG ATCAATCTGC AACATAGTTT CGATGCGGTG
GTGCATTACG ACCTGCCGTG GAACCCCAAC CGCCTGGAAC AGCGCGAGGG GCGGGTGGAC
CGCTATGGGC AGCGCAGCGC CGTTGTGCGC ACGGTGTTGA TCTACGGGCA GGACAACCCC
ATGGACGAGG CGGTGATGAA GGTGTTGCTG CGCAAGGCGG TGCGCATTCA TAAGACGCTC
GGCATCAGCG TGCCGTTGCC GGTGGATAGC GGCACGGTCG TTGAGGCGCT GATCGCGGCA
CTGTTCCAAC CGGCAGTCGA TCAACTCACC CTCTTCGATA GTGACCGGCA GGCGGCGCTG
CTCGACGAAT CGCAGGAGTT GCAGCGGATC GAACTGGCGT GGGATCGGGC GACGCGCCGG
GAGCAGGAGA GCCGCACCCG TTTTGCGCAG CGGCGCATCA AGCCGGAAGA GGTGGCGCGT
GAACTGGAAG AGAGCGATGC CGCGCTTGGC GATCCGGCGG CTGTCGAACG TTTCGTCATT
GCGGCATGCG CCCGCCTGAA TGCGGCGCTG ACGCCGGTGA GCGGCATTCG TGGTGCGACG
GCGCCGGTGT TCGCTGTGCC GCTGGCGCGC CTGCCCGCCC CGGTGCGTGA ACCGGTTGCG
CACCTGGCTG ACGCCGACCG GAACCTGCTG GTGACCTTTA CCGAACCGGC GCCCGCAGGG
GTCGAGGCGC TCGAACGCAA CCATCCGCTG GTCGTGGCGC TCACCGATTA TCTGCTCGAA
ACTGCGCTGA TCCCCGCCGG GATGATGCCT GCCGATGCCA TGCCGGTTGC CGCTCGATCA
GGGGTGATCC GCACCCGCGC AGTGACGAAA CGCACCTGCC TGTTGCTGCT GCGGGTGCGG
ATGCTGATCG AGCATGGGCA TCAGCGCGAT ACGCAGCCGC TCCTTGCCGA AGAACTGATC
GTGACCGGCT TTCGCGGCAG CCCCGCCAGC CTGACCTGGC TCGATCAGGC GGAGGCGCTG
GCGTTGCTCG AAACAGCGCA ACCCGCCGAA AATCTGCCCC GCGATGCGCG CCTGATCGCG
CTGCGCGCCG TCCTCGACGC ATTGCCCGCG CTCGACGCCG CGCTCGACCA GATCAGCCGC
GAACGCGCGC TGCGCCTGCG TGAATCTCAT ATGCGCGTCT GGTCGCAGAT CGGCGGGACG
CAACGACGCT GTGCCTGTAC GCCTGCCGGA CCGCCCGACA TCCTGGGGGT GTATGTGCTG
TTGCCGGTGC TGGCGACCTG A
 
Protein sequence
MPRFAVGSLV RCREREWVVL PSQDDDLLLL RPLGGRASGV CGVFLPLEEH DVRPASFAPP 
DPAAMGDFEG GALLRDAARL SLRSGAGPFR CMGRLGFRPR PYQIVPLLMA LRLDPVRLLI
ADDVGIGKTI EAALIARELF DRGEIARMAV ICPPHLCDQW QRELQHKFAL EATVVRASTA
AALERRLPRG DVSIWEHFPF TIVSIDYVKN ERRRDSFLRA APELIIVDEA HAAIGMGDGA
QQQRYALVRD LAARSDRHLI LLTATPHSGV EHAFARLIGL IDPEFAAFDL DQLRDAERDR
LARHFVQRRR ADVQRWLGSN DVTPFPQRES TEVTYALPRA SAYRALFDDV FAFTREQVRA
DHASPAAGRA SDAQLRWRRR ARYWAALALL RCVMSSPAAA ERALRLRAAD VAAPLRLTDA
DDERFDADLI RPFVIDPTDQ EQAHDLEPAQ TADLLAETDS SPSRSERARL ERFAARAAAL
RGAEDPKLQR LIPLIQTLID DGFNPIIYCR YIATADYVAA ELARQFERIP AVRVMSVTGE
RSEEERDMMI AELERSPRRI LVATDCLSEG INLQHSFDAV VHYDLPWNPN RLEQREGRVD
RYGQRSAVVR TVLIYGQDNP MDEAVMKVLL RKAVRIHKTL GISVPLPVDS GTVVEALIAA
LFQPAVDQLT LFDSDRQAAL LDESQELQRI ELAWDRATRR EQESRTRFAQ RRIKPEEVAR
ELEESDAALG DPAAVERFVI AACARLNAAL TPVSGIRGAT APVFAVPLAR LPAPVREPVA
HLADADRNLL VTFTEPAPAG VEALERNHPL VVALTDYLLE TALIPAGMMP ADAMPVAARS
GVIRTRAVTK RTCLLLLRVR MLIEHGHQRD TQPLLAEELI VTGFRGSPAS LTWLDQAEAL
ALLETAQPAE NLPRDARLIA LRAVLDALPA LDAALDQISR ERALRLRESH MRVWSQIGGT
QRRCACTPAG PPDILGVYVL LPVLAT
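
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is illustrative only. It assumes a deliberately partial codon table, holding just the six codons at the start of this gene (these assignments are the standard ones, which table 11 shares), and uses hypothetical helper names. It strips the 10-base block spacing used in this record, translates the first 18 bases, and includes a helper for computing a GC fraction.

```python
# Partial codon table: only the codons needed for this illustration,
# NOT the full translation table 11.
CODONS = {
    "ATG": "M", "CCC": "P", "CGG": "R",
    "TTT": "F", "GCA": "A", "GTC": "V",
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the block layout; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons using the partial table above."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 18 bases of the gene sequence, copied with their block spacing.
prefix = "ATGCCCCGGT TTGCAGTC"
print(translate(prefix))  # MPRFAV
```

The output matches the first six residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field.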