Gene RoseRS_4345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4345 
Symbol 
ID5211329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5459569 
End bp5462856 
Gene Length3288 bp 
Protein Length1095 aa 
Translation table11 
GC content63% 
IMG OID640597928 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001278632 
Protein GI148658427 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00663384 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTAAGTT CACGAGTTAC GGGCGAGTTC GGCGCTACTG TGTCTCTCGC ATCTACGGTA 
CTGCTTCCGC GTCTGGAGCC GCCGCCGCAG CCGGCCCGGC TCATCGAGCG TCCACGGATC
GATGGGTTGC TTGCTGCGGT CGCCGATTAT CCGGTGACGC TGGTGACCGC GCCGGCAGGC
GGCGGTAAGA CGGTTGCGCT CACCGGTTTT GCGCGTCACG GCGGCTGGCC TGTTGCCTGG
TGTCGTCTTG ATGCTGCGGA TACGCCGATC TCACTGGCGC TCCATCTGGC GACGGCGTTT
CGCCCGATTA CCGGTTTTGA TCCCTCCTGT TTCGTCACCG TGCACCCGGT CGATGTGCTG
GACCGGTTGA TCAATGCGTT GACCGCGCTC GGTGATGAGA CGCTCCTGGT GCTCGACGAC
GTCCACTGCG CCGACAGGCG TCCCGAACTG CGCGTCCTGA TCGAGCATCT GATTGATCGC
CTGCCGCCAC GTCTCCATCT GGTGCTGGCG AGTCGCGAGA TGCCTTCACT GGCGCCACTG
CCGACAGTGG CAGCGCGCGG TGAACTCTAC CGCCTGAACC GGGCGCAACT GGCGTTCACC
AATGAGGAGG CGCGTGAATT CTTCGCCGCC TGCGGGTTGC CGCCGAGTCC TTATGATGAT
GAACTCAATA CACTGGCGCG TGGTTGGCCC CTGGCGCTCC GTTTCTTTGC AACCGCTCGT
GTTGATTTGG GCGCCACGCA GCACGAACCG ATCCTGCCAG AACGGTTGCT GGAGCACATT
GCACCCCAAC TCGATGCGTA TCTGGCGCGT GAAGTCCTGG GCGACCTGCC GTTCGATCTG
CGCACCTGGT TGCTGGGTAC AGCGTTGATG CGCTGGATCG ATGAATCGGC ATGCGCTGCC
GTCGCTGAAC TGGCCGATCT CCATATCGAT CTCAAGATGA TCGAACGACA TGAGTTGTTC
ATTGAGACTC TGCCCGACGG TCGGTCGGTC TACCAGCCGT TACAGGCTGC CAGTTTTGCG
CGCCTGGCGG AACGTGAATT GCCGAACTGG CGGAGCATCC ATGCGCAGTT GGGTCAGTAC
TATGCAACGA AGGGCGATGA TCACGGCGCG GCGCACCATT TTCTTGCTGC GGCGCAATGG
GAGGAAGCGT CGGCGGCATT GAGTCGTATG GCGCTCGCCG GCGTTTCTGG GGCGCAGGCG
GTTGCGCTGC TGTCGTGGAT CGAACAGATC CCGACCGACC ACCGGAACAA TGCTGCGCTG
CTCGAGGCGC GGGCAATCGC CGAACGTCGC CTCGGTCGCT ACGCGCAGGC GCTGGAGTCG
TACCGCCAGG CGGAGGATCG CTACCGTGCG CAGGGAGATA TGGATGGTCA GGTGCGGGCA
TTGCGCGGTC AGGCGGAGGT GTACATCGAT ACGGTGCAGC CAGCGCCAGC AGCGGTGCTG
TTGAAGCGCG CGATGAAACT CTTGCCGCGC CATCGTCGCG CTGAACGGGC GACTATTCTG
AGTCTGCAAG CTGAGAACTG GATCAACCGG GGCCGTGCTG ATGTGGCAAT TTTCATCATT
GCTGCCGCCC ACCGCGAAGC GTATGGGAGA ACGTCGCGCG GCGATCCGGG CGGTGGCGGT
CTCCGCCGCT CTGCCGTTCT GTCGCCCCGC CTGCTGCTGC GGAGCGGCCG GCTTGTCGAA
GCGCGACGTT TGCTCGAAGA GGAACTCGGT CTCGATACGG GCAGGGCGCG CGGTGAGCAT
ATGCTGCACC GCGATCCGCT TCTGCTCCTG GCGTTGATCG AGTGCATGCT GGGCAATGGG
GTGCGCGCGC TGGCGCTTGC ACAGCGCGGC TTGCTCGAAG CACAACGCGG CGACTCGCCG
CTGACCGAGG CTATTGCTCA TATGCGCCTC GGTCATGCCT ATCTCGTGAC GGCGTCGAGC
GACGAGATGG CGTATCGTCA CTACCGCATT GCGCTCGATA TGATCGAGTC GAGCGGTATT
CCCCGCATGC GCGCCGAGGT GATGATGGGT CTGACGCTGC TTGAAGGGCA TGCGGGCAAC
CTTGCCGCCG CCGAAGCGTA TGCCCGTGAG GGTCTTGACC GCACGCTGGA GGCGGGTGAC
GACTGGATGG CGGCGCTGAT CTGGCTTGCG CTCGGAAGCG TTGCGGCCAC TGCCGGCGAT
CCGCGCGCGG CGCAATGGCT CCACGAAGCG CAGCAACGCT TTGTGCGGGG TGATGATCAG
TATGGGCAGA CGGTCGCGCT GATCTGGGAA GCGCACATCC TGCTGCAATC CGGGAAGGAA
GCCGAAGCCG ATCGCACTCT GGCGCGCCTG CTTGGATTGA TCGATGCGTG TGGCTTCGAT
GGCGTCTTGA CAACGCGCAC GCTCTTCGGT CCGCACGATC TTGCAGTTCT GGTTCCGCTT
CTGCTCCGCG GTCGGGCGCT GCGCGGCGCT GCATCGGCGC AGGCAGCAGT GGCGCAGCGT
CTGTTGCGCC AGGGTTTCCC CTCGATTGCC GCCGATGACG CGGTTGACAC CTATCATCCC
GGCTATACAC TGCGGGTCTA CATGCTGGGT CGTTTCCGCA TCTTCCGCGG CTCGCACGAG
ATACAGGCAC GCGAGTGGCA GCGTGAGAAG GCGCGCCAGT TGCTCCAGTT GTTGCTGACG
TATCGCGGCA TGTGGTTGCA GCGTGAGCAG ATCTGCGCCT GGCTCTGGCC CGACAGCGAA
CCGTCGGCTG CCGAGCGCCA GTTTAAGGTG ACGCTCAATG CGCTCAATAA TGTGCTGGAG
CCGCGTCGTC CGCCACGGGT GGCGCCGTTC TTTATTCGAC GGCAGGGGCT GGCGTACAGT
TTTGCACCCT CGTATGGCTG CTGGATCGAT GTGGATGAGT TTGAACTGCG CACTGCCGGC
GCACCGGGAC GCGATCCCGA AGTGGAGATC CGCAGTCGCC GCACGGCATT CCAGTTGTAC
CGCGGCGATT ATCTCGCCGA AGCGCTATAC GATCCCTGGA CGCTTGAAGA GCGTGAGCGT
CTCCTGGCGC GTCATCTGGC ATCGACGGCG ACACTTGCCC GGTTGTTGAT CGATCGCGAA
GAGTTCGATG AAGCGATCGA TCTGTGCGAG CACATCATCC GCCGCGACCG TGGCTATGAG
GAAGCGTACC AGATGCTGAT GCGCGCCTAT GCGCGCTCTG GCAGTCGCTC CCAGGCGCTG
CGCGCCTATG CGCGCTGTGT CCAGGCAATG CAGGACGAAC TGGGGATGGA GCCGCTGCCA
GAGACGACCG AACTCTGTGA GCGGATCAAG CGGAATGAGC CGGTGTGA
 
Protein sequence
MLSSRVTGEF GATVSLASTV LLPRLEPPPQ PARLIERPRI DGLLAAVADY PVTLVTAPAG 
GGKTVALTGF ARHGGWPVAW CRLDAADTPI SLALHLATAF RPITGFDPSC FVTVHPVDVL
DRLINALTAL GDETLLVLDD VHCADRRPEL RVLIEHLIDR LPPRLHLVLA SREMPSLAPL
PTVAARGELY RLNRAQLAFT NEEAREFFAA CGLPPSPYDD ELNTLARGWP LALRFFATAR
VDLGATQHEP ILPERLLEHI APQLDAYLAR EVLGDLPFDL RTWLLGTALM RWIDESACAA
VAELADLHID LKMIERHELF IETLPDGRSV YQPLQAASFA RLAERELPNW RSIHAQLGQY
YATKGDDHGA AHHFLAAAQW EEASAALSRM ALAGVSGAQA VALLSWIEQI PTDHRNNAAL
LEARAIAERR LGRYAQALES YRQAEDRYRA QGDMDGQVRA LRGQAEVYID TVQPAPAAVL
LKRAMKLLPR HRRAERATIL SLQAENWINR GRADVAIFII AAAHREAYGR TSRGDPGGGG
LRRSAVLSPR LLLRSGRLVE ARRLLEEELG LDTGRARGEH MLHRDPLLLL ALIECMLGNG
VRALALAQRG LLEAQRGDSP LTEAIAHMRL GHAYLVTASS DEMAYRHYRI ALDMIESSGI
PRMRAEVMMG LTLLEGHAGN LAAAEAYARE GLDRTLEAGD DWMAALIWLA LGSVAATAGD
PRAAQWLHEA QQRFVRGDDQ YGQTVALIWE AHILLQSGKE AEADRTLARL LGLIDACGFD
GVLTTRTLFG PHDLAVLVPL LLRGRALRGA ASAQAAVAQR LLRQGFPSIA ADDAVDTYHP
GYTLRVYMLG RFRIFRGSHE IQAREWQREK ARQLLQLLLT YRGMWLQREQ ICAWLWPDSE
PSAAERQFKV TLNALNNVLE PRRPPRVAPF FIRRQGLAYS FAPSYGCWID VDEFELRTAG
APGRDPEVEI RSRRTAFQLY RGDYLAEALY DPWTLEERER LLARHLASTA TLARLLIDRE
EFDEAIDLCE HIIRRDRGYE EAYQMLMRAY ARSGSRSQAL RAYARCVQAM QDELGMEPLP
ETTELCERIK RNEPV