Gene Information |
Locus tag | RoseRS_4345 |
Symbol | |
ID | 5211329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 5459569 |
End bp | 5462856 |
Gene Length | 3288 bp |
Protein Length | 1095 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640597928 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001278632 |
Protein GI | 148658427 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00663384 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTAAGTT CACGAGTTAC GGGCGAGTTC GGCGCTACTG TGTCTCTCGC ATCTACGGTA CTGCTTCCGC GTCTGGAGCC GCCGCCGCAG CCGGCCCGGC TCATCGAGCG TCCACGGATC GATGGGTTGC TTGCTGCGGT CGCCGATTAT CCGGTGACGC TGGTGACCGC GCCGGCAGGC GGCGGTAAGA CGGTTGCGCT CACCGGTTTT GCGCGTCACG GCGGCTGGCC TGTTGCCTGG TGTCGTCTTG ATGCTGCGGA TACGCCGATC TCACTGGCGC TCCATCTGGC GACGGCGTTT CGCCCGATTA CCGGTTTTGA TCCCTCCTGT TTCGTCACCG TGCACCCGGT CGATGTGCTG GACCGGTTGA TCAATGCGTT GACCGCGCTC GGTGATGAGA CGCTCCTGGT GCTCGACGAC GTCCACTGCG CCGACAGGCG TCCCGAACTG CGCGTCCTGA TCGAGCATCT GATTGATCGC CTGCCGCCAC GTCTCCATCT GGTGCTGGCG AGTCGCGAGA TGCCTTCACT GGCGCCACTG CCGACAGTGG CAGCGCGCGG TGAACTCTAC CGCCTGAACC GGGCGCAACT GGCGTTCACC AATGAGGAGG CGCGTGAATT CTTCGCCGCC TGCGGGTTGC CGCCGAGTCC TTATGATGAT GAACTCAATA CACTGGCGCG TGGTTGGCCC CTGGCGCTCC GTTTCTTTGC AACCGCTCGT GTTGATTTGG GCGCCACGCA GCACGAACCG ATCCTGCCAG AACGGTTGCT GGAGCACATT GCACCCCAAC TCGATGCGTA TCTGGCGCGT GAAGTCCTGG GCGACCTGCC GTTCGATCTG CGCACCTGGT TGCTGGGTAC AGCGTTGATG CGCTGGATCG ATGAATCGGC ATGCGCTGCC GTCGCTGAAC TGGCCGATCT CCATATCGAT CTCAAGATGA TCGAACGACA TGAGTTGTTC ATTGAGACTC TGCCCGACGG TCGGTCGGTC TACCAGCCGT TACAGGCTGC CAGTTTTGCG CGCCTGGCGG AACGTGAATT GCCGAACTGG CGGAGCATCC ATGCGCAGTT GGGTCAGTAC TATGCAACGA AGGGCGATGA TCACGGCGCG GCGCACCATT TTCTTGCTGC GGCGCAATGG GAGGAAGCGT CGGCGGCATT GAGTCGTATG GCGCTCGCCG GCGTTTCTGG GGCGCAGGCG GTTGCGCTGC TGTCGTGGAT CGAACAGATC CCGACCGACC ACCGGAACAA TGCTGCGCTG CTCGAGGCGC GGGCAATCGC CGAACGTCGC CTCGGTCGCT ACGCGCAGGC GCTGGAGTCG TACCGCCAGG CGGAGGATCG CTACCGTGCG CAGGGAGATA TGGATGGTCA GGTGCGGGCA TTGCGCGGTC AGGCGGAGGT GTACATCGAT ACGGTGCAGC CAGCGCCAGC AGCGGTGCTG TTGAAGCGCG CGATGAAACT CTTGCCGCGC CATCGTCGCG CTGAACGGGC GACTATTCTG AGTCTGCAAG CTGAGAACTG GATCAACCGG GGCCGTGCTG ATGTGGCAAT TTTCATCATT GCTGCCGCCC ACCGCGAAGC GTATGGGAGA ACGTCGCGCG GCGATCCGGG CGGTGGCGGT CTCCGCCGCT CTGCCGTTCT GTCGCCCCGC CTGCTGCTGC GGAGCGGCCG GCTTGTCGAA GCGCGACGTT TGCTCGAAGA GGAACTCGGT CTCGATACGG GCAGGGCGCG CGGTGAGCAT ATGCTGCACC GCGATCCGCT TCTGCTCCTG GCGTTGATCG AGTGCATGCT GGGCAATGGG 
GTGCGCGCGC TGGCGCTTGC ACAGCGCGGC TTGCTCGAAG CACAACGCGG CGACTCGCCG CTGACCGAGG CTATTGCTCA TATGCGCCTC GGTCATGCCT ATCTCGTGAC GGCGTCGAGC GACGAGATGG CGTATCGTCA CTACCGCATT GCGCTCGATA TGATCGAGTC GAGCGGTATT CCCCGCATGC GCGCCGAGGT GATGATGGGT CTGACGCTGC TTGAAGGGCA TGCGGGCAAC CTTGCCGCCG CCGAAGCGTA TGCCCGTGAG GGTCTTGACC GCACGCTGGA GGCGGGTGAC GACTGGATGG CGGCGCTGAT CTGGCTTGCG CTCGGAAGCG TTGCGGCCAC TGCCGGCGAT CCGCGCGCGG CGCAATGGCT CCACGAAGCG CAGCAACGCT TTGTGCGGGG TGATGATCAG TATGGGCAGA CGGTCGCGCT GATCTGGGAA GCGCACATCC TGCTGCAATC CGGGAAGGAA GCCGAAGCCG ATCGCACTCT GGCGCGCCTG CTTGGATTGA TCGATGCGTG TGGCTTCGAT GGCGTCTTGA CAACGCGCAC GCTCTTCGGT CCGCACGATC TTGCAGTTCT GGTTCCGCTT CTGCTCCGCG GTCGGGCGCT GCGCGGCGCT GCATCGGCGC AGGCAGCAGT GGCGCAGCGT CTGTTGCGCC AGGGTTTCCC CTCGATTGCC GCCGATGACG CGGTTGACAC CTATCATCCC GGCTATACAC TGCGGGTCTA CATGCTGGGT CGTTTCCGCA TCTTCCGCGG CTCGCACGAG ATACAGGCAC GCGAGTGGCA GCGTGAGAAG GCGCGCCAGT TGCTCCAGTT GTTGCTGACG TATCGCGGCA TGTGGTTGCA GCGTGAGCAG ATCTGCGCCT GGCTCTGGCC CGACAGCGAA CCGTCGGCTG CCGAGCGCCA GTTTAAGGTG ACGCTCAATG CGCTCAATAA TGTGCTGGAG CCGCGTCGTC CGCCACGGGT GGCGCCGTTC TTTATTCGAC GGCAGGGGCT GGCGTACAGT TTTGCACCCT CGTATGGCTG CTGGATCGAT GTGGATGAGT TTGAACTGCG CACTGCCGGC GCACCGGGAC GCGATCCCGA AGTGGAGATC CGCAGTCGCC GCACGGCATT CCAGTTGTAC CGCGGCGATT ATCTCGCCGA AGCGCTATAC GATCCCTGGA CGCTTGAAGA GCGTGAGCGT CTCCTGGCGC GTCATCTGGC ATCGACGGCG ACACTTGCCC GGTTGTTGAT CGATCGCGAA GAGTTCGATG AAGCGATCGA TCTGTGCGAG CACATCATCC GCCGCGACCG TGGCTATGAG GAAGCGTACC AGATGCTGAT GCGCGCCTAT GCGCGCTCTG GCAGTCGCTC CCAGGCGCTG CGCGCCTATG CGCGCTGTGT CCAGGCAATG CAGGACGAAC TGGGGATGGA GCCGCTGCCA GAGACGACCG AACTCTGTGA GCGGATCAAG CGGAATGAGC CGGTGTGA
|
Protein sequence | MLSSRVTGEF GATVSLASTV LLPRLEPPPQ PARLIERPRI DGLLAAVADY PVTLVTAPAG GGKTVALTGF ARHGGWPVAW CRLDAADTPI SLALHLATAF RPITGFDPSC FVTVHPVDVL DRLINALTAL GDETLLVLDD VHCADRRPEL RVLIEHLIDR LPPRLHLVLA SREMPSLAPL PTVAARGELY RLNRAQLAFT NEEAREFFAA CGLPPSPYDD ELNTLARGWP LALRFFATAR VDLGATQHEP ILPERLLEHI APQLDAYLAR EVLGDLPFDL RTWLLGTALM RWIDESACAA VAELADLHID LKMIERHELF IETLPDGRSV YQPLQAASFA RLAERELPNW RSIHAQLGQY YATKGDDHGA AHHFLAAAQW EEASAALSRM ALAGVSGAQA VALLSWIEQI PTDHRNNAAL LEARAIAERR LGRYAQALES YRQAEDRYRA QGDMDGQVRA LRGQAEVYID TVQPAPAAVL LKRAMKLLPR HRRAERATIL SLQAENWINR GRADVAIFII AAAHREAYGR TSRGDPGGGG LRRSAVLSPR LLLRSGRLVE ARRLLEEELG LDTGRARGEH MLHRDPLLLL ALIECMLGNG VRALALAQRG LLEAQRGDSP LTEAIAHMRL GHAYLVTASS DEMAYRHYRI ALDMIESSGI PRMRAEVMMG LTLLEGHAGN LAAAEAYARE GLDRTLEAGD DWMAALIWLA LGSVAATAGD PRAAQWLHEA QQRFVRGDDQ YGQTVALIWE AHILLQSGKE AEADRTLARL LGLIDACGFD GVLTTRTLFG PHDLAVLVPL LLRGRALRGA ASAQAAVAQR LLRQGFPSIA ADDAVDTYHP GYTLRVYMLG RFRIFRGSHE IQAREWQREK ARQLLQLLLT YRGMWLQREQ ICAWLWPDSE PSAAERQFKV TLNALNNVLE PRRPPRVAPF FIRRQGLAYS FAPSYGCWID VDEFELRTAG APGRDPEVEI RSRRTAFQLY RGDYLAEALY DPWTLEERER LLARHLASTA TLARLLIDRE EFDEAIDLCE HIIRRDRGYE EAYQMLMRAY ARSGSRSQAL RAYARCVQAM QDELGMEPLP ETTELCERIK RNEPV
|
| |
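The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: the function names are hypothetical and not part of any database API. It assumes the start and end positions are 1-based and inclusive, which matches the reported values (5462856 - 5459569 + 1 = 3288), and that the CDS ends in a single untranslated stop codon. It also shows one way to strip the 10-character block spacing from the sequence fields before further processing.

```python
# Illustrative helpers; names are hypothetical, not from any database API.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


def normalize_sequence(blocks: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocks.split()).upper()


if __name__ == "__main__":
    length = gene_length_bp(5459569, 5462856)
    print(length)                              # Gene Length field
    print(expected_protein_length_aa(length))  # Protein Length field
    print(normalize_sequence("ATGCTAAGTT CACGAGTTAC"))
```

Applied to the values above, the two length functions should reproduce the Gene Length and Protein Length fields of the record.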