Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_1015 |
Symbol | |
ID | 5207961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 1242622 |
End bp | 1246272 |
Gene Length | 3651 bp |
Protein Length | 1216 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640594629 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001275374 |
Protein GI | 148655169 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAGCG CAAGACGACA AAAGCGATGT CTGAACAGGT ATAATACCAT CACACGCAGA GTATCCGGTT CAATCCGCCC GAATCGCAAA CGCATCGTCA TGATCGGTCC GCATCATCCA CATCCATCGC TCGAACTCCG GGTTTTCGGC ACGCCGCGCC TTTTGCTCGA CCAGCACGAC GTGCCGTTCC GCCGGCGACA ACCGCTGGCA ATTATGGCTG TGCTGGCGCT CAGCGAACGT TCCGTCACCC GCGATGAACT GACGTACCTG TTGTGGCCCG ACTCGCCGCA GACCGTCGGA CGTCAACGGT TACGCCGTTT ACTCTCACAA CTGCGTCAGT CGGTCGGTCC GCTGGCTGAC CGGTTGTTGA CCGGTGAAGC CGGGATCGGC AGTGGCGTCA TTCGTTTCGA TGCCAGTATG TGCCGGGTCG ATGCCCGTGA GTTCGTGCGC TTATCCGGTC AGGCGCGCAC ATTGCCCGAT CCGGACGGCT TACGCGCTGC CGAGCAGGCG ACGCGACTCT ATGCCGGACC GCTGCTGAGC GGTCTGGAAC TCGAAGAGTC GCCGGAGTTC GAACACTGGC TCCTCCAGCA GCGCGAACGC TTCGAGCGCA TGATCCTCGA TATCTGGCGG CGTCTCGTCG ATGGATACGC CAGCACAGGC GATTTTGACC GCGCTATTGC TGCGGTTGAA CATGCGCTCG TTCTCGATCC GCTCTCTGAA ATGCTGCACC GCAAGGCGAT ATGGCTCTAT GCAAAAACCG GTCGTCGCAG TGACGCCATT CGGCAGTTTA CCGTCTGTGT TGCGTTGCTC GAACGCGAAC TGGCGCTTGA GCCGGACGCA GACACTGCGG CGCTCTATCG GGCTATTCTG AATAACCAGC TCGATACTGC GCGGTCGCTG GCATTCCGGG ATCAGTCGCT TCCATCCGCC ATCCGCAGTT CCACCGGTCG TGCGGAACAT ACCTCGCCAG CGACACTCAC GACGACGTTG ACAGCAGATC TGGTCGCGGC CATGCGCACC GCGCTCACCG GATCGTCGCC GGTTGTCGTT GTGGAGGGAC CGGCTGGCAG CGGGAAAACG CGCATGGTTC GCCAGGCGCT GGATCAGATT CGACAGACAA CGACCAATCT TCTGGTCTGG CTATCCGGTG CACGGCGATC AGGTCAGCAA CGCCCCTTTG GCGTCATGAT CGACCTGCTC GATACGGCGC TGCGCGAGCG ATTGAACCAT CCTGACGCAG AGCGTGCTTT GCCGACCGAC GTGCGGATGA CTGAGGCCGT CCGCGTGCTG CCGGAGTTGC GCGCCGTTTT CCTCCGATTG CAACCCATTC GCCAGCCCAC AGAAGAATCA CCATCGTCCC ACCACCTCTT GCAGCGGCGG TTGTTGCAGG CGCTGCCACG CGCCGTCCAT GCGCTGGCAG GCGGAGAGCC GGTTATCGTG GCGCTGGAAG ACCTGGATCA GGCCGATCCA CTCTCGATAG AAGCCGTCGC CTGGCTGGCG CGTTCGCTGC ACGGAACGAA TCTGGCGCTG GTGATCACGT GTCGCACCGC AGAGGGCGCC CTCAACGCGA TGCTCTCCGA TCTGCGCGCA CGCGGCATGC TGCAATCGTT GACGCTGACT GCATTCGATC ACCCGACAGT CATCGGTCTG GCGCAGCATG CCGGTCTGCC GACGACGACG GCTGAACAGA TCTGGCGGCA GACGGGCGGT GCGCCGCTGG CAACCCGCGA GATGGTGCGT GCGATGGTTG CTGCCGGAAA AGACCTTTCC GCACTTCCTT TGTCACTGCA CGAAGCCATT CAGATTCAGT TGAAATCGCT TGCCCCGACC ATCCGCCAGG TTATGGAAGC AGCAGCGGTG CTGCAAAGCG GTGATGCGCT TGAGATTCAG CAGGTCAGCG GACGCACCGC CGACGAGGTG GAACACGCCT GTGAGGAACT TCAGGCACGC GACTGGCTCA CGTTTGACGG ATCACGGTAC GTCGTCGCGC ATCCAGAGGT GCGGGAAGCA GTGCTGGAAA GCCTGAGTCC GGCGCGTCGC CAGCGGCTGC ACCGTCAGGC TGCGTTGGTG ATGCGTCAGC ACAACGCGGA CCCTGCGCGA ATTGCATCCC ATCTTGAGTC CGCCGATCAA CCAGATGAGG CAGCCGCGAT GTGGCTTCAG GCGGCGCGGC GCGCCCGGTC GCTCTATGCG CGCGATGCTG CACTCACGGC GCTTCAGTGT GGTCTGGGGC TGGTTCGTGA TCGACACGTG CTGTTCGAAT TGTTGAGCGA ACAGGAATCA ATTCTGCACG AACACGGGTT GCGCGATGAA CAACGCGCGA CGCTGGAGAC ACTGGAGCGT TTCGTCGAAC AGTCACCCGA TCACCCCGAC TGGCGCGCCG AAGTCTATCG CAAACGCGGG CGTCTTGCCC TTGCGTGCAA CGAGTGGAAT GCCGCTATCG ATGCGCTGCG TCGAGCGGCA GTGTTCACAC TTCACAGCGA CTGGGCAACG TTGTGTCTGC TGGCGCGCGC CCTCGGACAC AACCAGCAGT GGCACGAAGC GGACGAGATA CTCCAACGCG CGCTGGCGCT GGCACAGCAG CAGCGTGATC GTGAAGCGCA GGCGCGCTGC TGGCTGACCC GCGCCGACAT CGAGCAGGGA CGGGAGCGTT TCGACGCTGC CGAAAGCGCC TTGAAGCACG CCGTGCACCT GATCGAACCC TCATCACCAA CATTGCCGCA ACTGATGTTG AACCTTGGGA ATATGGCTAC CGTGCGCAAC GATTTCGTCA GTGCGCTGAC GTATGGACAG GAGGCGCAAC GTCTGTTTGC GCAGCGGGGC GCGCCCGATA GCGAAGCCGC CGCCTGGGTG CTGGTCGCCC GGATGCACGC CCGCCTGGGG CAGTTTGAAG CGGCATTCGA AGCGTACCAG TCCGCCTATG CGGGGTATGC TGCGCTCGAA CTGCGCCAGG GTATGGCAGC CGCTCGTATC AATGCGTGCA CCCTGGCGTT GCGCATTGGC AATTTTGACA ACGGGTTGCG CCTGGCAGAG GAAGCCTGGG AACTGTTCCA GGCGATCAAC GATGCGCGTG GCATGTGTGT CACCGCCAGC AACAGGGGTG CGGCGCTGGT CTGGATGGGA CGCGGCGCCG AAGCCGAGCC ATGGCTGCGC GAGTCGTATG AGCGTGCGGT TGCGATTCCG CTGCCTGCGC AACAGGCGGC AGCGCTGGCG AACCTGGGCG CGGCGCTGCT CCAGCAGGGA CGGCTGGAAG AAGCCCGTCG GTTGATGGAA CAGGGACTGG CGTTGCGCGT CGCGCAGGGG CATATCGATG TGAGTGTCGA CCGCGCATTT CTGGCGATAG CCTGCCTGCG TCTTGGCGAC ATCGAGGCTG CTGACCGGTA TAGCCTGGAG GCTGTGGAGT ACCTGGCGCG AGCGCCACAG GTAGAAAACC CACAGCAGGT CTGGTTTGCA CGTGCTCAGG TCCTCCGCGC TCAGGGTTTG ATAACCGAAG CTAACAATGC TTTGCAGTCC GCCGTGGAAT GTCTATACCG CAGCGAGCAA CAACTGCCGC CACCGTACCG CGAACGCTAC CGCAGCGTGT TTTCGTTCAA TCGTGCGATC CTGCATGCCT TCGATAATGG CGTCTGGCCC GAACCGCCGA TGCTGGTGTG A
|
Protein sequence | MHSARRQKRC LNRYNTITRR VSGSIRPNRK RIVMIGPHHP HPSLELRVFG TPRLLLDQHD VPFRRRQPLA IMAVLALSER SVTRDELTYL LWPDSPQTVG RQRLRRLLSQ LRQSVGPLAD RLLTGEAGIG SGVIRFDASM CRVDAREFVR LSGQARTLPD PDGLRAAEQA TRLYAGPLLS GLELEESPEF EHWLLQQRER FERMILDIWR RLVDGYASTG DFDRAIAAVE HALVLDPLSE MLHRKAIWLY AKTGRRSDAI RQFTVCVALL ERELALEPDA DTAALYRAIL NNQLDTARSL AFRDQSLPSA IRSSTGRAEH TSPATLTTTL TADLVAAMRT ALTGSSPVVV VEGPAGSGKT RMVRQALDQI RQTTTNLLVW LSGARRSGQQ RPFGVMIDLL DTALRERLNH PDAERALPTD VRMTEAVRVL PELRAVFLRL QPIRQPTEES PSSHHLLQRR LLQALPRAVH ALAGGEPVIV ALEDLDQADP LSIEAVAWLA RSLHGTNLAL VITCRTAEGA LNAMLSDLRA RGMLQSLTLT AFDHPTVIGL AQHAGLPTTT AEQIWRQTGG APLATREMVR AMVAAGKDLS ALPLSLHEAI QIQLKSLAPT IRQVMEAAAV LQSGDALEIQ QVSGRTADEV EHACEELQAR DWLTFDGSRY VVAHPEVREA VLESLSPARR QRLHRQAALV MRQHNADPAR IASHLESADQ PDEAAAMWLQ AARRARSLYA RDAALTALQC GLGLVRDRHV LFELLSEQES ILHEHGLRDE QRATLETLER FVEQSPDHPD WRAEVYRKRG RLALACNEWN AAIDALRRAA VFTLHSDWAT LCLLARALGH NQQWHEADEI LQRALALAQQ QRDREAQARC WLTRADIEQG RERFDAAESA LKHAVHLIEP SSPTLPQLML NLGNMATVRN DFVSALTYGQ EAQRLFAQRG APDSEAAAWV LVARMHARLG QFEAAFEAYQ SAYAGYAALE LRQGMAAARI NACTLALRIG NFDNGLRLAE EAWELFQAIN DARGMCVTAS NRGAALVWMG RGAEAEPWLR ESYERAVAIP LPAQQAAALA NLGAALLQQG RLEEARRLME QGLALRVAQG HIDVSVDRAF LAIACLRLGD IEAADRYSLE AVEYLARAPQ VENPQQVWFA RAQVLRAQGL ITEANNALQS AVECLYRSEQ QLPPPYRERY RSVFSFNRAI LHAFDNGVWP EPPMLV
|
| |