Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_3667 |
Symbol | |
ID | 3722157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | + |
Start bp | 779082 |
End bp | 785318 |
Gene Length | 6237 bp |
Protein Length | 2078 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640073341 |
Product | parallel beta-helix repeat-containing transcriptional regulator AraC |
Protein accession | YP_355178 |
Protein GI | 77465675 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCAACG GCGGCGTGGC CCCCCTCACC GACGTGACGC TCACCGATGC GGTGCCGGGC GTGCAGGTGA CGGGCGGGCC GATCTCGCTG GCAGGCGGGG CTTCGGACAC GACGAGCTTC ACCGCGACCT ATGAGCTGAC CCAGGCCGAC GTCGATGCGG GCAGCTTCAC CAACGACGCG AGCGTGACGG GCTTCGTTCA GGTTCAGGGC GGCGGCCGGG TGCCGGTCAC GGCCGATGAC AGCGTGACCA CGCCACTCGC GCTTGCGCCC GCCATCACCC TGGTGAAGGA GGTCGACACG ACCGGCCTCT CCTCTCCGGC GGCGGTGGGT GACGTGCTCG CCTACAGCTT CACCGTGACG AACGCGGGCA ACGTGACGCT TACCGACGTG ACCGTGACCG ACGACAGCCT TCCGGGTCTC GTGCTGACGG GCGGGCCGAT CGCGACCCTC GCGCCGGGCG ACAGCGACAG CACCACCTTC ACCGCCTCCT ACAGCCTGAA GCAGGAGGAT CTCGACCGCG GCTTCGTCGA GAACACCGCA CTGGCGACCG GCACCTATGC CGGGCCCGGC GGCACGCCCG CCGAAGTGAC CGACAGGTCC GGGACGGACA CCTCCAACGA CACGCCCACC GTGGCGACCG TGCCCCCTGC CCCGGCCATC ACGCTGGTCA AGGCGGTGGA TGCGAGCGGC ATCTCCAGCC CCGCGGCGGT CGGCGAGCAG CTCAGCTACA GCTTCACGGT GACGAACACC GGCAACGTGA CGCTGACGGA TGTGACGGTC ACGGACACGA GCCTGCCGGG CCTCGTCCTG TCGGGCGGGC CGATCACCCT CGCGCCGGGA GCGAGCGACG CTGCCACCTT CACCGCCACC TATGCGCTGA AGCAGGCCGA CATCGACCGC GGGTTTGTCG AAAACACGGC GCTCGTCACC GGGACCCATG TCGACGGGAA CGGCGACAAG ACCGAGGTCG AGGACGTCTC CGGCACCGAC GCCGCGAACG ACCTTCCGAC CCGCAGCGAC GTCGAGGCCG CGCCCGCCAT CGCGCTGGTG AAGACCGTCG ACCTCTCGGC CCTCTCGAGC CCGGTCGCGG CGGGCGACGT GCTAAGCTAC GGCTTCGCCG TGACCAACAC GGGCAACGTC ACGCTCACGA ATGTGACCGT GACCGACGAC AGCCTCGCGG ATCTCGTGCT GACGGGCGGC CCCATCGCCT CGCTCGCGCC GAACGCCACC GACAGCACGA GCTACAGCGC GAGCTACACG CTGACCCAGG CCGACATCGA CCGCGGCTTC GTCGAGAACA CGGCGCTTTC CACCGGCACC TACACCGACG GCGCAGGGGT CGAGACGGAG GTCGAGGACG CGTCGGGCAC CGACACGAGC AACGACCTGC CCACGCGCGC CGATCTCGAC GCCCTGCCCT CCATCGCACT GGTCAAGACG GTCGACGCCT CGGCGGTCTC CTCTCCGGCG GCGGTGGGCG ATCTGCTCAG CTACAGCTTC ACGGTGACGA ACACCGGCAA CGTGACGCTG ACCGACGTGA CGGTGACGGA CGACAGCCTT GCCGACCTCG TGCTCGCGGG GGATCCGATC CCGACGCTCG CGCCGGGTGC GGCGGACGCC ACCACCTACA CCGCGACCTA TGCGCTGAAA CAGGCCGACA TCGACCGCGG CTACGTCGAG AACACGGCAC TTGTCACCGG CACCCACACC GATGGCGCGG GCGTCGAGAC GGAGGTCGAG GATATCTCCG GCACCGAGGC CACCAACGAT ACGGCGACGC GCGCCGATCT GGGGACCACT CCCTCGATCG CGCTGGTGAA GGCCGTCGAT CTTTCGGCCG TCTCCTCTCC GGCGGCGGTG GGCGACCTGC TCACCTACAG CTTCACGGTG ACGAACACCG GCAACGTGAC GCTGACCGAC GTGACGGTGA GCGACGACAG CCTCGCCGAT CTGATCCTCG CCGGCGACCC GATCCCGTCG CTCGCGCCGG GTGCGACCGA TGCCACCGCC TACAGCGCCA CCTATGCGCT GAAGCAGGCC GACATCGACC GCGGCTTCGT CGAGAACACC GCGCTCGTCA CCGGCATCCA CACCGATGGC GCAGGCGTCG AGACCGAGGT CGAGGACATC TCCGGCACCG AGGCCACCAA CGACACGCTG ACACGCGCCG ATCTCGAGAC CGCGCCCGCC CTCGCGCTGG TGAAGACTGT CGATGCTTCG GCCGTCTCCT CGCCGGCGGC GGTGGGTGAG CTGCTGACCT ACAGCTTTGC CGTGACCAAC ACCGGCAACG TGACCCTGAC CGGCGTCACC GTGACGGACG ACAGCCTCGC GGGTCTCGTG CTCGCGGGAA GCCCTGTCCC GACGCTCGCG CCGGGTGCGA CCGATGCCAC CGCCTACACC GCGACCTATG CGCTGACGCA GGCGGACATC GACCGCGGCT TCGTCGAGAA CACCGCGCTC GCCACCGGCA CCTACACCGA TGGCGCGGGG ATCGAGACCG AGGTCGAGGA CATCTCCGGC ACCGAGGCCA CCAACGACAC GCCGACCCGC GCCGATCTGG ACACCACGCC CTCCATCGCG CTGGTGAAGA CGGTCGATGC TTCGGCGGTC TCCTCTCCGG CGGCGGTGGG CGATCTGCTC AGCTACAGCT TCACGGTGAC GAACACCGGC AACGTGACCC TGACCGACGT GACCCTGAAC GACGACAGCC TCGCGGATCT CGTGCTCACC GGCGGCCCGA TCCCGTCGCT GGCACCGGGC GCGGCGGACG CCACGACCTA CACGGCGAGC TATGCGCTGA AACAGGCGGA CATCGACCGC GGCTATGTCG AGAACACGGC GCTCGTTACC GGCACCCATA CCGACGGCGC AGGGGTCGAG ACCGAGGTCG AGGACATCTC CGGCACCGAG GCCACCAACG ACACGGTGAC CCGCGCCGAT CTGGGCACCG CGCCGGCCAT CGTTCTGGTC AAGACGGTCG ACGCCTCGGC GGTCTCCTCG CCGGCGGCGG TGGGCGATCT GCTCAGCTAC AGCTTCACCG TGACCAACAC CGGCAACGTG ACGCTGACCG ATGTGACCGT CACCGACGAC AGCCTGCCGG GTCTCGTGCT CACCGGCGAC CCGATCCCGT CGCTCGCACC GAATGCGACC GATGCCACCA CCTACACGGC GAGCTATGCG CTGAAGCAGG CGGACATCGA CCGCGGCTAT GTCGAGAACA CCGCGCTCGT CACCGGCACT CATACGGATG GCGCAGGGGT CGAGACCGAG GTCGAGGATA TTTCCGGCAC GGAGGCCACC AACGACACGC TGACGCGCGC CGATCTCGAC GCCCTGCCCT CCATCGCGCT GGTCAAGGAG GTCGATGTCT CGGCCGTCTC CTCTCCGGCA GCGGTGGGCG ACCTGCTCAC CTACAGCTTC ACGGTGACGA ACACCGGCAA CGTGACGCTG ACGGATGTCA CGGTGACGGA CACGAGCCTG CCGGGCCTCA CCCTCACGGG CGGGACCATT GCCAGCCTCG CTCCGAAGGC AAGCGACACC GCCACCTACA CGGCGAGCTA TGCGCTCACT CAGGAGGATC TCGACCGGGG CTTCGTCGAG AATACCGCGC GTGTGACCGG CACCTATACC GACGGCACGG GCGGCGAAAC CGAAGTCGAG GACATCTCCG GCACCGACGC GGGCAATGAC ACTCCCACCG AAGCCCTGAT CGAGCCGGCC CCGGCCCTCG CGCTGGTGAA GACGGTTGAT CTCTCGGGTC TCGGCACGCC GGCGGAAGTG GGCGAGGCGC TCACCTACAG CTTCACGGTG ACCAACACCG GCAATGTGAC GCTGACGGAT GTGACCGTGA CGGACACGAG CCTGCCGGGC CTCGTCCTTA CCGGCAGCCC GATCGCCCGC CTCGCCCCCG GCGAGAGCGA CAGCACCGCC TACAGCGCGC GCTATGCCCT GACGCAGGAG GATCTCGACC GCGGCTTCGT CGAGAACACG GCGCTGGCCA CCGGCCTCCA TACGGACGGC ACCGGGCGGG AGACGCAGGT CGAGGATGTC TCGGGCACCG ACGTCGGTCG CGACGATCCG ACCGTGGCTC CGGTGGGGCA GGCGCCGGCC GTGGCGCTGG TCAAGGCGGT GGACGCTTCG GCCGTCTCCT CGCCGCCGGC GGTGGGCGAC CCGCTGACCT ACAGCTTCAC CGTGACGAAC ACGGGCAGCG TGACGCTGAC GGACGTGACC GTCACCGACG ACAGCCTTCC GGGCCTCGTG CTGGCGGGCA GTCCCATCCC GCGCCTTGCG CCGGGCGAGA GCGACAGCAC GACCTACAGC GCCCGCTACC TGCTGACGCA GGAGGATCTC GATCAGGGTC GGGTGTCGAA CACGGCGCGC GTCACCGGCA GCTACCGGGC GCCCGACGGC TCGGCCGACA CCGTCACCGA CATCTCGGGC ACCGAGATCG AGAACGACGA TCCGACCGAC ACCGAGTTCG CGCCCGTCCC CGGCATCGCG CTCGTGAAGA CGGCCGATGT CTCGGGCATC GGCAGCCCGG CGGCCATCGG CGAATTGGTC CGCTACAGCT TCACCGTGAC GAACACGGGC AACGTGACGC TCGCGGATGT GACGGTGAGC GACACGAGCC TGCCGGGCCT CGTCCTGAGC GGCAGCCCGA TCGGGCGGCT TGCTCCGGGC GAGAGCGACA GCGTGACCTA CAGTGCCGCC TATGCGCTGA CGCAGGCAGA CCTCGACCGC GGCGTCATCG AGAACACGGC CCGGGCCACG GGCGCGTATC GCGGGCCGGA TGGCGAACCG GGCACGGTCG AGGACATCTC GGGCACCGAG GCGGAGAACG ATGATCCGAC CCTGTCGCTC GTGCCGCAGA CGCCCGGGAT CGCCCTCGTC AAGGAGGTGG CGGACGAGTC CGTCAGCACG CGCCCGGCCC TGGGCGACGA GCTTCTCTAC CGCTTCACGG TGACGAACAC CGGCAACGTG ACCCTCACCG ACGTCACCCT GACCGACGAT CTGGCGGGCG CGGTGGTCTC GGGCGGCCCC ATCGCCGCAC TGGCGCCGGG CGAGACCGAC AGCACCACCT TCACGGCACG CTATGCGCTG ACGGCGGCGG ATCTCGAGCG GGGGCAGGTC GCGAACACGG CCCGCGTCAG CGGGACCAAT CCGGGCGATC CCGACACGCC GGTGACGGAT GTCTCGGGCA CCGAGGTGGG CAATGACACG CCGACCGTGG TCGAGCTCGA CGTGCCCACG GATGTGACGG CCACCAAGAC CGCCAGCCCC GAGCGCGTGG TCATCGGCGA GACCGTCTCC TATGTGCTGG CCTTCACCAA CGACGCGCCG CGCTCGATGC GCGAGGTGGT GCTGGTCGAC CGGATGCCCG ACGGCCTCGT CTATACGCCC GGCAGCGCCA CGCTCGACGG CACGCCTCTG GAGCCCGAAG TCAGCGGCCG CTTCCTGCGC TGGCAGGCGG ACACCCTGCC CGCGGGCGGC ACGATCACCG TGCGCTTCGC CGCGCGCGTG CTGGGGGCTG CGCCCTACGG GCCGCTCACC AACAAGACCT GGCTCCTCGA CCGCACGGGC CAGCGCTCCT CGAACGTGGC GGAGGCCGTG GTGATCCGCG AGCCCGAGCA TGTGTTCGAA TGCGCCGACA TCATCGGGAA GGTCTTCGAC GACCGGAACA TGAACGGCTA TCAGGATCCG ATCGACGGCG CGGCCCACGG CCGCGGCGCC GAGGCCGAAG AGCCGGGCAT CCCCGATGTG CGTCTCGCGA CGCCGAACGG GACCCTGATC CAGACCGACA AGTTCGGCCG CTTCCACGTC CCCTGCGCCG AGCTACCCGG CCAGACCGGC GCGAACTTCA CGCTGAAGCT CGACACCCGC TCCCTGCCCT CGGGCTACCG GGTGACGACC GAGAACCCGC GCACGATCCG CGTCACGCCG GGCAAGATGG CCAAGCTGAA CTTCGGCGCG GCTCTGGGCC GGGTGGTGCG GCTCGACCTG ACGGCGGCGG CCTTCGCGGA CGGCCGGCCG ACCGCGGCCT TCGCCCGGGC GCTCGAGCAG ACGGCGGCAA GCCTCGGCGA TGCGCCGGTG GTGGTGCGGA TCAGCACCCG GCAGGACGCC GGCGGCGCGG GCGCGGCAAA AGCGCGGCTC GATGCGGCCG AGGCGCTGGT GCGCAAGGCC TGGAAGGGTC GGGCCGGGCC GGTGCTCATC GAACGCACGA TCCAGCGGGA CCAGTAA
|
Protein sequence | MTNGGVAPLT DVTLTDAVPG VQVTGGPISL AGGASDTTSF TATYELTQAD VDAGSFTNDA SVTGFVQVQG GGRVPVTADD SVTTPLALAP AITLVKEVDT TGLSSPAAVG DVLAYSFTVT NAGNVTLTDV TVTDDSLPGL VLTGGPIATL APGDSDSTTF TASYSLKQED LDRGFVENTA LATGTYAGPG GTPAEVTDRS GTDTSNDTPT VATVPPAPAI TLVKAVDASG ISSPAAVGEQ LSYSFTVTNT GNVTLTDVTV TDTSLPGLVL SGGPITLAPG ASDAATFTAT YALKQADIDR GFVENTALVT GTHVDGNGDK TEVEDVSGTD AANDLPTRSD VEAAPAIALV KTVDLSALSS PVAAGDVLSY GFAVTNTGNV TLTNVTVTDD SLADLVLTGG PIASLAPNAT DSTSYSASYT LTQADIDRGF VENTALSTGT YTDGAGVETE VEDASGTDTS NDLPTRADLD ALPSIALVKT VDASAVSSPA AVGDLLSYSF TVTNTGNVTL TDVTVTDDSL ADLVLAGDPI PTLAPGAADA TTYTATYALK QADIDRGYVE NTALVTGTHT DGAGVETEVE DISGTEATND TATRADLGTT PSIALVKAVD LSAVSSPAAV GDLLTYSFTV TNTGNVTLTD VTVSDDSLAD LILAGDPIPS LAPGATDATA YSATYALKQA DIDRGFVENT ALVTGIHTDG AGVETEVEDI SGTEATNDTL TRADLETAPA LALVKTVDAS AVSSPAAVGE LLTYSFAVTN TGNVTLTGVT VTDDSLAGLV LAGSPVPTLA PGATDATAYT ATYALTQADI DRGFVENTAL ATGTYTDGAG IETEVEDISG TEATNDTPTR ADLDTTPSIA LVKTVDASAV SSPAAVGDLL SYSFTVTNTG NVTLTDVTLN DDSLADLVLT GGPIPSLAPG AADATTYTAS YALKQADIDR GYVENTALVT GTHTDGAGVE TEVEDISGTE ATNDTVTRAD LGTAPAIVLV KTVDASAVSS PAAVGDLLSY SFTVTNTGNV TLTDVTVTDD SLPGLVLTGD PIPSLAPNAT DATTYTASYA LKQADIDRGY VENTALVTGT HTDGAGVETE VEDISGTEAT NDTLTRADLD ALPSIALVKE VDVSAVSSPA AVGDLLTYSF TVTNTGNVTL TDVTVTDTSL PGLTLTGGTI ASLAPKASDT ATYTASYALT QEDLDRGFVE NTARVTGTYT DGTGGETEVE DISGTDAGND TPTEALIEPA PALALVKTVD LSGLGTPAEV GEALTYSFTV TNTGNVTLTD VTVTDTSLPG LVLTGSPIAR LAPGESDSTA YSARYALTQE DLDRGFVENT ALATGLHTDG TGRETQVEDV SGTDVGRDDP TVAPVGQAPA VALVKAVDAS AVSSPPAVGD PLTYSFTVTN TGSVTLTDVT VTDDSLPGLV LAGSPIPRLA PGESDSTTYS ARYLLTQEDL DQGRVSNTAR VTGSYRAPDG SADTVTDISG TEIENDDPTD TEFAPVPGIA LVKTADVSGI GSPAAIGELV RYSFTVTNTG NVTLADVTVS DTSLPGLVLS GSPIGRLAPG ESDSVTYSAA YALTQADLDR GVIENTARAT GAYRGPDGEP GTVEDISGTE AENDDPTLSL VPQTPGIALV KEVADESVST RPALGDELLY RFTVTNTGNV TLTDVTLTDD LAGAVVSGGP IAALAPGETD STTFTARYAL TAADLERGQV ANTARVSGTN PGDPDTPVTD VSGTEVGNDT PTVVELDVPT DVTATKTASP ERVVIGETVS YVLAFTNDAP RSMREVVLVD RMPDGLVYTP GSATLDGTPL EPEVSGRFLR WQADTLPAGG TITVRFAARV LGAAPYGPLT NKTWLLDRTG QRSSNVAEAV VIREPEHVFE CADIIGKVFD DRNMNGYQDP IDGAAHGRGA EAEEPGIPDV RLATPNGTLI QTDKFGRFHV PCAELPGQTG ANFTLKLDTR SLPSGYRVTT ENPRTIRVTP GKMAKLNFGA ALGRVVRLDL TAAAFADGRP TAAFARALEQ TAASLGDAPV VVRISTRQDA GGAGAAKARL DAAEALVRKA WKGRAGPVLI ERTIQRDQ
|
| |