Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3159 |
Symbol | |
ID | 5706108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3643570 |
End bp | 3646476 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641272591 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001537958 |
Protein GI | 159038705 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000459834 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGCACTTCC GACTTCTCGG CCACATTGAG GTCGTTGACG ATGGGACGAG TTTGACGCTC GGCGGCATCC GCGCCCGGGC GACGCTCGGC TATCTGCTGC TCAACCACAA CTCGGTGATC CCGGTCAGCC GACTCATGAA GGCGTTGTGG CCCAACGGCA CCCCGGCTAC CGGACGCAAG ATGCTGCATA ACGCCATCTC CCACCTGCGC GGCGTGATCG GTACGAACAG CATGACCGCG CGGGCACCGG TGCTGCTGAC CCACGCGCCC GGCTACTTCC TGCGGGTGCA GGGGGAGTCG GTGGACATGA TCTGTTTCAG GCAGCTGGTG GATGCTGCCC GGGGTGATCT CGCGGCGGGG CGGTGGGAAC AGGCAGCCGA CACACTGCGC GGCGCACTTG CCCTCTGGCG CGGACCCGCG ATGGTGGATC TGGTCGAGGC GGGGGTGAAC TGGGCTGAGC TGACCGCACT TGAAAATGAC CGGCGGGCGG CGATCGAGTG TCTCGTCGAC GCGGATCTGG CCTTGGGCCG GCACGCGCAG GTGATCAGCG AGCTGGAGAC GCAGCTGCAG GCCGGTCCGC TGCGGGAGCG GATGGCCGGG CAGCTGATGC GGGCGCTCTA CCACTGCGGT CGACAGTCGG ACGCGCTCGC GCTCTACCAG CGCACCCGGA CCGCTCTGAT CGAGGAGTTC GGCCTGGATC CCAGCCCGGA GCTGCAGCAG CTGGAACGTG CCATCCTCAA CCATGCCCTC GTTGGGGGCA CGGCCGTGAG GCCGTCCGCC GACCAGCCGG CCGAGCTGAT CGTTCATAGC GATTCGGCGT CGCCCCTGCC TTCCCGGCCT GCCTCGGCTG GTCTGAACGC GGTACCCGAT GTCGCGTCCG ACCGGGTGGT GCCAGACACC TCAGCCGAAG CGGAAGCATC GGTGTCCGCT GCGCTGACCG CGGTACCCGA TGTCGCGTCC GACCGGGTGG TGCCAGACAC CTCAGCCGAA GCGGAAGCAT CGGTGTCCGC TGCGCTGACC GCGGTACCCG ATGTCGCGTC CGACCGGGTG GTGCCAGACA CCTCAGCCGA AGCGGAAGCA TCGGTGTCCG CTGCGCTGAC CGCGGTACCC GATGTCGCGT CCGACCGGGT GGTGCCAGAT ACCTCAGCCG GAGCGGGCTC CTCGGCATCC ACAGTGGCTA CGGCCGCCGT TGCCAGCGAG ATCAAGCAGG TCAGCATTGT GCTGGTCCGC GCTATCAAGG AGACGGCCGA GGCGTCGGAC CCGGCCGCAG CGGCGCTGAC GCACGAGCGC ACTACCCGGG TTGTCGAGGC CGAGGCCGCG CGGTGGGGCG GCACGATGGC CGGGACGTTG GGTTCGCTGT GGATGGTCGT CTTCGGGGTA CCGGTCTCCG GGGAGTTCGA CGCCTGGCAT GCGGCACGGG CGGCCCGTAC CATCAGCCAC CGCTTGACCC GACAACCGCC GAACGAGGAC GGGCCGGACG GCGACCCGTC CGGAGCGGTG GTGGTCGCCA CCGGCAAGGC CCTCGTCCAA CCGTCGGGCG GGTCGCCGTC CGCAGTCAGC GGGGCGGTGT TCGACCGGGC CATGGGCCAG CTCATCGCCG CTCGACCGGG CGAGGTGTCG ATGTGCGAGC AGACGGCTGC CGCGGGTAGG ACCACCGAGC GTCGCACCCT GTCGCTGGTC AATGTGCCGA CCGGCGGCGC CTCCGAGCCG TTCGTTGGTC GCGACCATGA CATGCGCGTC CTCCAGCACG CCTGGGCCCA GGTGCGTCGG CTCCACCACC CGCATCTGGT GACCGTTCTT GGACCAGCAG GGATGGGTAA GACCCGGCTG CTCGCCGAGT TCACCGGCCG AGCCGCAGGC GCGGACCCAG CGGCGGAGTG CCTGATCGGT AGCTGTGCCC CGGCCGAGCC GGGCGGTACC GGGTTCGACG CGCTGGCCGA TATCGTCAAG GCACGCTGCG GGGTTGCCAC GGACGGCAGT GAGTCCCACA CGCGGGCCCG GCTCGCGTCG GCGGTCACCG GGCTGTCCGG GCTGAACCGG CCGGACTGGG TGCTGGGCCA CCTGTGCGAC CTCGTTTTCG GCACGACCGA TGGCGGCGAG GATGCGGTGT CCGCCTGGTG CTCCTTCCTG GAAGCGACCG TGGTCGAGCG CCCGGCCGTG CTGGTTTTGG AGGATGCGCA CTGCGCTGCG GACGCGGTGC TCGACGTCGT CGACCAGCTG GCCAGCTCGA CCGGGCCGCT GCTCACCGTG GTCACCGCCC GCCCGGAGGA GTTACTCGGC CGGCGGCCCG GCTGGGGTGG CGGCAAACGC TACGCGAGTG CCACGGTCCT GCAGCCACTG GACGACCGCG ACATACGTGA ACTGGCCGCC ACGCGCCTTG GCCGCCGTAA CCGGGACGGT GCGCCCACTT TGGTCGACGC GGTGGTCACC AAGGCGGGCG GGAACCCGTT CTTCGCGCTG GAGTTGGCTG CCACGCTGGC CGGCTCTGCG GACCGGCCGG TGAGCGCCGT ACAGCACGCG ACCGGCAGCA CCTCGTTGTC CTTGAGCCTG CCCCCGGCCG TCCGTGCCGT GTTGGACGCG CAGATCGACA CGTTGCCCCT GGTGGCCAAG ACTGTCCTGC TCGACGCGGT CGTGCTGGGT GCGAGCTTCT GTGGCGGCGC CGTCGCTGCG ATAGGGTCGG CCTGCGCCGC GGATGTCGAG CAGGGTTTGC GGCACCTCGA ACGCCTCGAC CTGGTGGCCC GCAACGGGCA GCTGTCACCG GCAGGGCGGC CGGAGTACGA GTTCCGCATC AGCGCGCTGC GCGACGTGGC GTACGAACGC CTGGTGCCCG CCGCCCGTGA GGAGAAGCAG CGGATCGTCG CTGGATGGCG GTTCCGGCTG CCGGATTCGG ACAAGCTGGC CTCGTGA
|
Protein sequence | MHFRLLGHIE VVDDGTSLTL GGIRARATLG YLLLNHNSVI PVSRLMKALW PNGTPATGRK MLHNAISHLR GVIGTNSMTA RAPVLLTHAP GYFLRVQGES VDMICFRQLV DAARGDLAAG RWEQAADTLR GALALWRGPA MVDLVEAGVN WAELTALEND RRAAIECLVD ADLALGRHAQ VISELETQLQ AGPLRERMAG QLMRALYHCG RQSDALALYQ RTRTALIEEF GLDPSPELQQ LERAILNHAL VGGTAVRPSA DQPAELIVHS DSASPLPSRP ASAGLNAVPD VASDRVVPDT SAEAEASVSA ALTAVPDVAS DRVVPDTSAE AEASVSAALT AVPDVASDRV VPDTSAEAEA SVSAALTAVP DVASDRVVPD TSAGAGSSAS TVATAAVASE IKQVSIVLVR AIKETAEASD PAAAALTHER TTRVVEAEAA RWGGTMAGTL GSLWMVVFGV PVSGEFDAWH AARAARTISH RLTRQPPNED GPDGDPSGAV VVATGKALVQ PSGGSPSAVS GAVFDRAMGQ LIAARPGEVS MCEQTAAAGR TTERRTLSLV NVPTGGASEP FVGRDHDMRV LQHAWAQVRR LHHPHLVTVL GPAGMGKTRL LAEFTGRAAG ADPAAECLIG SCAPAEPGGT GFDALADIVK ARCGVATDGS ESHTRARLAS AVTGLSGLNR PDWVLGHLCD LVFGTTDGGE DAVSAWCSFL EATVVERPAV LVLEDAHCAA DAVLDVVDQL ASSTGPLLTV VTARPEELLG RRPGWGGGKR YASATVLQPL DDRDIRELAA TRLGRRNRDG APTLVDAVVT KAGGNPFFAL ELAATLAGSA DRPVSAVQHA TGSTSLSLSL PPAVRAVLDA QIDTLPLVAK TVLLDAVVLG ASFCGGAVAA IGSACAADVE QGLRHLERLD LVARNGQLSP AGRPEYEFRI SALRDVAYER LVPAAREEKQ RIVAGWRFRL PDSDKLAS
|
| |