Gene Information |
Locus tag | Sare_4843 |
Symbol | |
ID | 5707622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5496603 |
End bp | 5500058 |
Gene Length | 3456 bp |
Protein Length | 1151 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641274239 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001539584 |
Protein GI | 159040331 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
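The coordinate and length fields in the Gene Information table above can be cross-checked with simple arithmetic. The sketch below is an illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the protein length excludes a single terminal stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table for Sare_4843.
START_BP = 5496603
END_BP = 5500058
GENE_LENGTH_BP = 3456
PROTEIN_LENGTH_AA = 1151


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    computed_bp = span_length(START_BP, END_BP)
    computed_aa = expected_protein_length(computed_bp)
    # Report whether the computed values agree with the table fields.
    print("gene length matches:", computed_bp == GENE_LENGTH_BP)
    print("protein length matches:", computed_aa == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed values agree: 5500058 - 5496603 + 1 = 3456 bp, and 3456 / 3 - 1 = 1151 aa.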
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0675424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00881271 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCAACGA CACTTGATCA TCGGCTTCGT GTCTCGGTCC TGGGCTCCGT GCGGGCCTGG CTCGGCGAAC ACGAGCTGCC CCTCGGACCG GCCCGGCAGC GGGCACTCTT CGCGGTGCTG GCAGCCGCCG CCGGCCGTCC CGTCGGCCGC GACGAGCTGA TCGAGGGAAT CTGGGGCACC TCGCCGCCGG CTACCGCGGC CGGCAGTGTC TACACCTACG TCTCCGGGCT TCGGCGTAGT CTGGCGACGC CGGGCCTACC CCGATCGAGT CACCACCTCC TCACCTCCGG CCCGTCGGGC TACGCCCTGC GGCTGTCCCC CGAGGACCTG GACAGCGAAC GCTTTCTGGC TTTCTGCCAC CAGGCTCAGG AACTGCGGGA GGCCGATCCA CAGGCTGCGG CCACCCGCTG GGACGAGGCA CTGGCGTTGT GGCACGGGGA GGCGTACGCC GGAGTCTCCG GTCCCCGGAT CGAGACGGAA CGTACCCGGC TGGCCGAGCG TCGCCTCACC GCGACCGAAC AGCGGGCACG ACTGGGCCTG CACCTCGGCG ACGACGACCT GCTGGCCACA CTGTCCGGCC TGGTGCACGA GAACCCGCTG CACGAGCCCC TGTACGAGCT GCTGATGCTC GCCCTGCACC GGATCGGCCG CCGTACCGCG GCGATCGAGG TGTTCCGGGC CGCTCGGCAC ACCCTCCTCG CCGAGCTCGG TGTCGGGCCC GGCCCGGCGC TGACCGAGTT GCACCGACGG CTACTGGCCG AGCCGGCCGC CCCCGCCATC CCCGCCACCC GCCCCACACC ACCGATCCTG CCATCAGAGA TCAGCAGCGC GGCGTGCGAC GGTCAGGCGC GGGTGCTGGC GGGGCGCGAC GACGAACTGA CGATGCTGCG CATCCTGGTT CGGGCGGTCG TAGCGGGCCA CGGAACCGCG CTGTGGATCG AGGGTGAGCC GGGCATCGGC AAGACGGAAC TACTCACCGC CGCCATCGCC GACGCGAACG GCCTGGGCTG CCAGATCGCG TGGGGCGCCG CCGACGAACT GGACCAGGAC GCCCCGCTGC AGGTGATCTG GCGGGCGTTG GGCGTGCGGG CGACCTCGGA GTGGCATCGG CCCCCGAACC CGGCGGCCGC CGCGGAACGC CTGCTCGCAC ACGTGCGGGC GAGCTGCGAG ACCGGGCCCC TTGTGCTCGT CGTCGACAAC TTTCAGTGGG TGGACGAGGT CAGCTGCCTG CTCTGGGACC GGCTGATCGC CGCCACGCGA CGCATGCCGC TGTTGCTTGT CGCGGCGACC CGACTCGGGT CGTACGGCCA CGCACTAGCC CAGCTACGCC GCAGCGTCCA GGCCCGGGAG GGCCACGTAC TGCGTCTGCG TGCGCTGCCG CCAGCGGCGA TCGAGCAGTT GGTGGCCCAG TTGGTCGGGG CTCCGGTGGG GCCGTACCTG GCGGCGGTCG TGCCGCGGCT GGGCGGAAAC CCGCTCGGTG CCCGTGAGGT GATCACGGCC TTGGTCCGTC GTGGGGCGGT GCGGATCAGT GACGGGTGTG CCGACATCGA CACGTCCGTG CCAGTGGAGG CTCCCCGCTC ACTGCTGGTC GTGATCCGGG CCATCGTGGA CCTTCTCTCA CCCGCCACCC AGGAGGTCCT ACGAACGGCG GCGCTACTTG GCCCGGAGTT CAGCGGCGAC GACATCGTCG CGGTCACCGG ACGGTCACCA GTCGACCTGG TGGCAAACCT GGAGGAGGCA CTCGCTGCCC ACATCGTCGT GGAAGCGGGC AGCGTGCTCG CCTTTCGTCA CCCACTCGTG 
CGGCGAATGC TCTACGAGGG TATCCCGGCG CCGGCGCGAG CGGCGTTGCA CCGGCACACG GCCGAGGTGC TGCACCGTGG CGGCGGTTCG GCGACCCGGG TGGCCGAGCA ACTCGCCGCC GGTGAGCCCC CGGTCGACGA GTGGATGGTG TCCTGGCTCG TCGAGCGGCA CGCCGAGGTG ACCAAGCGGG CGCCGCAGGT CGCCGGCCAC CTGCTGCGGC GGGTGCTCGA CACGGATGTA CCCAGCACGA CACAGCGCGC GGTGTTGCTC ATCGCCCTGG TCAGACTGGA CTTCCGGCAC CAGCGGTGGT CGATGGCCGA GGCGCGCGAG GCCGCCGGGC TGGCGCGGGA TCCGGCCGAT CGTGCGGAGA TGCGTCACCT GCTCGCCACC ATGGCCTTCC GTCGTGGCGA CGCCGTGGCC GCGACCAGCC TGCTCGAGGC GTCGCTGTGC GATCCGGAGG CGCCCGAGCT CTGGCGGACG AGCCACCATG TGCTGCTGGC CACCTTTCGG CGGGGCAGCC TCGACGATCT CGACAGCGCC GATCGGACGG CCGGGTGCAC CCACAGCGAG GCAGTCGCGG CGGGACAGCC GTACGAGGCC GCGTTCGCGT TGCAGACCGT GTGGCTGACC AACTCGATCC GGCGCGACCA CGAACGTGCG CTGGAGTACG TCGACCAAGC GTTGGACACC GTTCGGGGCC GTCGGGCGTG CGTGGGCATG TACCTTGACC TGCTCGACAA CCGGACGTTC ACCCTGCAGA ACCTCGACCG GCTCGACGAG GCCGAACGGA CCCTGCGCGA GGCGGCGCTC GTCGCCCTCC GACACCGGCT TCCACACGGC CTACCGGTGG CCACGGCGGT CCAGCACTAC TGGCTTGGCC GCTGGGACGA CGCCCTGACA ACGCTCAGCG CGGTCAGGGC GGTCGCCGAT GACAATCCCG GCATCACGTT CCTGGGAAAC CGTGAACCGG GTGCCGTCAC CATGCTGCTG CACGGTGTCG CCGCGCTGAT CGCCGCACAC CACGACGACG CTGGCCTGGT CGCCGAGCAC CTGGCGGCGG TGCACGCGCT GCCCGCCACC GATGCCGAGC GGGCGAGTAG CGACTTCCTG CTGGTCGCCC GCTCGTTGGT CGCCGAGCAG CAGGGCCGGG CCGACGAGGC ACTCGACGTC CTCGCCCCCC TGCTGGCACC GTCGTACGCG CCGATGATGC TGCGTCACCA GTGGCTTCCG GGCGTGGTGC GGCTCGCCCT CGACCAGCGC CGTACCGACG TCGTCGAATG GGCGGTGGAG ATCTGCGCGG GTGAGGCGAA CAAGGAGGTC CGGCCCGCTC GAGCAGATGC CGCCGCCCAG CGTTGTCGGG CGCTGATCAC GGGGGATCCG GAACCCGCCC TGGCTGCGGC GGCCCGCTAC CGCAGGGTGG GTCGGGCGCC GGAACTGGCG GCGGCGCTCG AGGACGCAGC GGTGCTGCTC GCCGCCGATC GTCGCCCGCA CGAGGCGATC CGGGTGTGCG GCGAGGCGGT ATCGCTGTAT TCGGCGCTCG GCGCCCGGTG GGATCTGCGC CGGATCCGCC GCCGGCTCGC TGAGTTCGGG GTGGATCGGT CCCGCGAAGC GGTGTCCGTG GATCGGCGGT CCGTCGCTGT CCTCAGTGCA GGGTGA
|
Protein sequence | MSTTLDHRLR VSVLGSVRAW LGEHELPLGP ARQRALFAVL AAAAGRPVGR DELIEGIWGT SPPATAAGSV YTYVSGLRRS LATPGLPRSS HHLLTSGPSG YALRLSPEDL DSERFLAFCH QAQELREADP QAAATRWDEA LALWHGEAYA GVSGPRIETE RTRLAERRLT ATEQRARLGL HLGDDDLLAT LSGLVHENPL HEPLYELLML ALHRIGRRTA AIEVFRAARH TLLAELGVGP GPALTELHRR LLAEPAAPAI PATRPTPPIL PSEISSAACD GQARVLAGRD DELTMLRILV RAVVAGHGTA LWIEGEPGIG KTELLTAAIA DANGLGCQIA WGAADELDQD APLQVIWRAL GVRATSEWHR PPNPAAAAER LLAHVRASCE TGPLVLVVDN FQWVDEVSCL LWDRLIAATR RMPLLLVAAT RLGSYGHALA QLRRSVQARE GHVLRLRALP PAAIEQLVAQ LVGAPVGPYL AAVVPRLGGN PLGAREVITA LVRRGAVRIS DGCADIDTSV PVEAPRSLLV VIRAIVDLLS PATQEVLRTA ALLGPEFSGD DIVAVTGRSP VDLVANLEEA LAAHIVVEAG SVLAFRHPLV RRMLYEGIPA PARAALHRHT AEVLHRGGGS ATRVAEQLAA GEPPVDEWMV SWLVERHAEV TKRAPQVAGH LLRRVLDTDV PSTTQRAVLL IALVRLDFRH QRWSMAEARE AAGLARDPAD RAEMRHLLAT MAFRRGDAVA ATSLLEASLC DPEAPELWRT SHHVLLATFR RGSLDDLDSA DRTAGCTHSE AVAAGQPYEA AFALQTVWLT NSIRRDHERA LEYVDQALDT VRGRRACVGM YLDLLDNRTF TLQNLDRLDE AERTLREAAL VALRHRLPHG LPVATAVQHY WLGRWDDALT TLSAVRAVAD DNPGITFLGN REPGAVTMLL HGVAALIAAH HDDAGLVAEH LAAVHALPAT DAERASSDFL LVARSLVAEQ QGRADEALDV LAPLLAPSYA PMMLRHQWLP GVVRLALDQR RTDVVEWAVE ICAGEANKEV RPARADAAAQ RCRALITGDP EPALAAAARY RRVGRAPELA AALEDAAVLL AADRRPHEAI RVCGEAVSLY SALGARWDLR RIRRRLAEFG VDRSREAVSV DRRSVAVLSA G