Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2592 |
Symbol | |
ID | 5707886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2955795 |
End bp | 2957603 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641272054 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001537424 |
Protein GI | 159038171 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0592694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCACG CCGCGGCGAG CCTATCCGGC CATGACCAGC CAGCCCAACC GGCGGCGTTC CGGGTGCTCG GCCCGTTGAC CCTCAGCAGC AGCAAGGACA CGCTGGTCCT CCCACCGTCG AAGGTGACCT CGCTGCTCGC CGTGCTGCTG TTGCACCCCG ACGAGGTGGT CTCGGTCAGC GCGTTGCGGC AGGCCATCTG GGGCGACGAG CAGCCTGCCT CGGCCAAGGC CGCGCTGCAG ACCTGCACCC TGCGACTTCG GCAACTGTTC GCCCGGCACG GCATCACCGG CAGCGTGATC AAGACCGTGC CGGGTGGCTA CCGGATCACC GCCACCGCCG AGACCGTCGA CCTGATGCGC TTCCGCGAAC TGATCGGCCA CACCCGCGAC GTCACGGACC CGGAGGTGGA ACTGGCGAGG CTGGAGGAGG CGCTCGCGCT GTGGAGCGAC CCGATGCTGG CCAACGTGCC ATCGGAGGCA CTGCACCGGG ACGTGGTACC CCGGATCAGC GAGGAGCGGG TCCGAGCGAT CGAGCGGGTG TGTGACCTGA AGATCGGCCT GGGCCGAGAC CGCTCCGCGC TGGTGGACCT CTGGACCGCC GTCCGTGCGT ACCCGGCGAA CGAGCGCTTC TCCGCGCAGC TCGCCTCGGT GCTCTACCGC ACCGGCCGGC AGGCCGACGC CCTGGCCGAA CTGCGTCGGA TCCGGGACCA CCTGCGCCAC GAACTCGGCA TCGCCCCCGG TCCCACCCTG CGCGCGTTGG AGCTGACCAT CCTGCGGGGC GAAGCACCCA CGTCGGTCGG CCCGGTCGTG CGACCGACCA GCGTGGTGAC CCGGCACCCG GTCGCGTCCG GCCTCGTCGG TCGGGACGCG CTCGCCGAGA CCATCGCCGA ACGCCTCCGC GCGGGATGCC CGATCGTGGT GCTCAGCGGC CCACCCGGCG TCGGCAAGAC CGCGCTGGCG CAGTACGTCG GGCAGCTCGT CGCCCCCGAC TTCCCCGGCG GCCGGGTGGA GGTGACCGCG GCCGCCACCG GGCCGGTGAC GTCGCTGCAC GACGCCCAAC AGCAGCTACT CACGGCGGTC GACCAGGGCC ACGACGTCCA GGTGGGCAGA CGACTGCTGC TGGCCGACGA CGTGGTCAAC GGCCGTCAGG TACGGGACCT GCCGGCCCTG TTGGCGCCCG GCGACGCCCT GCTGCTGACC AGCCGACAAA GCCTCTCTGG ACCGGTTGCC CGACTGGGCG GGTGGCTGCA CCGGGTGGAG CCGCTGAACC CCGACGACTC GCTGCGGCTG CTCCGTACCG CACTCGGCCC CGAACACGTC GACGCCGACC CGCAGAGCGC CGCGGAGATC GCGGCACTCT GCGACCACCT GCCACTCGCG CTGCGCATCG CCGCCACCCG CATCCTGCTG CGCACCAGGA CGGACCTCGC GGCGGCGTTG GAGTGGCTGC GCGTCGACCC GCTGAGCCGG CTGAGCCTGC CCGGTGAACC GGACATGTCA CTCGGTCACC GCTTCGACGA GGCCCTGTCC CGGACCGGCG ACACGCTGGG AGCGGTCTTT GTCCGACTGG CCACCGCGGC TCCCACCACC ATCACCGTCC GACAGGCCGC GCAGCTACTC GACGTCGACC CGGCCACGGC TCGCGACCTG CTCGACGAAC TGGTCGACCA CAGCCTGCTC GAGGAGGCCG CGGACCACTA CTGGATACGC GTACTGCTGC GGCGACACGC CCAGGTTGCG GCCGATCGGT ACGCCCCCCA CCCCGGCCCA CCGCGACACC CGCACCAAGC GAAGGGATCC ATGCGATGA
|
Protein sequence | MEHAAASLSG HDQPAQPAAF RVLGPLTLSS SKDTLVLPPS KVTSLLAVLL LHPDEVVSVS ALRQAIWGDE QPASAKAALQ TCTLRLRQLF ARHGITGSVI KTVPGGYRIT ATAETVDLMR FRELIGHTRD VTDPEVELAR LEEALALWSD PMLANVPSEA LHRDVVPRIS EERVRAIERV CDLKIGLGRD RSALVDLWTA VRAYPANERF SAQLASVLYR TGRQADALAE LRRIRDHLRH ELGIAPGPTL RALELTILRG EAPTSVGPVV RPTSVVTRHP VASGLVGRDA LAETIAERLR AGCPIVVLSG PPGVGKTALA QYVGQLVAPD FPGGRVEVTA AATGPVTSLH DAQQQLLTAV DQGHDVQVGR RLLLADDVVN GRQVRDLPAL LAPGDALLLT SRQSLSGPVA RLGGWLHRVE PLNPDDSLRL LRTALGPEHV DADPQSAAEI AALCDHLPLA LRIAATRILL RTRTDLAAAL EWLRVDPLSR LSLPGEPDMS LGHRFDEALS RTGDTLGAVF VRLATAAPTT ITVRQAAQLL DVDPATARDL LDELVDHSLL EEAADHYWIR VLLRRHAQVA ADRYAPHPGP PRHPHQAKGS MR
|
| |