Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_2058 |
Symbol | |
ID | 5058521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 2331322 |
End bp | 2333214 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640474323 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001158889 |
Protein GI | 145594592 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.366352 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGGA CCGAGATTGA CCCCCACATT CAGATCCTCG GTCCCGTCCA AGCGACGATC CGCGGTAGAG AGATCGACCT AGGACCCCCG AAACAGCGAG CAATTCTCGC CCTTTTGGCG TTACGGGCAG GACGACACGT GCCACTCGAT GATGTGGTCG CCGCGCTCTG GACGGGACAG TCACCGACCC GCGCCGCCAA CCTTGTGCAC ACCTACGTCG CCCGACTGCG CCATGTGCTG GAGCCGGACA CACCACGCCA TCAACGAACA AACGTTATCG GCTCCGTATC CGGCGGGTAC CGGCTGGCTG TCGGTGTGAG GCAGCTCGAC CTGCACGCAT TTCGTGCCGA AGTCCGGGAG GCCACGGACC TTCACGAGCG CGGCGAGCCG ACTGGCGCCT TCGCCCGCTT CGCCGAGAGC ATTGGCCGGT GGCGAGATCC CCAGGTCAGC GATCTGGCCG CTCTCCTGAT CGAGCAGGAC GACATTCGCC CGCTGCGGCA AGAGTTCCTG GCCGCAGCGC TCAGCTATGT CGGCCTCGGT CTGGAGCTGG GCCGGCCGGA GGCAGTCCTG CCGGTTGCCG AGCGACTGGC GCTGACGGAG CCACTCAACG AGCGGGTGCA GGCCCGGCTG TTGCAGACGC TGGCCCGAAT CGGCCAGCGG GCTCGGGCCA TTGAGCGGTA CGCCGAGGTA CGCGGGCGGC TTCGGCGTGA CCTCGGTGTA GACCCGGGGC CCGACCTGTC TACGGCGTAC CGCGAGGTGT TGGACGCCAG GCTGACCTCG TCGTACACCC ACCGAAAGGG CCCGGCTCAC GCCCCACCAT GGCGAGGGGC GGCACCCCTG ATCGACGAGC TGACCGGGCG TACGGCCGAT CTTGCGGCAG TCAACGATCT GCTTGACGGA TATCGCCTGG TAAGCCTCAC CGGTCCGGCC GGCGTAGGCA AGTCCGCGCT TGGCCTGACC GTCGCCGAGC AGCAGCGGGA GCGGCACGCC GACGGGGTCG CGGTCGTCGA CCTGACGGAC GTACGCACCG GGCTCGCTCT CGAACGGGCG GTGAACGCGG TGGTCGCCCC CCACCCGCCG CGAACGACGG GACCACCAGT GCCCCTGGTA CGTCAGCTGG ATGGCCGGCG GCTGCTGCTC GTGATCGACA ACGCCGAATT CGTCACCGAT GCGGCAGCCG ATTTGGCAGA CGAACTGCTC CGCGGTTGTC CCGGCCTCAC CGTCCTGTTG ACCTCGCGCG AGTTGCTCGG GATGCGGTAC GAGGCGGTGC ATCCGGTCCG GCCACTGCGC ACCGAGCCAG AGCCGGGGTC GTCGGCGCCT CCACCCGCAC AACAACTGTT CACCCGACGG GCCATGCAGG TGCAGCCGAG CTTCCGACTC GACGAGTCCA CCGTGGCAGG GGTCACCACG GTGTGCCGGG CCCTCGACGG CCTCCCCCTC GCCATCGAGC TGGCCGCAGC GTCGCTGCGG ACGCAACGGT TGGAATCCCT GGTTGAGGTC GTTGCCAATC CGCTGCATTG GCTCCGGCCG CCGCGGCGTG GGGTGCCGAG TCACCATCGG TCGCTACACG CTGCCGTGCA TCGCAGTATC GAATTGCTCG ACGCGGCGGA GCTGCGCTGC TTCACCGGTC TCGGGGCGAT GCCCGCCGAC TTCGACCTGG CGGCCGCCGC CGGTGTCGGC GAGGCGCTGA TCGGTGATCT CCGCGCCGTG CAGGTGCTGT TGGATCGACT GGTCGACAAA TCGGTGTTGG AAGTTCGGCA CGGTCCCGCC GGCCGGACTT ACCACATGCT CAGCACGGTT CATGCATTGG CCCGACAGCT CTTTCAGGAG GGTGTTTGCA CAGCAGCACC CTCCTCGGCC CGGTGCACAT GCGGCTGCCA CCCCGGTGAT TGA
|
Protein sequence | MSRTEIDPHI QILGPVQATI RGREIDLGPP KQRAILALLA LRAGRHVPLD DVVAALWTGQ SPTRAANLVH TYVARLRHVL EPDTPRHQRT NVIGSVSGGY RLAVGVRQLD LHAFRAEVRE ATDLHERGEP TGAFARFAES IGRWRDPQVS DLAALLIEQD DIRPLRQEFL AAALSYVGLG LELGRPEAVL PVAERLALTE PLNERVQARL LQTLARIGQR ARAIERYAEV RGRLRRDLGV DPGPDLSTAY REVLDARLTS SYTHRKGPAH APPWRGAAPL IDELTGRTAD LAAVNDLLDG YRLVSLTGPA GVGKSALGLT VAEQQRERHA DGVAVVDLTD VRTGLALERA VNAVVAPHPP RTTGPPVPLV RQLDGRRLLL VIDNAEFVTD AAADLADELL RGCPGLTVLL TSRELLGMRY EAVHPVRPLR TEPEPGSSAP PPAQQLFTRR AMQVQPSFRL DESTVAGVTT VCRALDGLPL AIELAAASLR TQRLESLVEV VANPLHWLRP PRRGVPSHHR SLHAAVHRSI ELLDAAELRC FTGLGAMPAD FDLAAAAGVG EALIGDLRAV QVLLDRLVDK SVLEVRHGPA GRTYHMLSTV HALARQLFQE GVCTAAPSSA RCTCGCHPGD
|
| |