Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_4244 |
Symbol | |
ID | 5060729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4809128 |
End bp | 4812106 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640476506 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001161050 |
Protein GI | 145596753 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0384973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTTG GGATCCTGGG CCCACTGCGG GTTGGTGGCG AGGCCACCGT TACCGCGGGT CGGGATCGGA CCATCCTCGC GATGCTGTTG CTGCAGGCTG GTCGGATCGT GCCAGCGGAG GAGTTGGTGG ACGCGGTCTG GGAGGAGCGG CCGCCGGCTA CTGCCCGCGC CCAGCTGCAG ACCTGTGTGT CCCGGCTGCG GCGCCGTTTC GGCACCCTCG GGCTTCCGCC GGAACTCATC GTCACCGACC CGGTCGGCTA CGGCGTCCGT ACCGCGTCGG GTGACCTCGA CGCCGAGGTG TTCGGCGACG AGGTGGCTGT CGCACGGGCG GCGGTCGCCA CTGGTCACCT GGTTGACGGT CGCCGGCACT TCCGTGCCGC GCTCGCGCTC TGGCGGGGAC CGGCGTTGGG CGGCGTCGCT GGTCGGAGCG TACGCGGGCG GGCCCGGACG CTCGACGAGC AGCGGTGCAC AGTGCTGGAG GAGTGTGTTG ACGTCGAGCT GCGGCTCGGC AACGCCGTCG ACCTGATCGA CGAACTGACC GAGGCGGTCG AGCGGCACCC GTTGCGGGAT CGCCTGCGGG GTCAACTGAT GCTGGCCCTC TCCGCCATCG GTCGCCAGGC CGATGCTCTC GTCGTCTACC GGGACGGGCG CCGTCGCTAC GTCGACGAAC TGGGTATTGA GCCCGGTGCG GAGCTGCAGG AGCTACACCA GCGGATGCTC GCCGGTGACC TGGTCGCCGT TGGGCCGCGA CGTACCACCA CCACCCCGGT TCGGGGCCTG CCGAGGGCGA TCAGCGATTT CACCGGCCGG CAGGAGATCG TCGCGAACCT GCTCCAGCAG CTCCACGAGG GGGACACCCA GGTTCACCTG ATCGACGGGA TGGCGGGTAG TGGCAAGACC ACCCTCGCCG TGCACCTCGC CACCCGTCTC GCGGACCGTT ACCCGGACGC GCAGCTCTTT ATCGACCTGC ATGGCCACAG CGAACGCACG CCGTTGACCC CGGTCGCCGC GGTGGCGACC CTGCTGCGGC AGCTCGGGAT ACCGGCGGAG CAGGTTCCGG TGGATGTGGA AGATCGGCTC GCCCTGTGGC GTACCGAGCT GTCCAACCGG CGGGCGTTGG TGTTGCTGGA CAACGCGGCG AGCGTTGACC AGGTCGCACC CCTGCTACCG ACCGGGCGTT CCTGTCTCAC GTTGGTAACC AGTCGACGAC GACTAGTTGG ATTGGATGCG GGCCGGCCCG TGTCGCTGCC CGTTCTCGAG CTGAACGAGG CTGTCACGCT GCTGGGGCGG GTAGCCGGTC CGGAGCGGGT GGCTGCCGAA CCGGAGGCGG CGGCGGAGGT GATTCGTCGC TGCGGCTTCC TGCCGTTGGC GATCCGGCTG GCGGGTGCCC GGCTGGCGCA CCGGCCGAGC TGGCGGATTG TTGACCTCGT CGAGCGGTTG GCCGGAGCCC GCGATCCGTT GGCCGAGTTT GCGGCCGGGC AGCGTTCGGT CGGAGGTGCC TTTGCCCTGT CGTACGCGCA GGTTACGCCG TCCGCGCAGC GGCTCTTTCG ACTGCTCGGC GTGCATCCCG CCAGCAACTT CGACAACGCG CTCGCCGCCG CGTTGGCTGA ACTGGCCCTA CCCGACACCC GGGACCTGCT CGACGAGCTG ATCGATGCGA ACCTGGTGGA GGAACCGAGG CCGGGCCGGT TCCGCCTACA CGACCTGGTC CAGGAGTACG CTCGTCGCCT GCTGGCTCAG CCGGCGTGGG CAGCAGAACG CTCCGCCGCC CTGGAACGAC TCCTCGATCA CCACCTGCAT ACGGCAGCGG CGATCGCCGA CCGGTTCGAG ACCGCCGCAA GTCTGTACCA GCTCACGAGG CCGATTGCGT CGCGGGCGGA CCTGGTGGCC AGCTGTGTCG AGCAGGGCCG GGCGTGGTGC GACGAGAACC GTACGGTGCT GACCGCCCTG CCCCGCATCG CTGAGTCAGA GGGTTTCCTG CAGCACTGCT GGCGGCTGGC CCGCGCCTGC TGGGGCTTCA ACTATGAGCG GGGCCAGCTG GACGACCTGA TCGAGACACA CACCGTCGGT CTCCGCGCCG CGCGACAGTT GGGGGACGAC GCGGCGATCG CCACGACGCT CAACTATCTC TCGGCCGCGT ACTATCGGCT GGGTCGATTC GCAGAGGCCA TCCGGCTTGT CGAAGAGGCG CTCATCCTGC GACGTCAGCT CGGGCTCACC TCGGCGGTTC GGACCACCCT GTACAACCTG GGCAGCCTGA AGGCGGCCAA CGGGGACTAC CGCCCCAGCA TGCGGCTCTT TCAGGAGGCG CTGGAGCTGG CACCTCCCGG CGAAGACCTC GCAGGCCTAG CGAATCTTCT GAACAACATC ACGCAAACCC TGCTGAACTG GGGGCGCTAC GGTGACGCGT TGCGATTCAG CCGGCAGCAC CTGTTGGTGA GTCGGCAAAT CAGCGACCTG CGGCAGTTGT CGCACGCTGT CGGGCATGTC GGATCGCTTC GTCACCGGCT TGGGGACAAT GAACCGGCCC GGCGGCTGCT GCTGATGGCG CTGCGCCTCA AACGCCGGAT CGGTCACCGG TACGGCGAAG GTGAGATGCT CAACGAACTC GGTGTGATGG AGCGGGAGGC TGGGCGGCCC GAAGAGGCGG CGGCGCTGCA CCGGGAAGCG CTGGTCACGA TGATCGACGC GGGCGACCAC GTCGGGCAGT GCGGCACGCG GAACCTGCTG GCTCGGGCGA TCGCCGATCA GGGTGACCGG CCGAGCGCCT TGGACCTCTT CCGCCGGGTG CTGCACGACG CCCAAAAGAT CAACCATCGG TACGAGCAGG CGCGGGCCCT GGACGGTATG GCCCACTGCC TGCGGTCGAC GGACGCGAGT GCGGCGCGAG CGTACTGGAC CCAGGCGCTT GCCCTGTTCC GGCAGCTCGA CGTCCCAGAG CGGCAGAAGG TGCGGCGGCT GCTGACCGAG CTGGATTGA
|
Protein sequence | MRFGILGPLR VGGEATVTAG RDRTILAMLL LQAGRIVPAE ELVDAVWEER PPATARAQLQ TCVSRLRRRF GTLGLPPELI VTDPVGYGVR TASGDLDAEV FGDEVAVARA AVATGHLVDG RRHFRAALAL WRGPALGGVA GRSVRGRART LDEQRCTVLE ECVDVELRLG NAVDLIDELT EAVERHPLRD RLRGQLMLAL SAIGRQADAL VVYRDGRRRY VDELGIEPGA ELQELHQRML AGDLVAVGPR RTTTTPVRGL PRAISDFTGR QEIVANLLQQ LHEGDTQVHL IDGMAGSGKT TLAVHLATRL ADRYPDAQLF IDLHGHSERT PLTPVAAVAT LLRQLGIPAE QVPVDVEDRL ALWRTELSNR RALVLLDNAA SVDQVAPLLP TGRSCLTLVT SRRRLVGLDA GRPVSLPVLE LNEAVTLLGR VAGPERVAAE PEAAAEVIRR CGFLPLAIRL AGARLAHRPS WRIVDLVERL AGARDPLAEF AAGQRSVGGA FALSYAQVTP SAQRLFRLLG VHPASNFDNA LAAALAELAL PDTRDLLDEL IDANLVEEPR PGRFRLHDLV QEYARRLLAQ PAWAAERSAA LERLLDHHLH TAAAIADRFE TAASLYQLTR PIASRADLVA SCVEQGRAWC DENRTVLTAL PRIAESEGFL QHCWRLARAC WGFNYERGQL DDLIETHTVG LRAARQLGDD AAIATTLNYL SAAYYRLGRF AEAIRLVEEA LILRRQLGLT SAVRTTLYNL GSLKAANGDY RPSMRLFQEA LELAPPGEDL AGLANLLNNI TQTLLNWGRY GDALRFSRQH LLVSRQISDL RQLSHAVGHV GSLRHRLGDN EPARRLLLMA LRLKRRIGHR YGEGEMLNEL GVMEREAGRP EEAAALHREA LVTMIDAGDH VGQCGTRNLL ARAIADQGDR PSALDLFRRV LHDAQKINHR YEQARALDGM AHCLRSTDAS AARAYWTQAL ALFRQLDVPE RQKVRRLLTE LD
|
| |