Gene Strop_4244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4244 
Symbol 
ID5060729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4809128 
End bp4812106 
Gene Length2979 bp 
Protein Length992 aa 
Translation table11 
GC content69% 
IMG OID640476506 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001161050 
Protein GI145596753 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0384973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTTG GGATCCTGGG CCCACTGCGG GTTGGTGGCG AGGCCACCGT TACCGCGGGT 
CGGGATCGGA CCATCCTCGC GATGCTGTTG CTGCAGGCTG GTCGGATCGT GCCAGCGGAG
GAGTTGGTGG ACGCGGTCTG GGAGGAGCGG CCGCCGGCTA CTGCCCGCGC CCAGCTGCAG
ACCTGTGTGT CCCGGCTGCG GCGCCGTTTC GGCACCCTCG GGCTTCCGCC GGAACTCATC
GTCACCGACC CGGTCGGCTA CGGCGTCCGT ACCGCGTCGG GTGACCTCGA CGCCGAGGTG
TTCGGCGACG AGGTGGCTGT CGCACGGGCG GCGGTCGCCA CTGGTCACCT GGTTGACGGT
CGCCGGCACT TCCGTGCCGC GCTCGCGCTC TGGCGGGGAC CGGCGTTGGG CGGCGTCGCT
GGTCGGAGCG TACGCGGGCG GGCCCGGACG CTCGACGAGC AGCGGTGCAC AGTGCTGGAG
GAGTGTGTTG ACGTCGAGCT GCGGCTCGGC AACGCCGTCG ACCTGATCGA CGAACTGACC
GAGGCGGTCG AGCGGCACCC GTTGCGGGAT CGCCTGCGGG GTCAACTGAT GCTGGCCCTC
TCCGCCATCG GTCGCCAGGC CGATGCTCTC GTCGTCTACC GGGACGGGCG CCGTCGCTAC
GTCGACGAAC TGGGTATTGA GCCCGGTGCG GAGCTGCAGG AGCTACACCA GCGGATGCTC
GCCGGTGACC TGGTCGCCGT TGGGCCGCGA CGTACCACCA CCACCCCGGT TCGGGGCCTG
CCGAGGGCGA TCAGCGATTT CACCGGCCGG CAGGAGATCG TCGCGAACCT GCTCCAGCAG
CTCCACGAGG GGGACACCCA GGTTCACCTG ATCGACGGGA TGGCGGGTAG TGGCAAGACC
ACCCTCGCCG TGCACCTCGC CACCCGTCTC GCGGACCGTT ACCCGGACGC GCAGCTCTTT
ATCGACCTGC ATGGCCACAG CGAACGCACG CCGTTGACCC CGGTCGCCGC GGTGGCGACC
CTGCTGCGGC AGCTCGGGAT ACCGGCGGAG CAGGTTCCGG TGGATGTGGA AGATCGGCTC
GCCCTGTGGC GTACCGAGCT GTCCAACCGG CGGGCGTTGG TGTTGCTGGA CAACGCGGCG
AGCGTTGACC AGGTCGCACC CCTGCTACCG ACCGGGCGTT CCTGTCTCAC GTTGGTAACC
AGTCGACGAC GACTAGTTGG ATTGGATGCG GGCCGGCCCG TGTCGCTGCC CGTTCTCGAG
CTGAACGAGG CTGTCACGCT GCTGGGGCGG GTAGCCGGTC CGGAGCGGGT GGCTGCCGAA
CCGGAGGCGG CGGCGGAGGT GATTCGTCGC TGCGGCTTCC TGCCGTTGGC GATCCGGCTG
GCGGGTGCCC GGCTGGCGCA CCGGCCGAGC TGGCGGATTG TTGACCTCGT CGAGCGGTTG
GCCGGAGCCC GCGATCCGTT GGCCGAGTTT GCGGCCGGGC AGCGTTCGGT CGGAGGTGCC
TTTGCCCTGT CGTACGCGCA GGTTACGCCG TCCGCGCAGC GGCTCTTTCG ACTGCTCGGC
GTGCATCCCG CCAGCAACTT CGACAACGCG CTCGCCGCCG CGTTGGCTGA ACTGGCCCTA
CCCGACACCC GGGACCTGCT CGACGAGCTG ATCGATGCGA ACCTGGTGGA GGAACCGAGG
CCGGGCCGGT TCCGCCTACA CGACCTGGTC CAGGAGTACG CTCGTCGCCT GCTGGCTCAG
CCGGCGTGGG CAGCAGAACG CTCCGCCGCC CTGGAACGAC TCCTCGATCA CCACCTGCAT
ACGGCAGCGG CGATCGCCGA CCGGTTCGAG ACCGCCGCAA GTCTGTACCA GCTCACGAGG
CCGATTGCGT CGCGGGCGGA CCTGGTGGCC AGCTGTGTCG AGCAGGGCCG GGCGTGGTGC
GACGAGAACC GTACGGTGCT GACCGCCCTG CCCCGCATCG CTGAGTCAGA GGGTTTCCTG
CAGCACTGCT GGCGGCTGGC CCGCGCCTGC TGGGGCTTCA ACTATGAGCG GGGCCAGCTG
GACGACCTGA TCGAGACACA CACCGTCGGT CTCCGCGCCG CGCGACAGTT GGGGGACGAC
GCGGCGATCG CCACGACGCT CAACTATCTC TCGGCCGCGT ACTATCGGCT GGGTCGATTC
GCAGAGGCCA TCCGGCTTGT CGAAGAGGCG CTCATCCTGC GACGTCAGCT CGGGCTCACC
TCGGCGGTTC GGACCACCCT GTACAACCTG GGCAGCCTGA AGGCGGCCAA CGGGGACTAC
CGCCCCAGCA TGCGGCTCTT TCAGGAGGCG CTGGAGCTGG CACCTCCCGG CGAAGACCTC
GCAGGCCTAG CGAATCTTCT GAACAACATC ACGCAAACCC TGCTGAACTG GGGGCGCTAC
GGTGACGCGT TGCGATTCAG CCGGCAGCAC CTGTTGGTGA GTCGGCAAAT CAGCGACCTG
CGGCAGTTGT CGCACGCTGT CGGGCATGTC GGATCGCTTC GTCACCGGCT TGGGGACAAT
GAACCGGCCC GGCGGCTGCT GCTGATGGCG CTGCGCCTCA AACGCCGGAT CGGTCACCGG
TACGGCGAAG GTGAGATGCT CAACGAACTC GGTGTGATGG AGCGGGAGGC TGGGCGGCCC
GAAGAGGCGG CGGCGCTGCA CCGGGAAGCG CTGGTCACGA TGATCGACGC GGGCGACCAC
GTCGGGCAGT GCGGCACGCG GAACCTGCTG GCTCGGGCGA TCGCCGATCA GGGTGACCGG
CCGAGCGCCT TGGACCTCTT CCGCCGGGTG CTGCACGACG CCCAAAAGAT CAACCATCGG
TACGAGCAGG CGCGGGCCCT GGACGGTATG GCCCACTGCC TGCGGTCGAC GGACGCGAGT
GCGGCGCGAG CGTACTGGAC CCAGGCGCTT GCCCTGTTCC GGCAGCTCGA CGTCCCAGAG
CGGCAGAAGG TGCGGCGGCT GCTGACCGAG CTGGATTGA
 
Protein sequence
MRFGILGPLR VGGEATVTAG RDRTILAMLL LQAGRIVPAE ELVDAVWEER PPATARAQLQ 
TCVSRLRRRF GTLGLPPELI VTDPVGYGVR TASGDLDAEV FGDEVAVARA AVATGHLVDG
RRHFRAALAL WRGPALGGVA GRSVRGRART LDEQRCTVLE ECVDVELRLG NAVDLIDELT
EAVERHPLRD RLRGQLMLAL SAIGRQADAL VVYRDGRRRY VDELGIEPGA ELQELHQRML
AGDLVAVGPR RTTTTPVRGL PRAISDFTGR QEIVANLLQQ LHEGDTQVHL IDGMAGSGKT
TLAVHLATRL ADRYPDAQLF IDLHGHSERT PLTPVAAVAT LLRQLGIPAE QVPVDVEDRL
ALWRTELSNR RALVLLDNAA SVDQVAPLLP TGRSCLTLVT SRRRLVGLDA GRPVSLPVLE
LNEAVTLLGR VAGPERVAAE PEAAAEVIRR CGFLPLAIRL AGARLAHRPS WRIVDLVERL
AGARDPLAEF AAGQRSVGGA FALSYAQVTP SAQRLFRLLG VHPASNFDNA LAAALAELAL
PDTRDLLDEL IDANLVEEPR PGRFRLHDLV QEYARRLLAQ PAWAAERSAA LERLLDHHLH
TAAAIADRFE TAASLYQLTR PIASRADLVA SCVEQGRAWC DENRTVLTAL PRIAESEGFL
QHCWRLARAC WGFNYERGQL DDLIETHTVG LRAARQLGDD AAIATTLNYL SAAYYRLGRF
AEAIRLVEEA LILRRQLGLT SAVRTTLYNL GSLKAANGDY RPSMRLFQEA LELAPPGEDL
AGLANLLNNI TQTLLNWGRY GDALRFSRQH LLVSRQISDL RQLSHAVGHV GSLRHRLGDN
EPARRLLLMA LRLKRRIGHR YGEGEMLNEL GVMEREAGRP EEAAALHREA LVTMIDAGDH
VGQCGTRNLL ARAIADQGDR PSALDLFRRV LHDAQKINHR YEQARALDGM AHCLRSTDAS
AARAYWTQAL ALFRQLDVPE RQKVRRLLTE LD