Gene Strop_2437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2437 
Symbol 
ID5058900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2738466 
End bp2740259 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content71% 
IMG OID640474696 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001159262 
Protein GI145594965 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0868593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.755858 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCGG TCGCGGTGAG CCCATCCGGC GGTGACCAGC GAGCCCGCCC GGCGGTGTTC 
CGGGTGCTCG GCCCGCTGAC CCTCAGCAGC GGCGGGGACA CGCTGGTTCT GCCGCCGTCG
AAGGTGACCT CGTTGCTTGC CGTGCTGCTG TTGCACCCCG ACGAGGTGAT CTCGGTCAGT
ACCCTGCAGC AGGCTGTCTG GGGTGACGCG CAGCCTGCCT CGGCCAAGGC CGCGTTGCAG
ACCTGCACCC TGCGGCTTCG GCAACTGTTT CACCGGCACG GCATCACCGG CAGCGTGATC
AAGACAGTGC CGGGTGGCTA CCGGATCACC GCTACCGCCG CCACCGTCGA CCTGATGCGC
TTCCGCGAGT TGATCGGCCG TACCCGCGAC GTGCCGGATC CAGAGGCGGA GCTGGCGAGG
CTGGAGGAGG CGCTCGCGCT GTGGAGCGAC CCAATGCTGG CCAATGTGCC ATCAGAGGCG
CTGCACCGGG ACGTGGTACC CCGGATCAGC GAGGAGCGGG TCCGGGCGAT CGAGCGGGTG
TGCGACCTGA AGATCAGCCT CGGTCGGGAC CGGTCCGCGC TGGTGGACCT CTGGACCGCC
AACCGCGCGT ACCCCGCGAA CGAGCGCTTC TCCGCACAGC TCGCCTCGGT GCTCTACCGC
ACCGGTCGGC AGGCGGACGC CCTGGCCGAA CTGCGCCGGA TCCGGGACTA TCTTCGCCAC
GAACTCGGCA TCGCTCCGGG CCCGACCCTG CGGGAGTTGG AGCTGACCAT TCTGCGGGGC
GAAGCGTCCA CTCCGGTCGC TCCAGTAGGG AGACCCGCCA CCGTAGTGTC CCGGCACCCG
GTCGCGTCCA GCCTCATCGG TCGGGACGCG CTCGGCGAGA CCGTCGCCGA GCGCCTCCGC
GCGGATTGCC CGATCGTGGT GCTCACCGGC CCACCCGGCG TCGGCAAAAC CGCGCTGGCG
CAGCACGTCG GGCAGCTCGT CGCCCCCCAC TTCCCCGGCG GTCAACTCCG GGTGGCGGCG
GACACCGTGT CGCACGACGC CCAACGGCGG CTGATCACCC CGGTTGACCA GGGCCACGGT
GCCCAGGTGG GCCGACGGCT GCTGTTCGCC GACGACGTGG TCAACGGCAG TCAGGTACGA
GCCCTGCCGG CCCTTTTGGC ACCCGGTGAC GCGCTGCTGC TGACCAGTCG CCAAAGCCTG
TCTGGTCCGG TTACCCGACT TGGCGGCTGG TTGCACCGGG TGGAGCCGCT CGAGCCTGCC
GACTCGCTCC AGTTGCTCCG TTCCGCGCTG GGACCCGAGC AGGTCGATGC CGACCCGCAG
AGCGCCGCGG AGATCGCGGC GCTCTGCGAC CACCTGCCAC TCGCGCTGCG CATCGCCGCC
ACCCGCATCC TGCTGCGCCC CCGGACGGAA CTCGCGGCGG CGGCGGAGTG GCTGCGTGCG
GACCCGCTGA GCCGGCTGAG TCTGCCCGGC GAACCGGACA TGTCACTCGG CCACCGCTTC
GACGAGGCCC TGTCCCGAGC CGGCGAAACG TTGGAGGCGG CCTTCGTCAG GCTGGCCACC
GCGGCCCCCG CCGCCATCAC CGCCGCACCG GCCGCACAGC TGCTCGACGT TGACCCGGCC
ACGGCTCGCG ACCTGCTCGA CGGGCTGGTC GACCACAGCC TGGTCGAGGA GGCCGCGGAT
CACTACTGGA TACGAGCCTT GCTGCGGCGA CACGCCCAAC TCGCGGCCGA ACGGCACGCC
CCCCACCCCG ACCCACCGCG ACGGCCGCAC CGAGCGAAGG GATCCATGCG ATGA
 
Protein sequence
MESVAVSPSG GDQRARPAVF RVLGPLTLSS GGDTLVLPPS KVTSLLAVLL LHPDEVISVS 
TLQQAVWGDA QPASAKAALQ TCTLRLRQLF HRHGITGSVI KTVPGGYRIT ATAATVDLMR
FRELIGRTRD VPDPEAELAR LEEALALWSD PMLANVPSEA LHRDVVPRIS EERVRAIERV
CDLKISLGRD RSALVDLWTA NRAYPANERF SAQLASVLYR TGRQADALAE LRRIRDYLRH
ELGIAPGPTL RELELTILRG EASTPVAPVG RPATVVSRHP VASSLIGRDA LGETVAERLR
ADCPIVVLTG PPGVGKTALA QHVGQLVAPH FPGGQLRVAA DTVSHDAQRR LITPVDQGHG
AQVGRRLLFA DDVVNGSQVR ALPALLAPGD ALLLTSRQSL SGPVTRLGGW LHRVEPLEPA
DSLQLLRSAL GPEQVDADPQ SAAEIAALCD HLPLALRIAA TRILLRPRTE LAAAAEWLRA
DPLSRLSLPG EPDMSLGHRF DEALSRAGET LEAAFVRLAT AAPAAITAAP AAQLLDVDPA
TARDLLDGLV DHSLVEEAAD HYWIRALLRR HAQLAAERHA PHPDPPRRPH RAKGSMR