Gene Strop_2058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2058 
Symbol 
ID5058521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2331322 
End bp2333214 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content68% 
IMG OID640474323 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001158889 
Protein GI145594592 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.366352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGGA CCGAGATTGA CCCCCACATT CAGATCCTCG GTCCCGTCCA AGCGACGATC 
CGCGGTAGAG AGATCGACCT AGGACCCCCG AAACAGCGAG CAATTCTCGC CCTTTTGGCG
TTACGGGCAG GACGACACGT GCCACTCGAT GATGTGGTCG CCGCGCTCTG GACGGGACAG
TCACCGACCC GCGCCGCCAA CCTTGTGCAC ACCTACGTCG CCCGACTGCG CCATGTGCTG
GAGCCGGACA CACCACGCCA TCAACGAACA AACGTTATCG GCTCCGTATC CGGCGGGTAC
CGGCTGGCTG TCGGTGTGAG GCAGCTCGAC CTGCACGCAT TTCGTGCCGA AGTCCGGGAG
GCCACGGACC TTCACGAGCG CGGCGAGCCG ACTGGCGCCT TCGCCCGCTT CGCCGAGAGC
ATTGGCCGGT GGCGAGATCC CCAGGTCAGC GATCTGGCCG CTCTCCTGAT CGAGCAGGAC
GACATTCGCC CGCTGCGGCA AGAGTTCCTG GCCGCAGCGC TCAGCTATGT CGGCCTCGGT
CTGGAGCTGG GCCGGCCGGA GGCAGTCCTG CCGGTTGCCG AGCGACTGGC GCTGACGGAG
CCACTCAACG AGCGGGTGCA GGCCCGGCTG TTGCAGACGC TGGCCCGAAT CGGCCAGCGG
GCTCGGGCCA TTGAGCGGTA CGCCGAGGTA CGCGGGCGGC TTCGGCGTGA CCTCGGTGTA
GACCCGGGGC CCGACCTGTC TACGGCGTAC CGCGAGGTGT TGGACGCCAG GCTGACCTCG
TCGTACACCC ACCGAAAGGG CCCGGCTCAC GCCCCACCAT GGCGAGGGGC GGCACCCCTG
ATCGACGAGC TGACCGGGCG TACGGCCGAT CTTGCGGCAG TCAACGATCT GCTTGACGGA
TATCGCCTGG TAAGCCTCAC CGGTCCGGCC GGCGTAGGCA AGTCCGCGCT TGGCCTGACC
GTCGCCGAGC AGCAGCGGGA GCGGCACGCC GACGGGGTCG CGGTCGTCGA CCTGACGGAC
GTACGCACCG GGCTCGCTCT CGAACGGGCG GTGAACGCGG TGGTCGCCCC CCACCCGCCG
CGAACGACGG GACCACCAGT GCCCCTGGTA CGTCAGCTGG ATGGCCGGCG GCTGCTGCTC
GTGATCGACA ACGCCGAATT CGTCACCGAT GCGGCAGCCG ATTTGGCAGA CGAACTGCTC
CGCGGTTGTC CCGGCCTCAC CGTCCTGTTG ACCTCGCGCG AGTTGCTCGG GATGCGGTAC
GAGGCGGTGC ATCCGGTCCG GCCACTGCGC ACCGAGCCAG AGCCGGGGTC GTCGGCGCCT
CCACCCGCAC AACAACTGTT CACCCGACGG GCCATGCAGG TGCAGCCGAG CTTCCGACTC
GACGAGTCCA CCGTGGCAGG GGTCACCACG GTGTGCCGGG CCCTCGACGG CCTCCCCCTC
GCCATCGAGC TGGCCGCAGC GTCGCTGCGG ACGCAACGGT TGGAATCCCT GGTTGAGGTC
GTTGCCAATC CGCTGCATTG GCTCCGGCCG CCGCGGCGTG GGGTGCCGAG TCACCATCGG
TCGCTACACG CTGCCGTGCA TCGCAGTATC GAATTGCTCG ACGCGGCGGA GCTGCGCTGC
TTCACCGGTC TCGGGGCGAT GCCCGCCGAC TTCGACCTGG CGGCCGCCGC CGGTGTCGGC
GAGGCGCTGA TCGGTGATCT CCGCGCCGTG CAGGTGCTGT TGGATCGACT GGTCGACAAA
TCGGTGTTGG AAGTTCGGCA CGGTCCCGCC GGCCGGACTT ACCACATGCT CAGCACGGTT
CATGCATTGG CCCGACAGCT CTTTCAGGAG GGTGTTTGCA CAGCAGCACC CTCCTCGGCC
CGGTGCACAT GCGGCTGCCA CCCCGGTGAT TGA
 
Protein sequence
MSRTEIDPHI QILGPVQATI RGREIDLGPP KQRAILALLA LRAGRHVPLD DVVAALWTGQ 
SPTRAANLVH TYVARLRHVL EPDTPRHQRT NVIGSVSGGY RLAVGVRQLD LHAFRAEVRE
ATDLHERGEP TGAFARFAES IGRWRDPQVS DLAALLIEQD DIRPLRQEFL AAALSYVGLG
LELGRPEAVL PVAERLALTE PLNERVQARL LQTLARIGQR ARAIERYAEV RGRLRRDLGV
DPGPDLSTAY REVLDARLTS SYTHRKGPAH APPWRGAAPL IDELTGRTAD LAAVNDLLDG
YRLVSLTGPA GVGKSALGLT VAEQQRERHA DGVAVVDLTD VRTGLALERA VNAVVAPHPP
RTTGPPVPLV RQLDGRRLLL VIDNAEFVTD AAADLADELL RGCPGLTVLL TSRELLGMRY
EAVHPVRPLR TEPEPGSSAP PPAQQLFTRR AMQVQPSFRL DESTVAGVTT VCRALDGLPL
AIELAAASLR TQRLESLVEV VANPLHWLRP PRRGVPSHHR SLHAAVHRSI ELLDAAELRC
FTGLGAMPAD FDLAAAAGVG EALIGDLRAV QVLLDRLVDK SVLEVRHGPA GRTYHMLSTV
HALARQLFQE GVCTAAPSSA RCTCGCHPGD