Gene Strop_0492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0492 
Symbol 
ID5056931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp567472 
End bp570363 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content71% 
IMG OID640472765 
Producttranscriptional regulator 
Protein accessionYP_001157355 
Protein GI145593058 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGGC CGGCGGTACG CATACGGTTG CTCGGCGGGG TCGAGGTGGT GGACGGGAGC 
GGCACCGCCG TCGACATCGG GGCGGGTAAG TGCCGTGCGC TGCTGGCGGC TCTGGCGATG
CAGCCCGGCA GGGCGATTCC GGACTGGCGG CTGATCGATC TGCTGTGGGG TGAGCAGCCA
CCCCGGACTG CCGTCCGGAC CCTGCAGTCG TACATCGCGC GGCTACGGGG CGGTCTGGGT
GCCGGGCGGA TTGTGCGCTC GGGTGCCTCG TACCGTCTCG ATGTGCCCGC CGATGCGGTC
GATGTGATCC GGTTCGGCCG GCGGGTCGAG GCCGGCGACG TTGCCGGGGC GCTCGCCGAG
TGGACCGGCG ATCCGCTGGC CGGGGTGCCG GTGCCGGGCC TGGCTGCGAC CGTGGACGGC
CTGGTCGAGC AGTGGCTCGG CGCGGTCGAA GCCGATCTCA CCGCCCGGGT GGACGCCGAC
GCCGCAGCGA CCGTGGGGCC GTTGACTGAG CTGAGCGCGC GGTATCCGTT CCGTGAAGGG
CTCTGGGCGC TGCTGATGAC GGCGCTGTAC CGGGTGGGCC GGCAGGCCGA CGCGCTGACT
GCGTACCGCA CTGCCCGTCA GCAGCTGGTT GAGCACCTGG GTGTGGAGCC CGGGCCGCGA
TTGCGCCGCC TGGAGTCGGC GATTCTCGAC CAGGACCCCC GTATCGGCGG CGAGCGGCGG
TCCGAGCCGG TCCACCGGCT GCCCCGGCGC GCCGTGCGAC TGATCGGCCG TGACGGTGAC
CTCGACCTCA TCGGCCGGGC GTTGGACGAG AGCCCGGTGG TCACCCTGGT CGGGCCGGGC
GGCATCGGCA AGACCGCGCT CGCCGTTGCG GCCGCCCAAC GCACCCGGCT CGAGCACGGC
GCCTGGCTGG TTGACCTGAC CGAAATCACG ACCGACCAGG ACGTACCCCA AGCGGTCGCC
GCGGCGGTGC GCGTCGAGGA AGGCCCAGGC CGGTCGCTGA GCGAGTCCAT TGTGCTGTCC
CTGAGCTCCC TGCGGGCACT ACTTGTGCTC GACAACTGTG AACACGTCGC GGACGGCGCG
GCGCGCCTGG CCCAGGCCGT CGCCGACGGC TGCCCGCAGG TGCGGGTGCT GGCCACCGCG
CGGGAACCGC TCGGCCTCAG CCACGGCCAC GAACGGCTGG TGGCCGTGAC GCCGTTACCC
GCGGCCGGGG CCGGGGCCGA TCTGTTCGCC GAACGTGCGA ACGCGCTGAC CGCCGCGTTC
ACGACGGGCG CCGCGCGGGA GGTGATCGAG GAGATCTGTC GCTGCCTCGA CGGGCTTCCC
CTCGCCATCG AGCTGGCCGC CGCCCAAACC GTCAGTCACA CCCCGCAGGA AATCCGCGAG
CGTCTCGACG ATCAGCTCGG CTTGCTGGTC GGCGGACGGC GGACCGGGGC GGACCGGCAC
CGCACCATGC GCGCCACGAT CCAGTGGTCC TACCGGCTCC TCACCGTGGC CGAACAGGAC
CTGCTGCAAC GGCTGTCGGT GTTCGCCGGC CCAGTCGACC GGGCCGGAGC TGCGGCCGTT
GCCGCCGGCA GCGGCCTGGA TGTCAACGAC GTGCTGCACA CTCTCGTACA GCGCTCGATG
GTTACCGCCG AACCCGGCCG GTTCGGCCAG CAGTTCAGGC TGCTGGAACC AGTCCGCCAG
TTCGCAGCCG AACACCTCGC CGCAGGATCG GCGGCCGCAC CCGCCCAGGC CGCGCACACC
CGATACGTGC GGGAACGGGT GACCTCGCTG CACGACCAGC TCACCGGAAC CGCCGAAATC
CAGGGGGTCG CCCATCTGGA CGAGCTGTGG CCCAACCTGC GCGTAGCGGT TGACCGGGCC
TTTGCCTGCG GCGACTACCG CCTCGCCCAT GACCTGTTCC GGCCGATCGG CACCGAGGCC
CTCCGGCGGC ACCGGCACGA AATCGGGCAG TGGGCGCAAC GCCTCCTTGA ACAGGCACCG
GCTGAGGATC GGCCGCGGGT CGTGGCCGGC CTGATCGCCG CCGGACCCCG GTATCACCTC
CGTCAGGACC CGGCCGGGTT CGACACCTTG GTCAGGCAGT ACGGCGATCC GGACGATCCG
GTGGCCCGGC ACATGCGGGC CAACGTCCAC GACGACTACG CTGCTCAGAT TCACTCGGCG
CCGCAGGCGC TGGCCGAGCT GCGCCGGCTC GGCGCCGACG ATCTCGCCGC GCATGTCGAG
GTCGACCTCG GCGCAGCGCT GGTCTTTCAG GGACAGTATG CCCGCGGAGA CACCAAGCTC
ACCGAACTCG CCGACCGGTT CCGCAGCGAC GGCCCGCCCA CCCTGCTGAA CTGGACGTTG
ATGCTGCTCG GCTTCTCAGC CGCCTTCCAA GGTAGACGGG CCGCCGCGGA CCAGTTGTTC
GACCAGGCTG TCGACGTGCC GCTGCCGGTA CGCACCCACT CGCCGAACCA GTCCGTGCGT
GCCCAGGCGC TGTTCCGGCG CGGCGATCGT AGAGCCGCCT ACCAGCTACT CCGTGCCCAC
GTCGAAGAAC TGCTCGACGC CGACAACATG CACGGTGCCT GCGTCGTATC GATCAGCTTC
GTCACGATGA TGCCGGCAGT GGCACGCTTC GCCGAGGGCG CCCGGGTCCT GGCCTTCCTC
GACACCGCCG GCCTGCTCGA CAATGTGGCC TGGGCGGCCA TGGTCGCCGA CGCCCGGGAC
GAACTCGCCA CCTTCGCCCC TGCCTTTGAC GAGTCCACGA TCGCTGATCA ACGGCAGGCC
CTCGTCGCGA TCGGCGACAC TCTCGACGAC CTTCTTGCCG AACAGACCAG CCGGCCAGAT
GCTAACTCGA TCTTTAACCT GCTGGGCGGT GGCATCGACC CCAGACGATC ATGGGAACAG
TCCCTTGATT GA
 
Protein sequence
MAGPAVRIRL LGGVEVVDGS GTAVDIGAGK CRALLAALAM QPGRAIPDWR LIDLLWGEQP 
PRTAVRTLQS YIARLRGGLG AGRIVRSGAS YRLDVPADAV DVIRFGRRVE AGDVAGALAE
WTGDPLAGVP VPGLAATVDG LVEQWLGAVE ADLTARVDAD AAATVGPLTE LSARYPFREG
LWALLMTALY RVGRQADALT AYRTARQQLV EHLGVEPGPR LRRLESAILD QDPRIGGERR
SEPVHRLPRR AVRLIGRDGD LDLIGRALDE SPVVTLVGPG GIGKTALAVA AAQRTRLEHG
AWLVDLTEIT TDQDVPQAVA AAVRVEEGPG RSLSESIVLS LSSLRALLVL DNCEHVADGA
ARLAQAVADG CPQVRVLATA REPLGLSHGH ERLVAVTPLP AAGAGADLFA ERANALTAAF
TTGAAREVIE EICRCLDGLP LAIELAAAQT VSHTPQEIRE RLDDQLGLLV GGRRTGADRH
RTMRATIQWS YRLLTVAEQD LLQRLSVFAG PVDRAGAAAV AAGSGLDVND VLHTLVQRSM
VTAEPGRFGQ QFRLLEPVRQ FAAEHLAAGS AAAPAQAAHT RYVRERVTSL HDQLTGTAEI
QGVAHLDELW PNLRVAVDRA FACGDYRLAH DLFRPIGTEA LRRHRHEIGQ WAQRLLEQAP
AEDRPRVVAG LIAAGPRYHL RQDPAGFDTL VRQYGDPDDP VARHMRANVH DDYAAQIHSA
PQALAELRRL GADDLAAHVE VDLGAALVFQ GQYARGDTKL TELADRFRSD GPPTLLNWTL
MLLGFSAAFQ GRRAAADQLF DQAVDVPLPV RTHSPNQSVR AQALFRRGDR RAAYQLLRAH
VEELLDADNM HGACVVSISF VTMMPAVARF AEGARVLAFL DTAGLLDNVA WAAMVADARD
ELATFAPAFD ESTIADQRQA LVAIGDTLDD LLAEQTSRPD ANSIFNLLGG GIDPRRSWEQ
SLD