Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_0492 |
Symbol | |
ID | 5056931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 567472 |
End bp | 570363 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640472765 |
Product | transcriptional regulator |
Protein accession | YP_001157355 |
Protein GI | 145593058 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGGC CGGCGGTACG CATACGGTTG CTCGGCGGGG TCGAGGTGGT GGACGGGAGC GGCACCGCCG TCGACATCGG GGCGGGTAAG TGCCGTGCGC TGCTGGCGGC TCTGGCGATG CAGCCCGGCA GGGCGATTCC GGACTGGCGG CTGATCGATC TGCTGTGGGG TGAGCAGCCA CCCCGGACTG CCGTCCGGAC CCTGCAGTCG TACATCGCGC GGCTACGGGG CGGTCTGGGT GCCGGGCGGA TTGTGCGCTC GGGTGCCTCG TACCGTCTCG ATGTGCCCGC CGATGCGGTC GATGTGATCC GGTTCGGCCG GCGGGTCGAG GCCGGCGACG TTGCCGGGGC GCTCGCCGAG TGGACCGGCG ATCCGCTGGC CGGGGTGCCG GTGCCGGGCC TGGCTGCGAC CGTGGACGGC CTGGTCGAGC AGTGGCTCGG CGCGGTCGAA GCCGATCTCA CCGCCCGGGT GGACGCCGAC GCCGCAGCGA CCGTGGGGCC GTTGACTGAG CTGAGCGCGC GGTATCCGTT CCGTGAAGGG CTCTGGGCGC TGCTGATGAC GGCGCTGTAC CGGGTGGGCC GGCAGGCCGA CGCGCTGACT GCGTACCGCA CTGCCCGTCA GCAGCTGGTT GAGCACCTGG GTGTGGAGCC CGGGCCGCGA TTGCGCCGCC TGGAGTCGGC GATTCTCGAC CAGGACCCCC GTATCGGCGG CGAGCGGCGG TCCGAGCCGG TCCACCGGCT GCCCCGGCGC GCCGTGCGAC TGATCGGCCG TGACGGTGAC CTCGACCTCA TCGGCCGGGC GTTGGACGAG AGCCCGGTGG TCACCCTGGT CGGGCCGGGC GGCATCGGCA AGACCGCGCT CGCCGTTGCG GCCGCCCAAC GCACCCGGCT CGAGCACGGC GCCTGGCTGG TTGACCTGAC CGAAATCACG ACCGACCAGG ACGTACCCCA AGCGGTCGCC GCGGCGGTGC GCGTCGAGGA AGGCCCAGGC CGGTCGCTGA GCGAGTCCAT TGTGCTGTCC CTGAGCTCCC TGCGGGCACT ACTTGTGCTC GACAACTGTG AACACGTCGC GGACGGCGCG GCGCGCCTGG CCCAGGCCGT CGCCGACGGC TGCCCGCAGG TGCGGGTGCT GGCCACCGCG CGGGAACCGC TCGGCCTCAG CCACGGCCAC GAACGGCTGG TGGCCGTGAC GCCGTTACCC GCGGCCGGGG CCGGGGCCGA TCTGTTCGCC GAACGTGCGA ACGCGCTGAC CGCCGCGTTC ACGACGGGCG CCGCGCGGGA GGTGATCGAG GAGATCTGTC GCTGCCTCGA CGGGCTTCCC CTCGCCATCG AGCTGGCCGC CGCCCAAACC GTCAGTCACA CCCCGCAGGA AATCCGCGAG CGTCTCGACG ATCAGCTCGG CTTGCTGGTC GGCGGACGGC GGACCGGGGC GGACCGGCAC CGCACCATGC GCGCCACGAT CCAGTGGTCC TACCGGCTCC TCACCGTGGC CGAACAGGAC CTGCTGCAAC GGCTGTCGGT GTTCGCCGGC CCAGTCGACC GGGCCGGAGC TGCGGCCGTT GCCGCCGGCA GCGGCCTGGA TGTCAACGAC GTGCTGCACA CTCTCGTACA GCGCTCGATG GTTACCGCCG AACCCGGCCG GTTCGGCCAG CAGTTCAGGC TGCTGGAACC AGTCCGCCAG TTCGCAGCCG AACACCTCGC CGCAGGATCG GCGGCCGCAC CCGCCCAGGC CGCGCACACC CGATACGTGC GGGAACGGGT GACCTCGCTG CACGACCAGC TCACCGGAAC CGCCGAAATC CAGGGGGTCG CCCATCTGGA CGAGCTGTGG CCCAACCTGC GCGTAGCGGT TGACCGGGCC TTTGCCTGCG GCGACTACCG CCTCGCCCAT GACCTGTTCC GGCCGATCGG CACCGAGGCC CTCCGGCGGC ACCGGCACGA AATCGGGCAG TGGGCGCAAC GCCTCCTTGA ACAGGCACCG GCTGAGGATC GGCCGCGGGT CGTGGCCGGC CTGATCGCCG CCGGACCCCG GTATCACCTC CGTCAGGACC CGGCCGGGTT CGACACCTTG GTCAGGCAGT ACGGCGATCC GGACGATCCG GTGGCCCGGC ACATGCGGGC CAACGTCCAC GACGACTACG CTGCTCAGAT TCACTCGGCG CCGCAGGCGC TGGCCGAGCT GCGCCGGCTC GGCGCCGACG ATCTCGCCGC GCATGTCGAG GTCGACCTCG GCGCAGCGCT GGTCTTTCAG GGACAGTATG CCCGCGGAGA CACCAAGCTC ACCGAACTCG CCGACCGGTT CCGCAGCGAC GGCCCGCCCA CCCTGCTGAA CTGGACGTTG ATGCTGCTCG GCTTCTCAGC CGCCTTCCAA GGTAGACGGG CCGCCGCGGA CCAGTTGTTC GACCAGGCTG TCGACGTGCC GCTGCCGGTA CGCACCCACT CGCCGAACCA GTCCGTGCGT GCCCAGGCGC TGTTCCGGCG CGGCGATCGT AGAGCCGCCT ACCAGCTACT CCGTGCCCAC GTCGAAGAAC TGCTCGACGC CGACAACATG CACGGTGCCT GCGTCGTATC GATCAGCTTC GTCACGATGA TGCCGGCAGT GGCACGCTTC GCCGAGGGCG CCCGGGTCCT GGCCTTCCTC GACACCGCCG GCCTGCTCGA CAATGTGGCC TGGGCGGCCA TGGTCGCCGA CGCCCGGGAC GAACTCGCCA CCTTCGCCCC TGCCTTTGAC GAGTCCACGA TCGCTGATCA ACGGCAGGCC CTCGTCGCGA TCGGCGACAC TCTCGACGAC CTTCTTGCCG AACAGACCAG CCGGCCAGAT GCTAACTCGA TCTTTAACCT GCTGGGCGGT GGCATCGACC CCAGACGATC ATGGGAACAG TCCCTTGATT GA
|
Protein sequence | MAGPAVRIRL LGGVEVVDGS GTAVDIGAGK CRALLAALAM QPGRAIPDWR LIDLLWGEQP PRTAVRTLQS YIARLRGGLG AGRIVRSGAS YRLDVPADAV DVIRFGRRVE AGDVAGALAE WTGDPLAGVP VPGLAATVDG LVEQWLGAVE ADLTARVDAD AAATVGPLTE LSARYPFREG LWALLMTALY RVGRQADALT AYRTARQQLV EHLGVEPGPR LRRLESAILD QDPRIGGERR SEPVHRLPRR AVRLIGRDGD LDLIGRALDE SPVVTLVGPG GIGKTALAVA AAQRTRLEHG AWLVDLTEIT TDQDVPQAVA AAVRVEEGPG RSLSESIVLS LSSLRALLVL DNCEHVADGA ARLAQAVADG CPQVRVLATA REPLGLSHGH ERLVAVTPLP AAGAGADLFA ERANALTAAF TTGAAREVIE EICRCLDGLP LAIELAAAQT VSHTPQEIRE RLDDQLGLLV GGRRTGADRH RTMRATIQWS YRLLTVAEQD LLQRLSVFAG PVDRAGAAAV AAGSGLDVND VLHTLVQRSM VTAEPGRFGQ QFRLLEPVRQ FAAEHLAAGS AAAPAQAAHT RYVRERVTSL HDQLTGTAEI QGVAHLDELW PNLRVAVDRA FACGDYRLAH DLFRPIGTEA LRRHRHEIGQ WAQRLLEQAP AEDRPRVVAG LIAAGPRYHL RQDPAGFDTL VRQYGDPDDP VARHMRANVH DDYAAQIHSA PQALAELRRL GADDLAAHVE VDLGAALVFQ GQYARGDTKL TELADRFRSD GPPTLLNWTL MLLGFSAAFQ GRRAAADQLF DQAVDVPLPV RTHSPNQSVR AQALFRRGDR RAAYQLLRAH VEELLDADNM HGACVVSISF VTMMPAVARF AEGARVLAFL DTAGLLDNVA WAAMVADARD ELATFAPAFD ESTIADQRQA LVAIGDTLDD LLAEQTSRPD ANSIFNLLGG GIDPRRSWEQ SLD
|
| |