Gene Information |
Locus tag | Sare_4606 |
Symbol | |
ID | 5706627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5224603 |
End bp | 5227428 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274008 |
Product | transcriptional regulator |
Protein accession | YP_001539355 |
Protein GI | 159040102 |
COG category | [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family; [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
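The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length
    includes exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(5224603, 5227428)
print(length, protein_length(length))  # prints: 2826 941
```

Under these assumptions the result agrees with the reported gene length of 2826 bp and protein length of 941 aa.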
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0568023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.290469 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGGC CGGCGGTACG CATACGGTTG CTCGGCGGGG TCGAGGTGGT GGACGGCAAC GGCGCCGCCG TCGACATCGG GGCGGGTAAG TGCCGTGCGC TGCTCGCGGC CCTGGCGTTG CAGCCCGGTA CGGCGATTCC GGACTGGCGG CTAGTCGATC TGCTGTGGGG CGAGCAACCA CCCCGGACTG CCGTCCGAAC CCTGCAGTCG TACATCGCTC GGCTACGGGG CGGTCTGGGC GCCACGAGGA TTGTGCGCTC GGGTGCCGCG TACCGTCTCG ATGTGCCCGC CGATGCGGTC GATGTGATCC GATTCGGCCG GCGGGTCGAG GCCGGCGACC TCGCCGGGGC GCTCGCCGAG TGGACCGGCG AGCCACTGGC CGGGGTACCG GTGCCCGGCC TGGCTGCGGC CGTGGACGGC CTGGTCGAGC GGTGGCTCGG CACGGTCGAA GCCGATCTCG CCGCCCGGGT GGACGCTGAC GCCGCAGCGA CCGTGGGGCC GTTGACTGAG CTGAGCACGC GGTATCCGTT CCGTGAAGGC ATCTGGGCGC TGCTGATGAC GGCGCTGTAC CGGGTGGGCC GGCAGGCCGA CGCGCTGGCC GCATACCGCA CTGCCCGTCA ACGGCTGGTT GAGCACCTGG GCGTGGAGCC CGGGCCGCGC TTGCGCCGCC TGGAGGTGGC GATTCTCGGG CAGGACCACC GCATTGGCGG CGAGCAGCGG TCCGAGTCGA TCGACCGGTT GCCCCGACGC GCCGTGCGAC TGATCGGCCG CGACGGTGAC CTCGACCTCA TCGGCCGGGC ATTGGCCGAG AGCCCGGTGG TCACCCTGGT CGGGCCGGGC GGCATCGGCA AGACCGCGCT CGCTGTTGCG GCCGCCCAAC GCGTGCGGCT CGAGCACGGC GCCTGGCTGG TCGACCTGAC CGAGATCACG ACCGACCAGG ACGTCCCCCA AGCCGTCGCC GCGGCGGTGC GCGTCGAGGA GGGCCCAGGC CGATCGCTGA GTGAATCCAT TGTGCTAGCC CTGAGCTCAC TCCGGGCACT GCTGGTGCTC GACAACTGCG AACACGTCGT GGACGGCGCG GCACGCCTGG CCCAGGCCGT CGCCGACGGC TGCCCGCAGG TGCGGGTGCT GGCCACCGCG CGGGAACCGC TCGGCCTCAA CCACGGTCAC GAACGGCTGG TCCCCGTGAC GCCGTTGCCC GCGGCCGGGG CGGGCGCCGA CCTGTTCGCC GACCGTGCGA ACGCGCTGAC CGCCGCGTTC ACGATGGATG CCGCGCGGGA GGTGATCGAG GAGATCTGCC GCTGCCTCGA CGGGCTTCCC CTCGCCATCG AGCTGGCCGC CGCCCAAACC GTCAGTCACA CCCCGCCGGA AATCCGCGAG CGTCTCGACG ATCAGCTCGG TTTGCTGGTC GGCGGGCGGC GAACCGGGGC GGACCGGCAC CGCACCATGC GCGCCACGAT CCAGTGGTCC TACCGGCTGC TCACCGTGGC CGAACAGGAC CTGCTGCAAC GGCTGTCGGT GTTCACCGGC CCCGTCGACC GGGCCGGAGC TGCGGCCGTT GCCGCCGGCA GCGGCCTGGA TGTCAACGAC GTGCTGCACA CCCTCGTACA GCGCTCGATG GTTACCGCCG GACCCGGCCT GTTCGGCCAG CAGTTCAGGT TGCTGGAACC AATCCGCCAG TTCGCAGCCG AACACCTCAC CGCAGGACCG GCGGCCGCAC CCGCCCAGGC CGCGCACACC CGATACGTGC GGGAACGGGT GACCTCGCTA CGCGACCAGC TCACCGGACC CGCCGAAGTC 
CAGGGGGTCG CCCGTCTGGA CGAGCTGTGG CCCAACCTGC GCGTAGCGGT TGACCGGGCC TTTGCCTGCG GCGACTACCG CCTCGCCCAT GACCTGTTCC GGCCGATCGC CACCGAGGCC GCCCGGCGGC ACCGGCACGA AGTCGGGCAG TGGGCCCAAC GCCTCCTCGA ACAGGCACCG CCCGAGGATC GGCCGCGGAT CGTGACCGGC CTGATCGCTG CCGCATCCCG CTATCACGTC TGTCAGGACC CGGCCGGGTT CGACACCTTG ATCAGGCAGC ACTGCGAACC GGACGACCCG GTAGCCCGGC ACATGCGGGC CAACGTCCGC GACGACTACG CTACCCAGAT ACACACGGCG CCGCAGGCAC TGGCCGAGCT GCGCCGGCTC GGCGCCGACG ACCTCGCCGC GCACGTCGAG GTCGACCTCG GCGCGGCGTT GGTCTTTCAG GGACAGTACG CACGCGGAGA AGCCCAGCTC ACCCAGCTCG TCGACCGGTT CCGCAGCCAC GGCCCGCCCA CCCTGCTGAA CTGGACGCTG ACGCTACTCG GCTTCTCAGC CGCCTTCCAA GGTCGACGGG CCGCCGCGGA CACGTTGTTC GACCAGGCGA TCGACGTGCC GCTGCCGGCA CGCACCCACT CGCCGAACCA GTCCGTGCGT GCCCTGGCGT TGTTCCGGCG CGGCGACCGC AGAGCCGCCT ATCAGCTGCT CCGTGCCCAC GTCGAAGAAC TGCTCGACGC GGACAACATG CACGGTGCCT GCGTCGTGTC GGTCAACTTC GTCACGATGA TGCCGGCAGT GGCACGCTTC GCCGACGGGG CCCGGATCCT GGCCTTCCTC GACACCACCG GCGCGCTCGA CAACGCCGCC TGGGCGGCCA TGGTCGCCGA CGCCAGGGAC AAACTCGCCA CCTTCGCCCC CATCCCGAAC GGATCCATGA TCCTCGACCA GCGGCAGGCC CTCGCCACGA TCGGCAAGAC TCTCGACGGC CTTCTTATCG AACAGGCCAG CCCGGTCAGA TGCTAA
|
Protein sequence | MAGPAVRIRL LGGVEVVDGN GAAVDIGAGK CRALLAALAL QPGTAIPDWR LVDLLWGEQP PRTAVRTLQS YIARLRGGLG ATRIVRSGAA YRLDVPADAV DVIRFGRRVE AGDLAGALAE WTGEPLAGVP VPGLAAAVDG LVERWLGTVE ADLAARVDAD AAATVGPLTE LSTRYPFREG IWALLMTALY RVGRQADALA AYRTARQRLV EHLGVEPGPR LRRLEVAILG QDHRIGGEQR SESIDRLPRR AVRLIGRDGD LDLIGRALAE SPVVTLVGPG GIGKTALAVA AAQRVRLEHG AWLVDLTEIT TDQDVPQAVA AAVRVEEGPG RSLSESIVLA LSSLRALLVL DNCEHVVDGA ARLAQAVADG CPQVRVLATA REPLGLNHGH ERLVPVTPLP AAGAGADLFA DRANALTAAF TMDAAREVIE EICRCLDGLP LAIELAAAQT VSHTPPEIRE RLDDQLGLLV GGRRTGADRH RTMRATIQWS YRLLTVAEQD LLQRLSVFTG PVDRAGAAAV AAGSGLDVND VLHTLVQRSM VTAGPGLFGQ QFRLLEPIRQ FAAEHLTAGP AAAPAQAAHT RYVRERVTSL RDQLTGPAEV QGVARLDELW PNLRVAVDRA FACGDYRLAH DLFRPIATEA ARRHRHEVGQ WAQRLLEQAP PEDRPRIVTG LIAAASRYHV CQDPAGFDTL IRQHCEPDDP VARHMRANVR DDYATQIHTA PQALAELRRL GADDLAAHVE VDLGAALVFQ GQYARGEAQL TQLVDRFRSH GPPTLLNWTL TLLGFSAAFQ GRRAAADTLF DQAIDVPLPA RTHSPNQSVR ALALFRRGDR RAAYQLLRAH VEELLDADNM HGACVVSVNF VTMMPAVARF ADGARILAFL DTTGALDNAA WAAMVADARD KLATFAPIPN GSMILDQRQA LATIGKTLDG LLIEQASPVR C
|
| |
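The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch rather than the database's own method: it assumes the sequence is pasted as text in space-separated blocks as shown above, and it uses the standard codon assignments, which translation table 11 shares for elongation codons. Alternative start codons permitted by table 11 are not handled (this gene starts with ATG), and any codon containing a character other than A, C, G, or T will raise a `KeyError`. The helper names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGGCAGGGC CGGCGGTACG C"))  # prints: MAGPAVR
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing its spaces) is one way to check that the two fields agree.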