Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4854 |
Symbol | |
ID | 5707633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5508643 |
End bp | 5511003 |
Gene Length | 2361 bp |
Protein Length | 786 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641274250 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001539595 |
Protein GI | 159040342 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0558013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCGACGA CGATCACCAA TCGCACTCTC CAACCGCTGG CCATGCCGGC AGAGCACGCA GGCATCACCA TTCGCTTACT GGGACCACTA CAGGCCTGGC AGGACGACAC CGAACTCGTG CTCGGCTCTG GCAACCGGAC CGCCGTCCTC AGCTTCCTCG CACTGCACGC GAACCATGCC GTGACCCGGG AACAGCTGAT CGCAGCACTG TGGGGTGACG ACCCGCCGGC CAGTGCGACC GGAAACCTCT ACACGTACAT TTCCTCGTTA CGCCAGATCC TGGATCCGTG TCGACACCGG TGGTCGGCTG GGCAGGTGCT CACCTCCGGC GGCGGCACCT ATCGCCTGCG GGTACGGAAG CAGGACGTCG ACGCCTTCCG GTTCGAGGCA CTGCGCGAGG AGAGCAAACG GCACCGGACC ATCGGCGACA GCAGCACGGA ACTGGCCTCA CTCACCGCCG CACTGCGGAT GTGGCAGGGC ACGGCGCTCG CCGGCGTACC CGGCCCCTTT GCCGAGGCCC AGCGGCAACG CCTGGCGGAG CTCCGGTTGA CCACCGCGGA ACGGCACGCC ACGCTGCTCG TCGAAACCGG CCGTCACGAC GAGGCCATCT CGACGCTCCG CACGCTGATC GAGGCTCATC CAACTCGAGA GAACCTCGCT CCCCTGCTGA CAGCGGCTCT CCACGCCGCG GGCCGAGCGG ATGTGCCACG TCGTCCGCGC CCTCCCGTAC CGACGGAGCC CGCGGGTCAC GTACCAGCGT CACCACGACG AACCGGCACC AGGTCGCCGG TACGAGCAGC CCCTGGATCG CTGGCCGGCC GCGAGACAGA AATCCGCTGG CTACGTCGAG CCGTCGTCGA CACGACCCGC GGCCACGGGC GCAGCATCCG CGTCGAGGGC TTCACGGGAA TGGGCAAGTC GGCACTACTC GCCGTGGCAC TGCCGAGTAC GGCACCGTCC ACCTGCCGGA TCGGTTGGGC CGTCGGCAGG GAGTTGTCAC AGCGCGTACC GCTGGGACTC CTGCTGGAGT GCATGGAGTC GGCGTTGGCT GGTGAGCCCA GCAGCGAGCT GGTCCGGCAG ATCTCGCTGG CCACGAGCCC GTCCATGGGC AACTCGGCCA CTGCGCCGGT TGCCCGGGTG GTCAGCCTGG TACGACAGGT CGCCCAGCAA GCTCCGCTGG TCCTCGTGGT GGACAACCTG CATTGGGCTG ATCCACACAC CGTGGAGGCC TGGACGGCCC TACACGAGGT GACGACGCAA CTGCCGCTCC TTCTCATCGC TACCAGTTGG CCCGGCACCG CCACGCTCGG CACCGCCACG CTCGGCACCG CCACGCTCGG CACCGCCGAG TTCGGCACCG CCGCCGACGA GGTCCACCAG CTGGCCCCCC TGAACCGGGC GGCCAGCAGC CACCTGGTGC GTACGGCTGC TCCGGAACCG CCGGAATCCC CGGAGCTGGA CCATCTCGTG GCGGCCGCCG GGGGTAACCC GTGCTACCTG TGGCACCTGG CCACAGCGGG TGCCGACCTC AACGGCCAGC TTCCCGCAAG CCTCGTCCGG GTGGTCCACA CCCACCTCGC CCCATTCACC CAGCCGACCC GGGAGGTACT GCGCGCGGCG GCCTTCCTCA CTGCTGGACC TGCCGCGCAC GCCGGCCCCG GCTGTTCGCT CAGCGAATTG GCCGTGGTGA CCGAGCGTCC CACCGCCGAG TTGGTCGAGC TGCTGACGCC GGCCGCACGG GCCGGCGTCG TCGCCATCAC CGGTGACGGG CTGACATTCC GCCACCCGGT CGTCCCGCGA GTGCTGCACG AGGGAACGGC CAGCGCGCTA CGAGTCCTGC TGCACCGATC CTTCGCCGAG CGGATAGCAG GCGCCGGCGG GCCACCCGAG CGGGTCGTCA CCCAGTTACT CGCTGACGCG GTGCCGTTGG ACGTCACGCT GACCCGCTGG CTCACTACTC ACGTCGAGCA GCTCACCGCC CGCGCGCCCC GGATCACGGT GACCATCCTG CAACGGGCCC ACGCCCAGTG CACGATGCCG CCCGCCGACC GCATGACCCT CACCATCTGG CTGGCCCGGC TGCTCCTGCG ATTGGGTCGC AACTGCGCCG CCGAGGCCGG GTGGGTTGCC TCCCGCACCG ATGACCTCGA CGTGGAAGGC GAGATGCGGT GGATGGTCGC CCTGACCTAT GAGAAGCGCG GCGAGCACCA GACCGCCGCG GACATCGCCC GTTCGGTGCT GCACGATCGC CGTGTCCCGG CTCCGTGGCT CGATCGGTTT CGCCTCATGC AGGCGCGGCT GCGCCAACAC CTGGCCGACC GGCACGACGA CTGTCACCCC GAAATCGTCA CGTGTACGTA A
|
Protein sequence | MSTTITNRTL QPLAMPAEHA GITIRLLGPL QAWQDDTELV LGSGNRTAVL SFLALHANHA VTREQLIAAL WGDDPPASAT GNLYTYISSL RQILDPCRHR WSAGQVLTSG GGTYRLRVRK QDVDAFRFEA LREESKRHRT IGDSSTELAS LTAALRMWQG TALAGVPGPF AEAQRQRLAE LRLTTAERHA TLLVETGRHD EAISTLRTLI EAHPTRENLA PLLTAALHAA GRADVPRRPR PPVPTEPAGH VPASPRRTGT RSPVRAAPGS LAGRETEIRW LRRAVVDTTR GHGRSIRVEG FTGMGKSALL AVALPSTAPS TCRIGWAVGR ELSQRVPLGL LLECMESALA GEPSSELVRQ ISLATSPSMG NSATAPVARV VSLVRQVAQQ APLVLVVDNL HWADPHTVEA WTALHEVTTQ LPLLLIATSW PGTATLGTAT LGTATLGTAE FGTAADEVHQ LAPLNRAASS HLVRTAAPEP PESPELDHLV AAAGGNPCYL WHLATAGADL NGQLPASLVR VVHTHLAPFT QPTREVLRAA AFLTAGPAAH AGPGCSLSEL AVVTERPTAE LVELLTPAAR AGVVAITGDG LTFRHPVVPR VLHEGTASAL RVLLHRSFAE RIAGAGGPPE RVVTQLLADA VPLDVTLTRW LTTHVEQLTA RAPRITVTIL QRAHAQCTMP PADRMTLTIW LARLLLRLGR NCAAEAGWVA SRTDDLDVEG EMRWMVALTY EKRGEHQTAA DIARSVLHDR RVPAPWLDRF RLMQARLRQH LADRHDDCHP EIVTCT
|
| |