Gene Information |
Locus tag | Sare_3671 |
Symbol | |
ID | 5707193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4231323 |
End bp | 4234469 |
Gene Length | 3147 bp |
Protein Length | 1048 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273093 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001538457 |
Protein GI | 159039204 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| |
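The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Sare_3671 record above.
length = gene_length_bp(4231323, 4234469)
print(length)                     # 3147
print(protein_length_aa(length))  # 1048
```

Both results match the Gene Length and Protein Length rows, which supports the inclusive-coordinate assumption for this record.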
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0046189 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGTCGG GTGTTGAGGT CAATATTCTC GGCGGCGTCG AAGTGATCGG ACCGCGCGGC CGGACCGAGT TGGTGGGACG ACGGCATCGT GCCTTGGCGG CTGCCCTGGC CCTGCGGCCC GGCGCGGTGC TGCCGCTGTG GCGGCTCGTC GAGGCGCTGT GGGGGGAGCG TCCACCACGA ACCGCTGTCC GGTCGCTGCA CAGTCACGTG GCGCGGCTGC GCCTCGCGCT CGACGCGAGC GGGTTGGGCG GAGTGCTGCA GACCCGGGAG CCGGGCTACC TACTCGCCAT CGATGCGACG GTGGTGGACG CATGCCGCTT CGAACAGCAA ACGAGGGCGG CCCGGGACGG CCTGGCCGCC GGAATGGCCA GTTGGGCTGC CGACGCCATG GAACAGGCGT TGGCTCTGTG GCGCGGTGAC GCACTCGCCG ACGCGGAACC GATCGGATGG ACGGCAGCCG AGGCGGCGCG CTTGGAGGAC CTGCGGCTTG CCGCGCGGAT GGACCTCTGC GAGGTACTGA CCTGCCTCGG CAGAACGGGT GAGGCGCTCG GCGAGGTAGA GCGGCTGTTG GCAGCAGACC CGACCCGCGA ACGGCTGGTG GGCCTGCGCA TGCTCGCCCT GGCGGGATCC GGGCGGCCGA CCGAGGCGCT GAACGTCTAC CAACGGCTAC GCGTTCGGCT CGCCGATGAA CTCGGGGTAG ATCCGTCGCC CGAACTGGCA GATCTGCACA CCGCACTGCT ACGTGGCGCC GCGCCTGGTG AGCTGCGGGC ACCGGGTACG GTTCTCGGCC GGAACTCCGT CGCGGCGACG GGCCGGAACT CCGTCGCGGC GACGCCACCT CGACCGGCGC AGCTTCCGGC ACCGGTGGGG TACTTCACCG GCCGCGTCGC CGAGCTGGGT GAGCTGAGTA GTGTCATCGA AATAGGCCAC GATGACGTCC GGCCCGTGGT GCTGATCAGC GGCCAGGGCG GGATCGGCAA GACGTCGCTT GCGGTGCAGT GGGCCGCCAG CGTCACTGAC CGGTTTCCGG ACGGTCAGCT CTTCGTCAAT CTCCATGGCC ACAACCGGGC CGATGCGCTT GCCCCGGCGG AGGTGGTGGC AGTCCTGCTG CGGTGCCTCG GGATACCGGA TGATCGGCTT CCTACCGGGC TGGCCGAGCG AGTCGCCCTG TATCGCACGA TCCTCGCCGA CCAGCGCATG CTGGTCGTGC TCGACGACGT CGGTTCCACC GAGCAGGTGT TGCCGGTGAT CCCCGGCAGC GCCGCGAGCC TGCTCGTGGT GACGAGCCGT AACAGCCTCG TCGCTCTGGT AACGCACACC CGGGTACACA CCATCCTCCC TGAGCTGTTC ACCCAGGACG AAGCAACCGA TCTGATGGCA AAGATGCTCG GCACCGAGCG GGTAGGCCGA GAGCGCGACG CGGTGGCCGG ACTCGCCAAG CTGTGCGGTT GGCTTCCGCT GGCGTTACGG ATCGCGGCGG CAAAGCTCGC GCTGCGCCCG GCTCAGCCCA TCGAGGTGCT CGTCGAGGAG TTGTCTGGCG GCGACCGGTT GGCCAACCTC TCCGTCGAGA ACGGCAGCCG CGACGTCAGT GTGGTGTTCG CCAGTGCGTA CCAATCGCTG TCGGTACCCG CGATGCGGCT GTTCCGGCTG CTCGGCCTGC ATCCCGGGCC GCACCTCGGC GCAGCACTGG CTGCCGCGCT CTGCGGCCTG CCCGCCGACG TGCAGCGACA TGCGTTGGCC GAACTCGTCG CGGCACACTT GGTTGCCGAG CCACGGCCCG GCCGATACCA GTTCCACGAC 
TTGGTCCGGC TCTTCGCGCG GCGGTGTGCG CTTGCCGACG AGCCCGCGAG TACGCGTGCC GAGGTGGCCG AGAAGTTGCT CGACTGGTAT CTCGTCGGTG CCGCAACGGC CACCCAGGTG CTCGACAGCA ATCTCGACCG CGTAACCGCG ACGCTGCGTC ATCCGGCTCC GGAACTTCCC TTTTCGGCCA CTCGCGAGCA CACGATCGCG TTTCTCCACT CTGAACGCGA CAATCTACTG CCGATCGTGC GGTACGCGGT GGAGCACGAC CAGCCCGCTG CGGCATGCCA GCTGACCTAC CTACTGACCA GCTACTTCGA CGTACATGGC GACTGGTCCG AGCGGGTGGT GATGTGCCAG CACGCGGTCC GGGCCGCCCG TCGGCTCGGC GATCCGGTGC TCGAAGCCGA GATGCATCGG GCGCTCGGCG TGGCCTACCG CACGACACAC CAGCTCAGCC AGGCACTCGA CAGCCACCAC CACGCGTTGG CGCTGTTGCG GCCGCTCGGG GACAACCGCG GATTGGCGTA CGTCTACAAC AACATCGGCG GCGCGTTCGT AGAAATGCGT CGTTTCACCG CTGCGATCGA GGCGTACCAG ACCGCGCTGC GGCTGCACGG CCACTGTGGC AACCGGGCCG GCGCGGCGAC CGCCCAGCGC AACCTCGGAT ACGTCCACGT CCGGATGGGC TGTGCCGATC TCAGCTTCGC CCCGCTGGAC GCGGCGTTGG CCACCAGCCG GGCCATCGGT CTCCACCGGC TCGAGGCGAG CACGTTGAAC AGCCTCGGCG AGGCGCACCT GCAGCAGCTG CGACACGATC GGGCGCTCGA CTGCTTCCAC GAAGCCTTCG CCGTGAGCCG CAAAGCCGGC GATCGCCGCT ACCAGATGGT CGCGCTGGGC GACCTCGGAC GCACCTACCT GGCCCACGGC GACCCCGCGT CCGCTGTAGA TCACTTTAAT CGGGCACTGG CGATGAGCCG GAGCCTGGGT CATCGGCACA TCGAGGCACG CACCCTCAAC CAGCTCGGCG AAGCGCAGCT GCGCCTGGCG AACCTCGACG AGGCCCGCCG ATGCCTGACG GCAAGCGCCA GCCTTCGGCG GGCCGTGCCC GACCTGTACG AGCAGGCACA CGTGCAGCGC AACCTCGGCG ACCTTGCGGA GCTGACGGGT AGCCGAGGTG CCGCAGAACG CCACTGGTCA ACGGCTGTTC GCCTCTACCA CGAGGCGAGC GCGACCGATG AGGCGGAGCA GCTCGCCGGC AAGCTGACCG ACGAAACCGA CCTGGGTGCC GTCCCCGTTC CCCCACGATC GCGTCAGTCG TCGTCGACGA TGCCCATGGC CACCTGA
|
Protein sequence | MMSGVEVNIL GGVEVIGPRG RTELVGRRHR ALAAALALRP GAVLPLWRLV EALWGERPPR TAVRSLHSHV ARLRLALDAS GLGGVLQTRE PGYLLAIDAT VVDACRFEQQ TRAARDGLAA GMASWAADAM EQALALWRGD ALADAEPIGW TAAEAARLED LRLAARMDLC EVLTCLGRTG EALGEVERLL AADPTRERLV GLRMLALAGS GRPTEALNVY QRLRVRLADE LGVDPSPELA DLHTALLRGA APGELRAPGT VLGRNSVAAT GRNSVAATPP RPAQLPAPVG YFTGRVAELG ELSSVIEIGH DDVRPVVLIS GQGGIGKTSL AVQWAASVTD RFPDGQLFVN LHGHNRADAL APAEVVAVLL RCLGIPDDRL PTGLAERVAL YRTILADQRM LVVLDDVGST EQVLPVIPGS AASLLVVTSR NSLVALVTHT RVHTILPELF TQDEATDLMA KMLGTERVGR ERDAVAGLAK LCGWLPLALR IAAAKLALRP AQPIEVLVEE LSGGDRLANL SVENGSRDVS VVFASAYQSL SVPAMRLFRL LGLHPGPHLG AALAAALCGL PADVQRHALA ELVAAHLVAE PRPGRYQFHD LVRLFARRCA LADEPASTRA EVAEKLLDWY LVGAATATQV LDSNLDRVTA TLRHPAPELP FSATREHTIA FLHSERDNLL PIVRYAVEHD QPAAACQLTY LLTSYFDVHG DWSERVVMCQ HAVRAARRLG DPVLEAEMHR ALGVAYRTTH QLSQALDSHH HALALLRPLG DNRGLAYVYN NIGGAFVEMR RFTAAIEAYQ TALRLHGHCG NRAGAATAQR NLGYVHVRMG CADLSFAPLD AALATSRAIG LHRLEASTLN SLGEAHLQQL RHDRALDCFH EAFAVSRKAG DRRYQMVALG DLGRTYLAHG DPASAVDHFN RALAMSRSLG HRHIEARTLN QLGEAQLRLA NLDEARRCLT ASASLRRAVP DLYEQAHVQR NLGDLAELTG SRGAAERHWS TAVRLYHEAS ATDEAEQLAG KLTDETDLGA VPVPPRSRQS SSTMPMAT
|
| |
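The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full 3147 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
example = clean_sequence("ATGATGTCGG GTGTTGAGGT")
print(len(example))                # 20
print(round(gc_percent(example)))  # 50
```

Running the same two functions on the complete gene sequence should give a value that can be compared against the GC content row of the Gene Information table.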