Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_5071 |
Symbol | |
ID | 5704591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5741513 |
End bp | 5742403 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641274464 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001539805 |
Protein GI | 159040552 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000312862 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTATTCT TCAGATTACT CGGCCCATTA ACAGTTCGTA GCGAAACTTG CTTTGTCCCA CTCTCGAGCC CGCGCCACCG GAAAATTCTG GCTGCACTAC TGCTCAGCAG AAGCGGCAAT GTATCGTTGG AACGTTTGAT CGACGTTGTC TGGCCATCCC GACCACCTGC CACGGCTCGT CAACAAGTGC AAAACTGTAT CAGTTCCCTC AATTCCCGGT TGAGGGACTG GGGACATACA CAGACCGTGC AGCGATGCGA CTCCCACTAC AGCCTCGATG TTCCCGGCGA ATCCGTCGAT GAACGCCTTT TCCGCACCGA GTACACGGCT GCGGCGCAAC TCGCAGAGAA TGGCGACCCT GCTGAGGCCA GCATGCGGCT CCGGCACGCG CTGAGGCTCT GGCGCGGTAA TGCCCTGGAG GGAATTGGTA GCGACCAACT CGTCGGCGAG GCCACCCGTC TGGATGAGAT CCGGATGCAT GCCCTGGAGC AGCTGGTCAA CTGGGAGTTC GCCGAAGGTC GCTACCACAA GATGATTCCT GACCTCTGCC TGTGGTCGGA TACCTACCCA CACAACGAGC ACCTGCACGC CCGTCTGGCT GAGGCACTAC ATCATGCCTC CCGGACTGCT GAGGCGTTGG AAGTACTCCG GCAACTTCAC CATCGGCTTG ACCGCGAGCT GGGGATCAGA CCCGGTCCCG TCGTACTCAA CCTTGAGGCG CAGCTGCGCC GTCCGACACC AACTAGCGCG CCCGGCAGCC CGGTGGATCT GGCGGCTCTC AAAGACATAC ACGCCACCCT CGCAAACTTG ACCGACACTA TGCAGGCCCT GATCAAGGAT GTCCCCGCCC TGAGTCGAGC CGATTCACAC CGCTTGTCGG ATAAAGTTTA G
|
Protein sequence | MLFFRLLGPL TVRSETCFVP LSSPRHRKIL AALLLSRSGN VSLERLIDVV WPSRPPATAR QQVQNCISSL NSRLRDWGHT QTVQRCDSHY SLDVPGESVD ERLFRTEYTA AAQLAENGDP AEASMRLRHA LRLWRGNALE GIGSDQLVGE ATRLDEIRMH ALEQLVNWEF AEGRYHKMIP DLCLWSDTYP HNEHLHARLA EALHHASRTA EALEVLRQLH HRLDRELGIR PGPVVLNLEA QLRRPTPTSA PGSPVDLAAL KDIHATLANL TDTMQALIKD VPALSRADSH RLSDKV
|
| |