Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3099 |
Symbol | |
ID | 5706573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3521491 |
End bp | 3523455 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641272533 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001537901 |
Protein GI | 159038648 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0399621 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTCC ACATACTGGG TCCAGTTGAA CTACGCGTCG ACGGTCAGGT CACAGCCCTC GGTGGGGCGA AGCCACGAAC CCTCCTGGCC ACCATGCTGG TCCATCACGA CCAGGTGATA GCGGCGGACC GCCTGATTGA GGCGCTCTGG GGTGCCTCCC CACCGAGTCG GGCACGTTCG ATCCTCCAAA CGTACGTCTC GAGTCTGCGT CGAACCATCA GCGGTTCTGG TGGGGCGACT GTCGCCGCCG TGCCACCTGG CTATTCGCTG CGTCTCATGT CGAGCACGCT CGACCGGAAC GTTTTCGAGC AGCTGCTTAG CTCGGCGAAG CAAGCGACCA GCCGCGGCCG GCACGAGATC GCAGCTGACA TACTGAGTCG AGCATTGGCG CAGTGGCATG GTCCGGCCCT CGGCGGCGTC GAGAGCAGCT TGCTCGACAG TGAGGCGGCG CGGCTAGAGG AGCTACGTCT CACCGCCCTA GAGGATCGTG TCAACGCGAA CATAGCGCTC GGGCGACTGG CCGAGGTAGC TGCCGAGCTG ACCGACCTGG TGCGAAAGCA CCCGTTCCGT GAACGGCTGC ACGGACAGCT GATGACTGTG CTCTGTGGCC TCGGCCGGCA GTCCGAGGCT CTGCTGGTCT ATCGAGACGC ACGCCAGAGC CTAGTGGAGG AGCTCGGCGT CGAACCGGGC CCTGAACTGC GGGCACTCCA CACCCAGATT CTGCGAGGCG GCAGCGGAGG TCTCACAATG CCTTCAGCTG ATCGCCTCAA CACCTTGTCT CAGCGATCCA GCCACGATGA ACAACTAGAG GGTCGGCCCG CGCAGCTGCC AGCAGTCCCT GCCGACTTCA CCGGTCGGGC AGGTGAAGCG AAAGAGCTAG TTGCAAACCT CACTGCGGCT GCTAACAACG GCCGGGTCCC GGTGCAGCTG ATCGTCGGTG GCGCCGGAAT GGGTAAGTCC GCCCTCGCGG CCCATGTCGC GCACCAGATC ATCGACGAAT ACCCCGACGG CCAGCTCTAC GCGGACCTGC GCGGCCTCGA CGGAACCCCG GCAGAGTCGC ACGAGATACT GGGCAGCTTT CTCCGTGCTC TCTCACCGGG GAACCCGACG CTACCAGAGA GCACAGTGGA GCGGGCAGCC CGATACCGCA CTCTGCTCGC TGAGCGGCGG ATGCTTGTCG TGCTTGACGA CGCCCGCGAC GAACGCCAGA TCCGGCCTTT GCTTCCAGGG ACGGAGACCT GCGGTGTGCT GGTGACGGCC CGTCGCCGTC TCGCCGGCCT AGCCGGTTGC CAGGTCCTGG AGTTAGAGGG ATTGCCGGAC ACCGACGGTC GGCTACTCTT TGCCTCATTG GCTGGCCTGG ACAGAACCAG TGCCGAACCA GAGGCGACCC GACAGGTGGT GCAACTCTGC GGCGGGCTAC CGCTAGCCCT GCGCCTAGCC GGCGCCCGGC TAGCCAGCAG GCGTCTGTGG ACCGTGCGCT TGCTCGCCGA CCGGCTGGCC GACGAAAGCC TGCGGCTCGA CGAGCTCAGC GCCGGTGACC ATGACATGCG CGACAGTATT CGGCGCAGCT ACAACCAGCT TGACTTCCGG CAACGGGCGG CGCTTGGGAT CTGCGGGCTG CTCGGTCCTC GCGACATTTC CCCCTGGATC CTGTGCACCG CGCTTGCCAT CTCACCTATC AAGGCCGAGC GTGTCATGGA AGGTCTGGTT GACGCGTACC TGATGGACGT GGTCCGGGTT GACGAAGTCG GGCAAGCCCA CTATGCCGTA CATGATCTTG TGCGCCTATA TGCACGGGAG CGAGCACCCG CCGATGGCTT GATCGCCAAG GCTGCCGGGG ACAGGCTAGA CGCCTGGCTG TTGCCGCGCC AAGCTGTCCT AGACGGTGCT GCCCCCATCT CTCAACAACC AGGAGTAGAT CTACCCGCCA ACTCCCTCTA TGCCAGCATT AGGGAAGCAG GTTAG
|
Protein sequence | MEFHILGPVE LRVDGQVTAL GGAKPRTLLA TMLVHHDQVI AADRLIEALW GASPPSRARS ILQTYVSSLR RTISGSGGAT VAAVPPGYSL RLMSSTLDRN VFEQLLSSAK QATSRGRHEI AADILSRALA QWHGPALGGV ESSLLDSEAA RLEELRLTAL EDRVNANIAL GRLAEVAAEL TDLVRKHPFR ERLHGQLMTV LCGLGRQSEA LLVYRDARQS LVEELGVEPG PELRALHTQI LRGGSGGLTM PSADRLNTLS QRSSHDEQLE GRPAQLPAVP ADFTGRAGEA KELVANLTAA ANNGRVPVQL IVGGAGMGKS ALAAHVAHQI IDEYPDGQLY ADLRGLDGTP AESHEILGSF LRALSPGNPT LPESTVERAA RYRTLLAERR MLVVLDDARD ERQIRPLLPG TETCGVLVTA RRRLAGLAGC QVLELEGLPD TDGRLLFASL AGLDRTSAEP EATRQVVQLC GGLPLALRLA GARLASRRLW TVRLLADRLA DESLRLDELS AGDHDMRDSI RRSYNQLDFR QRAALGICGL LGPRDISPWI LCTALAISPI KAERVMEGLV DAYLMDVVRV DEVGQAHYAV HDLVRLYARE RAPADGLIAK AAGDRLDAWL LPRQAVLDGA APISQQPGVD LPANSLYASI REAG
|
| |