Gene Information |
Locus tag | Sare_4191 |
Symbol | |
ID | 5703845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4759217 |
End bp | 4761082 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641273612 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001538965 |
Protein GI | 159039712 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000808201 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGGTCG AATTGAGGGT TCTGGGAACC GTAGACATTA TATATGGCAG TACATCCGTT GCATTACGGG GAGTTAAACC GAAGCAATTA CTCGCAGCGC TCGTGCTGAA TAGCAACACT TTTCTATCGG TGGGCCGAAT TATTGAGGCG CTCTGGCCGT TTGACCCGCC AGTGTCTGTC CACCATAATA TCCGAACGTA CGCCACCGCA ATACGCCAAC GATTGCGTTG CAGCAGCAAT GACAATACAG TAGCGCTACT CGCAACTACG GGCGGTTATT TACTCAGGGT TCCGCCCGAG CAGGTCGACG TCACGAAATT CGTGACACAC ATAACAAGTG CACGCGAGAA GCGTGCGGCT GGCGATACCT GCAACGCAGC ATCTGACTTA ACCCGGGCAC TTGGCATTTG GCGTGGATCC GCAGGTGAAG GACTGCCACG CGAGGGCTGG CTGGGAAATG CGCTTTTAGC GTTGGATGAA CAACGATTGT TGGCGGTCGA GGAGCGAGTC CAATTGTGGC TACAACTCGG GCGGCACACC GAGTTAGTAC CCGAATTAAT GAGCGAGTTA ACCGAACAGC CCCTACGAGA AAGTTTCTGG CGCTATCTGA TGTTGGCGCA GTATAATAGC GGCAGAACTA GCGACGCTTT GGCAAGCTAC GAAAAGGTGC GTTCGGTGCT TGCCGAGCAT CTCGGCATTG ATCCATGGCC GGCGCTCGCC GAACTGCACC AAGCAATTCT TCGCCACGAT CCCGTCCTCG GTCGCCACTG CCTCGGTCTT ACAGATATGC CGGTACGCCG CGCAGCGACC TCGCCGGTAA ATGTATTGCC GACTCCTCGG CAGCTCCCTG CCCCGCCCGT AGACCTGGCG GACCGTCGAC AGACATACTC GCGGATTACA CAAGTTTTAC GGGAAGGCGC GTCGAAACCG GTTACCATCG CGATAACCGG CCCCCCCGGT ATTGGAAAGT CAGCGCTGGC CCAACAGGTG GCGCACACAG TCGCAAGTGA ATTTTCAGAT GGCCAATTGT ACGTCGACCT GCGGGAGCTC GGGCAACCGC GCCATCCGGA AGCCATGTTG GCAGTGGTCC GCTCTCTGTT ACGAAGTCTA GGAAAAGATG ATGCCTCCTT CACTACCGCC TCCGAGGCTG CCGCCCATTT CCGAAGTAGC ATCGCCGGTA GACGCCTGCT TGTCCTGATC GACAATGCGG TATCAGCTCG GCAAGTGAGA TACCTTCTAC CGGCTTATCC AGGGTCTGCG GCGATTATCA CCAGCTGCCG CCAACTCGAC CTAAGTGTGG TAAACGAACA AGTCACACTC GATCCAATTA GCCTAACCGG CGGTATGGAA ATTCTTCGGC GGATCATCGG CGATGATCAG CTTCGCGCTG AAACGGAAGC AGCTAAAGGG CTCATCAAGG TCTGCGACGG GGCACCACTG GCGCTCAGGG CTGCAGGCAG GCATCTGACG CAACAGCCAT GCAGTGCACC GACCCAACAC CTTTACAATA AGCTTAGCGC CGATCCCAAA TTGCTCGACG ACCTCCGTTT CGATGGACGA CGGTCCGCAG AACCGGTGGC GGTTGCCCTG GCAGAACTGA GAGCAGTGGA GTTCAGGGCG TCCGCCGCAC TGGTTGAGCT CAGTGCTGTC GGCGGCCGAT TGAAGGACAT CGAATTCAGT GAGCTAAACA GAGGCGGAGC GAGGAACCGG ATCAGGCGTG CCGATGTGGA CGTACTCTTG GACTTCCACT TGCTCAGACG ACAAGCAGGC GGTTTCGTGG TACCGGATGT TGTCCGCCGC 
CAGTGCGAGG CAGATGCGCA TCCAGCTCGG TACCGCGTGC AGGGATTTTG TAAAGTTCCT ATTTAA
|
Protein sequence | MQVELRVLGT VDIIYGSTSV ALRGVKPKQL LAALVLNSNT FLSVGRIIEA LWPFDPPVSV HHNIRTYATA IRQRLRCSSN DNTVALLATT GGYLLRVPPE QVDVTKFVTH ITSAREKRAA GDTCNAASDL TRALGIWRGS AGEGLPREGW LGNALLALDE QRLLAVEERV QLWLQLGRHT ELVPELMSEL TEQPLRESFW RYLMLAQYNS GRTSDALASY EKVRSVLAEH LGIDPWPALA ELHQAILRHD PVLGRHCLGL TDMPVRRAAT SPVNVLPTPR QLPAPPVDLA DRRQTYSRIT QVLREGASKP VTIAITGPPG IGKSALAQQV AHTVASEFSD GQLYVDLREL GQPRHPEAML AVVRSLLRSL GKDDASFTTA SEAAAHFRSS IAGRRLLVLI DNAVSARQVR YLLPAYPGSA AIITSCRQLD LSVVNEQVTL DPISLTGGME ILRRIIGDDQ LRAETEAAKG LIKVCDGAPL ALRAAGRHLT QQPCSAPTQH LYNKLSADPK LLDDLRFDGR RSAEPVAVAL AELRAVEFRA SAALVELSAV GGRLKDIEFS ELNRGGARNR IRRADVDVLL DFHLLRRQAG GFVVPDVVRR QCEADAHPAR YRVQGFCKVP I
|
| |
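The reported gene and protein lengths can be cross-checked against the coordinates in the record. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed above but are not stated by the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Coordinates copied from the record (Start bp / End bp).
START_BP = 4759217
END_BP = 4761082


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residue count for a complete CDS, assuming the final codon is a stop."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1866 621
```

The result agrees with the Gene Length (1866 bp) and Protein Length (621 aa) fields, and with the sequence itself, which starts with ATG and ends with the stop codon TAA.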