Gene Sare_4606 details


Gene Information

Locus tag: Sare_4606
Symbol:
ID: 5706627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5224603
End bp: 5227428
Gene Length: 2826 bp
Protein Length: 941 aa
Translation table: 11
GC content: 71%
IMG OID: 641274008
Product: transcriptional regulator
Protein accession: YP_001539355
Protein GI: 159040102
COG category: [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
        [COG3903] Predicted ATPase
TIGRFAM ID:
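
The coordinates and lengths listed above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(5224603, 5227428))  # (2826, 941)
```

The result matches the Gene Length (2826 bp) and Protein Length (941 aa) fields of this record.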


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0568023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.290469
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGGGC CGGCGGTACG CATACGGTTG CTCGGCGGGG TCGAGGTGGT GGACGGCAAC 
GGCGCCGCCG TCGACATCGG GGCGGGTAAG TGCCGTGCGC TGCTCGCGGC CCTGGCGTTG
CAGCCCGGTA CGGCGATTCC GGACTGGCGG CTAGTCGATC TGCTGTGGGG CGAGCAACCA
CCCCGGACTG CCGTCCGAAC CCTGCAGTCG TACATCGCTC GGCTACGGGG CGGTCTGGGC
GCCACGAGGA TTGTGCGCTC GGGTGCCGCG TACCGTCTCG ATGTGCCCGC CGATGCGGTC
GATGTGATCC GATTCGGCCG GCGGGTCGAG GCCGGCGACC TCGCCGGGGC GCTCGCCGAG
TGGACCGGCG AGCCACTGGC CGGGGTACCG GTGCCCGGCC TGGCTGCGGC CGTGGACGGC
CTGGTCGAGC GGTGGCTCGG CACGGTCGAA GCCGATCTCG CCGCCCGGGT GGACGCTGAC
GCCGCAGCGA CCGTGGGGCC GTTGACTGAG CTGAGCACGC GGTATCCGTT CCGTGAAGGC
ATCTGGGCGC TGCTGATGAC GGCGCTGTAC CGGGTGGGCC GGCAGGCCGA CGCGCTGGCC
GCATACCGCA CTGCCCGTCA ACGGCTGGTT GAGCACCTGG GCGTGGAGCC CGGGCCGCGC
TTGCGCCGCC TGGAGGTGGC GATTCTCGGG CAGGACCACC GCATTGGCGG CGAGCAGCGG
TCCGAGTCGA TCGACCGGTT GCCCCGACGC GCCGTGCGAC TGATCGGCCG CGACGGTGAC
CTCGACCTCA TCGGCCGGGC ATTGGCCGAG AGCCCGGTGG TCACCCTGGT CGGGCCGGGC
GGCATCGGCA AGACCGCGCT CGCTGTTGCG GCCGCCCAAC GCGTGCGGCT CGAGCACGGC
GCCTGGCTGG TCGACCTGAC CGAGATCACG ACCGACCAGG ACGTCCCCCA AGCCGTCGCC
GCGGCGGTGC GCGTCGAGGA GGGCCCAGGC CGATCGCTGA GTGAATCCAT TGTGCTAGCC
CTGAGCTCAC TCCGGGCACT GCTGGTGCTC GACAACTGCG AACACGTCGT GGACGGCGCG
GCACGCCTGG CCCAGGCCGT CGCCGACGGC TGCCCGCAGG TGCGGGTGCT GGCCACCGCG
CGGGAACCGC TCGGCCTCAA CCACGGTCAC GAACGGCTGG TCCCCGTGAC GCCGTTGCCC
GCGGCCGGGG CGGGCGCCGA CCTGTTCGCC GACCGTGCGA ACGCGCTGAC CGCCGCGTTC
ACGATGGATG CCGCGCGGGA GGTGATCGAG GAGATCTGCC GCTGCCTCGA CGGGCTTCCC
CTCGCCATCG AGCTGGCCGC CGCCCAAACC GTCAGTCACA CCCCGCCGGA AATCCGCGAG
CGTCTCGACG ATCAGCTCGG TTTGCTGGTC GGCGGGCGGC GAACCGGGGC GGACCGGCAC
CGCACCATGC GCGCCACGAT CCAGTGGTCC TACCGGCTGC TCACCGTGGC CGAACAGGAC
CTGCTGCAAC GGCTGTCGGT GTTCACCGGC CCCGTCGACC GGGCCGGAGC TGCGGCCGTT
GCCGCCGGCA GCGGCCTGGA TGTCAACGAC GTGCTGCACA CCCTCGTACA GCGCTCGATG
GTTACCGCCG GACCCGGCCT GTTCGGCCAG CAGTTCAGGT TGCTGGAACC AATCCGCCAG
TTCGCAGCCG AACACCTCAC CGCAGGACCG GCGGCCGCAC CCGCCCAGGC CGCGCACACC
CGATACGTGC GGGAACGGGT GACCTCGCTA CGCGACCAGC TCACCGGACC CGCCGAAGTC
CAGGGGGTCG CCCGTCTGGA CGAGCTGTGG CCCAACCTGC GCGTAGCGGT TGACCGGGCC
TTTGCCTGCG GCGACTACCG CCTCGCCCAT GACCTGTTCC GGCCGATCGC CACCGAGGCC
GCCCGGCGGC ACCGGCACGA AGTCGGGCAG TGGGCCCAAC GCCTCCTCGA ACAGGCACCG
CCCGAGGATC GGCCGCGGAT CGTGACCGGC CTGATCGCTG CCGCATCCCG CTATCACGTC
TGTCAGGACC CGGCCGGGTT CGACACCTTG ATCAGGCAGC ACTGCGAACC GGACGACCCG
GTAGCCCGGC ACATGCGGGC CAACGTCCGC GACGACTACG CTACCCAGAT ACACACGGCG
CCGCAGGCAC TGGCCGAGCT GCGCCGGCTC GGCGCCGACG ACCTCGCCGC GCACGTCGAG
GTCGACCTCG GCGCGGCGTT GGTCTTTCAG GGACAGTACG CACGCGGAGA AGCCCAGCTC
ACCCAGCTCG TCGACCGGTT CCGCAGCCAC GGCCCGCCCA CCCTGCTGAA CTGGACGCTG
ACGCTACTCG GCTTCTCAGC CGCCTTCCAA GGTCGACGGG CCGCCGCGGA CACGTTGTTC
GACCAGGCGA TCGACGTGCC GCTGCCGGCA CGCACCCACT CGCCGAACCA GTCCGTGCGT
GCCCTGGCGT TGTTCCGGCG CGGCGACCGC AGAGCCGCCT ATCAGCTGCT CCGTGCCCAC
GTCGAAGAAC TGCTCGACGC GGACAACATG CACGGTGCCT GCGTCGTGTC GGTCAACTTC
GTCACGATGA TGCCGGCAGT GGCACGCTTC GCCGACGGGG CCCGGATCCT GGCCTTCCTC
GACACCACCG GCGCGCTCGA CAACGCCGCC TGGGCGGCCA TGGTCGCCGA CGCCAGGGAC
AAACTCGCCA CCTTCGCCCC CATCCCGAAC GGATCCATGA TCCTCGACCA GCGGCAGGCC
CTCGCCACGA TCGGCAAGAC TCTCGACGGC CTTCTTATCG AACAGGCCAG CCCGGTCAGA
TGCTAA
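
A GC percentage like the one reported under Gene Information can be computed directly from a sequence in this layout. The following is a minimal sketch. The helper name `gc_percent` is hypothetical, and it assumes the input contains only A, C, G and T plus the whitespace that separates the 10-base groups. Whether the database computed its 71% figure over exactly this span is also an assumption.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base groups) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, pasted with its original grouping.
first_row = "ATGGCAGGGC CGGCGGTACG CATACGGTTG CTCGGCGGGG TCGAGGTGGT GGACGGCAAC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of `first_row` gives the value to compare against the reported GC content.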
 
Protein sequence
MAGPAVRIRL LGGVEVVDGN GAAVDIGAGK CRALLAALAL QPGTAIPDWR LVDLLWGEQP 
PRTAVRTLQS YIARLRGGLG ATRIVRSGAA YRLDVPADAV DVIRFGRRVE AGDLAGALAE
WTGEPLAGVP VPGLAAAVDG LVERWLGTVE ADLAARVDAD AAATVGPLTE LSTRYPFREG
IWALLMTALY RVGRQADALA AYRTARQRLV EHLGVEPGPR LRRLEVAILG QDHRIGGEQR
SESIDRLPRR AVRLIGRDGD LDLIGRALAE SPVVTLVGPG GIGKTALAVA AAQRVRLEHG
AWLVDLTEIT TDQDVPQAVA AAVRVEEGPG RSLSESIVLA LSSLRALLVL DNCEHVVDGA
ARLAQAVADG CPQVRVLATA REPLGLNHGH ERLVPVTPLP AAGAGADLFA DRANALTAAF
TMDAAREVIE EICRCLDGLP LAIELAAAQT VSHTPPEIRE RLDDQLGLLV GGRRTGADRH
RTMRATIQWS YRLLTVAEQD LLQRLSVFTG PVDRAGAAAV AAGSGLDVND VLHTLVQRSM
VTAGPGLFGQ QFRLLEPIRQ FAAEHLTAGP AAAPAQAAHT RYVRERVTSL RDQLTGPAEV
QGVARLDELW PNLRVAVDRA FACGDYRLAH DLFRPIATEA ARRHRHEVGQ WAQRLLEQAP
PEDRPRIVTG LIAAASRYHV CQDPAGFDTL IRQHCEPDDP VARHMRANVR DDYATQIHTA
PQALAELRRL GADDLAAHVE VDLGAALVFQ GQYARGEAQL TQLVDRFRSH GPPTLLNWTL
TLLGFSAAFQ GRRAAADTLF DQAIDVPLPA RTHSPNQSVR ALALFRRGDR RAAYQLLRAH
VEELLDADNM HGACVVSVNF VTMMPAVARF ADGARILAFL DTTGALDNAA WAAMVADARD
KLATFAPIPN GSMILDQRQA LATIGKTLDG LLIEQASPVR C
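
The protein sequence above corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship in plain Python. The names `CODON_TABLE` and `translate` are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. This sketch does not handle alternative start codons, which does not matter here because the gene begins with ATG.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop the 10-base grouping
    protein = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_row = "ATGGCAGGGC CGGCGGTACG CATACGGTTG CTCGGCGGGG TCGAGGTGGT GGACGGCAAC"
print(translate(first_row))  # MAGPAVRIRLLGGVEVVDGN
```

The output reproduces the first 20 residues of the protein sequence listed above.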