Gene Sare_4843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4843 
Symbol 
ID5707622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5496603 
End bp5500058 
Gene Length3456 bp 
Protein Length1151 aa 
Translation table11 
GC content72% 
IMG OID641274239 
ProductSARP family transcriptional regulator 
Protein accessionYP_001539584 
Protein GI159040331 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0675424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00881271 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCAACGA CACTTGATCA TCGGCTTCGT GTCTCGGTCC TGGGCTCCGT GCGGGCCTGG 
CTCGGCGAAC ACGAGCTGCC CCTCGGACCG GCCCGGCAGC GGGCACTCTT CGCGGTGCTG
GCAGCCGCCG CCGGCCGTCC CGTCGGCCGC GACGAGCTGA TCGAGGGAAT CTGGGGCACC
TCGCCGCCGG CTACCGCGGC CGGCAGTGTC TACACCTACG TCTCCGGGCT TCGGCGTAGT
CTGGCGACGC CGGGCCTACC CCGATCGAGT CACCACCTCC TCACCTCCGG CCCGTCGGGC
TACGCCCTGC GGCTGTCCCC CGAGGACCTG GACAGCGAAC GCTTTCTGGC TTTCTGCCAC
CAGGCTCAGG AACTGCGGGA GGCCGATCCA CAGGCTGCGG CCACCCGCTG GGACGAGGCA
CTGGCGTTGT GGCACGGGGA GGCGTACGCC GGAGTCTCCG GTCCCCGGAT CGAGACGGAA
CGTACCCGGC TGGCCGAGCG TCGCCTCACC GCGACCGAAC AGCGGGCACG ACTGGGCCTG
CACCTCGGCG ACGACGACCT GCTGGCCACA CTGTCCGGCC TGGTGCACGA GAACCCGCTG
CACGAGCCCC TGTACGAGCT GCTGATGCTC GCCCTGCACC GGATCGGCCG CCGTACCGCG
GCGATCGAGG TGTTCCGGGC CGCTCGGCAC ACCCTCCTCG CCGAGCTCGG TGTCGGGCCC
GGCCCGGCGC TGACCGAGTT GCACCGACGG CTACTGGCCG AGCCGGCCGC CCCCGCCATC
CCCGCCACCC GCCCCACACC ACCGATCCTG CCATCAGAGA TCAGCAGCGC GGCGTGCGAC
GGTCAGGCGC GGGTGCTGGC GGGGCGCGAC GACGAACTGA CGATGCTGCG CATCCTGGTT
CGGGCGGTCG TAGCGGGCCA CGGAACCGCG CTGTGGATCG AGGGTGAGCC GGGCATCGGC
AAGACGGAAC TACTCACCGC CGCCATCGCC GACGCGAACG GCCTGGGCTG CCAGATCGCG
TGGGGCGCCG CCGACGAACT GGACCAGGAC GCCCCGCTGC AGGTGATCTG GCGGGCGTTG
GGCGTGCGGG CGACCTCGGA GTGGCATCGG CCCCCGAACC CGGCGGCCGC CGCGGAACGC
CTGCTCGCAC ACGTGCGGGC GAGCTGCGAG ACCGGGCCCC TTGTGCTCGT CGTCGACAAC
TTTCAGTGGG TGGACGAGGT CAGCTGCCTG CTCTGGGACC GGCTGATCGC CGCCACGCGA
CGCATGCCGC TGTTGCTTGT CGCGGCGACC CGACTCGGGT CGTACGGCCA CGCACTAGCC
CAGCTACGCC GCAGCGTCCA GGCCCGGGAG GGCCACGTAC TGCGTCTGCG TGCGCTGCCG
CCAGCGGCGA TCGAGCAGTT GGTGGCCCAG TTGGTCGGGG CTCCGGTGGG GCCGTACCTG
GCGGCGGTCG TGCCGCGGCT GGGCGGAAAC CCGCTCGGTG CCCGTGAGGT GATCACGGCC
TTGGTCCGTC GTGGGGCGGT GCGGATCAGT GACGGGTGTG CCGACATCGA CACGTCCGTG
CCAGTGGAGG CTCCCCGCTC ACTGCTGGTC GTGATCCGGG CCATCGTGGA CCTTCTCTCA
CCCGCCACCC AGGAGGTCCT ACGAACGGCG GCGCTACTTG GCCCGGAGTT CAGCGGCGAC
GACATCGTCG CGGTCACCGG ACGGTCACCA GTCGACCTGG TGGCAAACCT GGAGGAGGCA
CTCGCTGCCC ACATCGTCGT GGAAGCGGGC AGCGTGCTCG CCTTTCGTCA CCCACTCGTG
CGGCGAATGC TCTACGAGGG TATCCCGGCG CCGGCGCGAG CGGCGTTGCA CCGGCACACG
GCCGAGGTGC TGCACCGTGG CGGCGGTTCG GCGACCCGGG TGGCCGAGCA ACTCGCCGCC
GGTGAGCCCC CGGTCGACGA GTGGATGGTG TCCTGGCTCG TCGAGCGGCA CGCCGAGGTG
ACCAAGCGGG CGCCGCAGGT CGCCGGCCAC CTGCTGCGGC GGGTGCTCGA CACGGATGTA
CCCAGCACGA CACAGCGCGC GGTGTTGCTC ATCGCCCTGG TCAGACTGGA CTTCCGGCAC
CAGCGGTGGT CGATGGCCGA GGCGCGCGAG GCCGCCGGGC TGGCGCGGGA TCCGGCCGAT
CGTGCGGAGA TGCGTCACCT GCTCGCCACC ATGGCCTTCC GTCGTGGCGA CGCCGTGGCC
GCGACCAGCC TGCTCGAGGC GTCGCTGTGC GATCCGGAGG CGCCCGAGCT CTGGCGGACG
AGCCACCATG TGCTGCTGGC CACCTTTCGG CGGGGCAGCC TCGACGATCT CGACAGCGCC
GATCGGACGG CCGGGTGCAC CCACAGCGAG GCAGTCGCGG CGGGACAGCC GTACGAGGCC
GCGTTCGCGT TGCAGACCGT GTGGCTGACC AACTCGATCC GGCGCGACCA CGAACGTGCG
CTGGAGTACG TCGACCAAGC GTTGGACACC GTTCGGGGCC GTCGGGCGTG CGTGGGCATG
TACCTTGACC TGCTCGACAA CCGGACGTTC ACCCTGCAGA ACCTCGACCG GCTCGACGAG
GCCGAACGGA CCCTGCGCGA GGCGGCGCTC GTCGCCCTCC GACACCGGCT TCCACACGGC
CTACCGGTGG CCACGGCGGT CCAGCACTAC TGGCTTGGCC GCTGGGACGA CGCCCTGACA
ACGCTCAGCG CGGTCAGGGC GGTCGCCGAT GACAATCCCG GCATCACGTT CCTGGGAAAC
CGTGAACCGG GTGCCGTCAC CATGCTGCTG CACGGTGTCG CCGCGCTGAT CGCCGCACAC
CACGACGACG CTGGCCTGGT CGCCGAGCAC CTGGCGGCGG TGCACGCGCT GCCCGCCACC
GATGCCGAGC GGGCGAGTAG CGACTTCCTG CTGGTCGCCC GCTCGTTGGT CGCCGAGCAG
CAGGGCCGGG CCGACGAGGC ACTCGACGTC CTCGCCCCCC TGCTGGCACC GTCGTACGCG
CCGATGATGC TGCGTCACCA GTGGCTTCCG GGCGTGGTGC GGCTCGCCCT CGACCAGCGC
CGTACCGACG TCGTCGAATG GGCGGTGGAG ATCTGCGCGG GTGAGGCGAA CAAGGAGGTC
CGGCCCGCTC GAGCAGATGC CGCCGCCCAG CGTTGTCGGG CGCTGATCAC GGGGGATCCG
GAACCCGCCC TGGCTGCGGC GGCCCGCTAC CGCAGGGTGG GTCGGGCGCC GGAACTGGCG
GCGGCGCTCG AGGACGCAGC GGTGCTGCTC GCCGCCGATC GTCGCCCGCA CGAGGCGATC
CGGGTGTGCG GCGAGGCGGT ATCGCTGTAT TCGGCGCTCG GCGCCCGGTG GGATCTGCGC
CGGATCCGCC GCCGGCTCGC TGAGTTCGGG GTGGATCGGT CCCGCGAAGC GGTGTCCGTG
GATCGGCGGT CCGTCGCTGT CCTCAGTGCA GGGTGA
 
Protein sequence
MSTTLDHRLR VSVLGSVRAW LGEHELPLGP ARQRALFAVL AAAAGRPVGR DELIEGIWGT 
SPPATAAGSV YTYVSGLRRS LATPGLPRSS HHLLTSGPSG YALRLSPEDL DSERFLAFCH
QAQELREADP QAAATRWDEA LALWHGEAYA GVSGPRIETE RTRLAERRLT ATEQRARLGL
HLGDDDLLAT LSGLVHENPL HEPLYELLML ALHRIGRRTA AIEVFRAARH TLLAELGVGP
GPALTELHRR LLAEPAAPAI PATRPTPPIL PSEISSAACD GQARVLAGRD DELTMLRILV
RAVVAGHGTA LWIEGEPGIG KTELLTAAIA DANGLGCQIA WGAADELDQD APLQVIWRAL
GVRATSEWHR PPNPAAAAER LLAHVRASCE TGPLVLVVDN FQWVDEVSCL LWDRLIAATR
RMPLLLVAAT RLGSYGHALA QLRRSVQARE GHVLRLRALP PAAIEQLVAQ LVGAPVGPYL
AAVVPRLGGN PLGAREVITA LVRRGAVRIS DGCADIDTSV PVEAPRSLLV VIRAIVDLLS
PATQEVLRTA ALLGPEFSGD DIVAVTGRSP VDLVANLEEA LAAHIVVEAG SVLAFRHPLV
RRMLYEGIPA PARAALHRHT AEVLHRGGGS ATRVAEQLAA GEPPVDEWMV SWLVERHAEV
TKRAPQVAGH LLRRVLDTDV PSTTQRAVLL IALVRLDFRH QRWSMAEARE AAGLARDPAD
RAEMRHLLAT MAFRRGDAVA ATSLLEASLC DPEAPELWRT SHHVLLATFR RGSLDDLDSA
DRTAGCTHSE AVAAGQPYEA AFALQTVWLT NSIRRDHERA LEYVDQALDT VRGRRACVGM
YLDLLDNRTF TLQNLDRLDE AERTLREAAL VALRHRLPHG LPVATAVQHY WLGRWDDALT
TLSAVRAVAD DNPGITFLGN REPGAVTMLL HGVAALIAAH HDDAGLVAEH LAAVHALPAT
DAERASSDFL LVARSLVAEQ QGRADEALDV LAPLLAPSYA PMMLRHQWLP GVVRLALDQR
RTDVVEWAVE ICAGEANKEV RPARADAAAQ RCRALITGDP EPALAAAARY RRVGRAPELA
AALEDAAVLL AADRRPHEAI RVCGEAVSLY SALGARWDLR RIRRRLAEFG VDRSREAVSV
DRRSVAVLSA G