Gene Sare_2592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2592 
Symbol 
ID5707886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2955795 
End bp2957603 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content72% 
IMG OID641272054 
ProductSARP family transcriptional regulator 
Protein accessionYP_001537424 
Protein GI159038171 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0592694 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCACG CCGCGGCGAG CCTATCCGGC CATGACCAGC CAGCCCAACC GGCGGCGTTC 
CGGGTGCTCG GCCCGTTGAC CCTCAGCAGC AGCAAGGACA CGCTGGTCCT CCCACCGTCG
AAGGTGACCT CGCTGCTCGC CGTGCTGCTG TTGCACCCCG ACGAGGTGGT CTCGGTCAGC
GCGTTGCGGC AGGCCATCTG GGGCGACGAG CAGCCTGCCT CGGCCAAGGC CGCGCTGCAG
ACCTGCACCC TGCGACTTCG GCAACTGTTC GCCCGGCACG GCATCACCGG CAGCGTGATC
AAGACCGTGC CGGGTGGCTA CCGGATCACC GCCACCGCCG AGACCGTCGA CCTGATGCGC
TTCCGCGAAC TGATCGGCCA CACCCGCGAC GTCACGGACC CGGAGGTGGA ACTGGCGAGG
CTGGAGGAGG CGCTCGCGCT GTGGAGCGAC CCGATGCTGG CCAACGTGCC ATCGGAGGCA
CTGCACCGGG ACGTGGTACC CCGGATCAGC GAGGAGCGGG TCCGAGCGAT CGAGCGGGTG
TGTGACCTGA AGATCGGCCT GGGCCGAGAC CGCTCCGCGC TGGTGGACCT CTGGACCGCC
GTCCGTGCGT ACCCGGCGAA CGAGCGCTTC TCCGCGCAGC TCGCCTCGGT GCTCTACCGC
ACCGGCCGGC AGGCCGACGC CCTGGCCGAA CTGCGTCGGA TCCGGGACCA CCTGCGCCAC
GAACTCGGCA TCGCCCCCGG TCCCACCCTG CGCGCGTTGG AGCTGACCAT CCTGCGGGGC
GAAGCACCCA CGTCGGTCGG CCCGGTCGTG CGACCGACCA GCGTGGTGAC CCGGCACCCG
GTCGCGTCCG GCCTCGTCGG TCGGGACGCG CTCGCCGAGA CCATCGCCGA ACGCCTCCGC
GCGGGATGCC CGATCGTGGT GCTCAGCGGC CCACCCGGCG TCGGCAAGAC CGCGCTGGCG
CAGTACGTCG GGCAGCTCGT CGCCCCCGAC TTCCCCGGCG GCCGGGTGGA GGTGACCGCG
GCCGCCACCG GGCCGGTGAC GTCGCTGCAC GACGCCCAAC AGCAGCTACT CACGGCGGTC
GACCAGGGCC ACGACGTCCA GGTGGGCAGA CGACTGCTGC TGGCCGACGA CGTGGTCAAC
GGCCGTCAGG TACGGGACCT GCCGGCCCTG TTGGCGCCCG GCGACGCCCT GCTGCTGACC
AGCCGACAAA GCCTCTCTGG ACCGGTTGCC CGACTGGGCG GGTGGCTGCA CCGGGTGGAG
CCGCTGAACC CCGACGACTC GCTGCGGCTG CTCCGTACCG CACTCGGCCC CGAACACGTC
GACGCCGACC CGCAGAGCGC CGCGGAGATC GCGGCACTCT GCGACCACCT GCCACTCGCG
CTGCGCATCG CCGCCACCCG CATCCTGCTG CGCACCAGGA CGGACCTCGC GGCGGCGTTG
GAGTGGCTGC GCGTCGACCC GCTGAGCCGG CTGAGCCTGC CCGGTGAACC GGACATGTCA
CTCGGTCACC GCTTCGACGA GGCCCTGTCC CGGACCGGCG ACACGCTGGG AGCGGTCTTT
GTCCGACTGG CCACCGCGGC TCCCACCACC ATCACCGTCC GACAGGCCGC GCAGCTACTC
GACGTCGACC CGGCCACGGC TCGCGACCTG CTCGACGAAC TGGTCGACCA CAGCCTGCTC
GAGGAGGCCG CGGACCACTA CTGGATACGC GTACTGCTGC GGCGACACGC CCAGGTTGCG
GCCGATCGGT ACGCCCCCCA CCCCGGCCCA CCGCGACACC CGCACCAAGC GAAGGGATCC
ATGCGATGA
 
Protein sequence
MEHAAASLSG HDQPAQPAAF RVLGPLTLSS SKDTLVLPPS KVTSLLAVLL LHPDEVVSVS 
ALRQAIWGDE QPASAKAALQ TCTLRLRQLF ARHGITGSVI KTVPGGYRIT ATAETVDLMR
FRELIGHTRD VTDPEVELAR LEEALALWSD PMLANVPSEA LHRDVVPRIS EERVRAIERV
CDLKIGLGRD RSALVDLWTA VRAYPANERF SAQLASVLYR TGRQADALAE LRRIRDHLRH
ELGIAPGPTL RALELTILRG EAPTSVGPVV RPTSVVTRHP VASGLVGRDA LAETIAERLR
AGCPIVVLSG PPGVGKTALA QYVGQLVAPD FPGGRVEVTA AATGPVTSLH DAQQQLLTAV
DQGHDVQVGR RLLLADDVVN GRQVRDLPAL LAPGDALLLT SRQSLSGPVA RLGGWLHRVE
PLNPDDSLRL LRTALGPEHV DADPQSAAEI AALCDHLPLA LRIAATRILL RTRTDLAAAL
EWLRVDPLSR LSLPGEPDMS LGHRFDEALS RTGDTLGAVF VRLATAAPTT ITVRQAAQLL
DVDPATARDL LDELVDHSLL EEAADHYWIR VLLRRHAQVA ADRYAPHPGP PRHPHQAKGS
MR