Gene Sare_3099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3099 
Symbol 
ID5706573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3521491 
End bp3523455 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content65% 
IMG OID641272533 
ProductSARP family transcriptional regulator 
Protein accessionYP_001537901 
Protein GI159038648 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0399621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTCC ACATACTGGG TCCAGTTGAA CTACGCGTCG ACGGTCAGGT CACAGCCCTC 
GGTGGGGCGA AGCCACGAAC CCTCCTGGCC ACCATGCTGG TCCATCACGA CCAGGTGATA
GCGGCGGACC GCCTGATTGA GGCGCTCTGG GGTGCCTCCC CACCGAGTCG GGCACGTTCG
ATCCTCCAAA CGTACGTCTC GAGTCTGCGT CGAACCATCA GCGGTTCTGG TGGGGCGACT
GTCGCCGCCG TGCCACCTGG CTATTCGCTG CGTCTCATGT CGAGCACGCT CGACCGGAAC
GTTTTCGAGC AGCTGCTTAG CTCGGCGAAG CAAGCGACCA GCCGCGGCCG GCACGAGATC
GCAGCTGACA TACTGAGTCG AGCATTGGCG CAGTGGCATG GTCCGGCCCT CGGCGGCGTC
GAGAGCAGCT TGCTCGACAG TGAGGCGGCG CGGCTAGAGG AGCTACGTCT CACCGCCCTA
GAGGATCGTG TCAACGCGAA CATAGCGCTC GGGCGACTGG CCGAGGTAGC TGCCGAGCTG
ACCGACCTGG TGCGAAAGCA CCCGTTCCGT GAACGGCTGC ACGGACAGCT GATGACTGTG
CTCTGTGGCC TCGGCCGGCA GTCCGAGGCT CTGCTGGTCT ATCGAGACGC ACGCCAGAGC
CTAGTGGAGG AGCTCGGCGT CGAACCGGGC CCTGAACTGC GGGCACTCCA CACCCAGATT
CTGCGAGGCG GCAGCGGAGG TCTCACAATG CCTTCAGCTG ATCGCCTCAA CACCTTGTCT
CAGCGATCCA GCCACGATGA ACAACTAGAG GGTCGGCCCG CGCAGCTGCC AGCAGTCCCT
GCCGACTTCA CCGGTCGGGC AGGTGAAGCG AAAGAGCTAG TTGCAAACCT CACTGCGGCT
GCTAACAACG GCCGGGTCCC GGTGCAGCTG ATCGTCGGTG GCGCCGGAAT GGGTAAGTCC
GCCCTCGCGG CCCATGTCGC GCACCAGATC ATCGACGAAT ACCCCGACGG CCAGCTCTAC
GCGGACCTGC GCGGCCTCGA CGGAACCCCG GCAGAGTCGC ACGAGATACT GGGCAGCTTT
CTCCGTGCTC TCTCACCGGG GAACCCGACG CTACCAGAGA GCACAGTGGA GCGGGCAGCC
CGATACCGCA CTCTGCTCGC TGAGCGGCGG ATGCTTGTCG TGCTTGACGA CGCCCGCGAC
GAACGCCAGA TCCGGCCTTT GCTTCCAGGG ACGGAGACCT GCGGTGTGCT GGTGACGGCC
CGTCGCCGTC TCGCCGGCCT AGCCGGTTGC CAGGTCCTGG AGTTAGAGGG ATTGCCGGAC
ACCGACGGTC GGCTACTCTT TGCCTCATTG GCTGGCCTGG ACAGAACCAG TGCCGAACCA
GAGGCGACCC GACAGGTGGT GCAACTCTGC GGCGGGCTAC CGCTAGCCCT GCGCCTAGCC
GGCGCCCGGC TAGCCAGCAG GCGTCTGTGG ACCGTGCGCT TGCTCGCCGA CCGGCTGGCC
GACGAAAGCC TGCGGCTCGA CGAGCTCAGC GCCGGTGACC ATGACATGCG CGACAGTATT
CGGCGCAGCT ACAACCAGCT TGACTTCCGG CAACGGGCGG CGCTTGGGAT CTGCGGGCTG
CTCGGTCCTC GCGACATTTC CCCCTGGATC CTGTGCACCG CGCTTGCCAT CTCACCTATC
AAGGCCGAGC GTGTCATGGA AGGTCTGGTT GACGCGTACC TGATGGACGT GGTCCGGGTT
GACGAAGTCG GGCAAGCCCA CTATGCCGTA CATGATCTTG TGCGCCTATA TGCACGGGAG
CGAGCACCCG CCGATGGCTT GATCGCCAAG GCTGCCGGGG ACAGGCTAGA CGCCTGGCTG
TTGCCGCGCC AAGCTGTCCT AGACGGTGCT GCCCCCATCT CTCAACAACC AGGAGTAGAT
CTACCCGCCA ACTCCCTCTA TGCCAGCATT AGGGAAGCAG GTTAG
 
Protein sequence
MEFHILGPVE LRVDGQVTAL GGAKPRTLLA TMLVHHDQVI AADRLIEALW GASPPSRARS 
ILQTYVSSLR RTISGSGGAT VAAVPPGYSL RLMSSTLDRN VFEQLLSSAK QATSRGRHEI
AADILSRALA QWHGPALGGV ESSLLDSEAA RLEELRLTAL EDRVNANIAL GRLAEVAAEL
TDLVRKHPFR ERLHGQLMTV LCGLGRQSEA LLVYRDARQS LVEELGVEPG PELRALHTQI
LRGGSGGLTM PSADRLNTLS QRSSHDEQLE GRPAQLPAVP ADFTGRAGEA KELVANLTAA
ANNGRVPVQL IVGGAGMGKS ALAAHVAHQI IDEYPDGQLY ADLRGLDGTP AESHEILGSF
LRALSPGNPT LPESTVERAA RYRTLLAERR MLVVLDDARD ERQIRPLLPG TETCGVLVTA
RRRLAGLAGC QVLELEGLPD TDGRLLFASL AGLDRTSAEP EATRQVVQLC GGLPLALRLA
GARLASRRLW TVRLLADRLA DESLRLDELS AGDHDMRDSI RRSYNQLDFR QRAALGICGL
LGPRDISPWI LCTALAISPI KAERVMEGLV DAYLMDVVRV DEVGQAHYAV HDLVRLYARE
RAPADGLIAK AAGDRLDAWL LPRQAVLDGA APISQQPGVD LPANSLYASI REAG