Gene Sare_3671 details


Gene Information

Locus tag: Sare_3671
Symbol:
ID: 5707193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4231323
End bp: 4234469
Gene Length: 3147 bp
Protein Length: 1048 aa
Translation table: 11
GC content: 69%
IMG OID: 641273093
Product: SARP family transcriptional regulator
Protein accession: YP_001538457
Protein GI: 159039204
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
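
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene sequence ends in a stop codon that is not counted in the protein length (the listed gene sequence does end in TGA). The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4231323
END_BP = 4234469
GENE_LENGTH_BP = 3147
PROTEIN_LENGTH_AA = 1048


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the coordinate span gives 3147 bp, and 3147 / 3 - 1 gives 1048 residues.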


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0046189
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGTCGG GTGTTGAGGT CAATATTCTC GGCGGCGTCG AAGTGATCGG ACCGCGCGGC 
CGGACCGAGT TGGTGGGACG ACGGCATCGT GCCTTGGCGG CTGCCCTGGC CCTGCGGCCC
GGCGCGGTGC TGCCGCTGTG GCGGCTCGTC GAGGCGCTGT GGGGGGAGCG TCCACCACGA
ACCGCTGTCC GGTCGCTGCA CAGTCACGTG GCGCGGCTGC GCCTCGCGCT CGACGCGAGC
GGGTTGGGCG GAGTGCTGCA GACCCGGGAG CCGGGCTACC TACTCGCCAT CGATGCGACG
GTGGTGGACG CATGCCGCTT CGAACAGCAA ACGAGGGCGG CCCGGGACGG CCTGGCCGCC
GGAATGGCCA GTTGGGCTGC CGACGCCATG GAACAGGCGT TGGCTCTGTG GCGCGGTGAC
GCACTCGCCG ACGCGGAACC GATCGGATGG ACGGCAGCCG AGGCGGCGCG CTTGGAGGAC
CTGCGGCTTG CCGCGCGGAT GGACCTCTGC GAGGTACTGA CCTGCCTCGG CAGAACGGGT
GAGGCGCTCG GCGAGGTAGA GCGGCTGTTG GCAGCAGACC CGACCCGCGA ACGGCTGGTG
GGCCTGCGCA TGCTCGCCCT GGCGGGATCC GGGCGGCCGA CCGAGGCGCT GAACGTCTAC
CAACGGCTAC GCGTTCGGCT CGCCGATGAA CTCGGGGTAG ATCCGTCGCC CGAACTGGCA
GATCTGCACA CCGCACTGCT ACGTGGCGCC GCGCCTGGTG AGCTGCGGGC ACCGGGTACG
GTTCTCGGCC GGAACTCCGT CGCGGCGACG GGCCGGAACT CCGTCGCGGC GACGCCACCT
CGACCGGCGC AGCTTCCGGC ACCGGTGGGG TACTTCACCG GCCGCGTCGC CGAGCTGGGT
GAGCTGAGTA GTGTCATCGA AATAGGCCAC GATGACGTCC GGCCCGTGGT GCTGATCAGC
GGCCAGGGCG GGATCGGCAA GACGTCGCTT GCGGTGCAGT GGGCCGCCAG CGTCACTGAC
CGGTTTCCGG ACGGTCAGCT CTTCGTCAAT CTCCATGGCC ACAACCGGGC CGATGCGCTT
GCCCCGGCGG AGGTGGTGGC AGTCCTGCTG CGGTGCCTCG GGATACCGGA TGATCGGCTT
CCTACCGGGC TGGCCGAGCG AGTCGCCCTG TATCGCACGA TCCTCGCCGA CCAGCGCATG
CTGGTCGTGC TCGACGACGT CGGTTCCACC GAGCAGGTGT TGCCGGTGAT CCCCGGCAGC
GCCGCGAGCC TGCTCGTGGT GACGAGCCGT AACAGCCTCG TCGCTCTGGT AACGCACACC
CGGGTACACA CCATCCTCCC TGAGCTGTTC ACCCAGGACG AAGCAACCGA TCTGATGGCA
AAGATGCTCG GCACCGAGCG GGTAGGCCGA GAGCGCGACG CGGTGGCCGG ACTCGCCAAG
CTGTGCGGTT GGCTTCCGCT GGCGTTACGG ATCGCGGCGG CAAAGCTCGC GCTGCGCCCG
GCTCAGCCCA TCGAGGTGCT CGTCGAGGAG TTGTCTGGCG GCGACCGGTT GGCCAACCTC
TCCGTCGAGA ACGGCAGCCG CGACGTCAGT GTGGTGTTCG CCAGTGCGTA CCAATCGCTG
TCGGTACCCG CGATGCGGCT GTTCCGGCTG CTCGGCCTGC ATCCCGGGCC GCACCTCGGC
GCAGCACTGG CTGCCGCGCT CTGCGGCCTG CCCGCCGACG TGCAGCGACA TGCGTTGGCC
GAACTCGTCG CGGCACACTT GGTTGCCGAG CCACGGCCCG GCCGATACCA GTTCCACGAC
TTGGTCCGGC TCTTCGCGCG GCGGTGTGCG CTTGCCGACG AGCCCGCGAG TACGCGTGCC
GAGGTGGCCG AGAAGTTGCT CGACTGGTAT CTCGTCGGTG CCGCAACGGC CACCCAGGTG
CTCGACAGCA ATCTCGACCG CGTAACCGCG ACGCTGCGTC ATCCGGCTCC GGAACTTCCC
TTTTCGGCCA CTCGCGAGCA CACGATCGCG TTTCTCCACT CTGAACGCGA CAATCTACTG
CCGATCGTGC GGTACGCGGT GGAGCACGAC CAGCCCGCTG CGGCATGCCA GCTGACCTAC
CTACTGACCA GCTACTTCGA CGTACATGGC GACTGGTCCG AGCGGGTGGT GATGTGCCAG
CACGCGGTCC GGGCCGCCCG TCGGCTCGGC GATCCGGTGC TCGAAGCCGA GATGCATCGG
GCGCTCGGCG TGGCCTACCG CACGACACAC CAGCTCAGCC AGGCACTCGA CAGCCACCAC
CACGCGTTGG CGCTGTTGCG GCCGCTCGGG GACAACCGCG GATTGGCGTA CGTCTACAAC
AACATCGGCG GCGCGTTCGT AGAAATGCGT CGTTTCACCG CTGCGATCGA GGCGTACCAG
ACCGCGCTGC GGCTGCACGG CCACTGTGGC AACCGGGCCG GCGCGGCGAC CGCCCAGCGC
AACCTCGGAT ACGTCCACGT CCGGATGGGC TGTGCCGATC TCAGCTTCGC CCCGCTGGAC
GCGGCGTTGG CCACCAGCCG GGCCATCGGT CTCCACCGGC TCGAGGCGAG CACGTTGAAC
AGCCTCGGCG AGGCGCACCT GCAGCAGCTG CGACACGATC GGGCGCTCGA CTGCTTCCAC
GAAGCCTTCG CCGTGAGCCG CAAAGCCGGC GATCGCCGCT ACCAGATGGT CGCGCTGGGC
GACCTCGGAC GCACCTACCT GGCCCACGGC GACCCCGCGT CCGCTGTAGA TCACTTTAAT
CGGGCACTGG CGATGAGCCG GAGCCTGGGT CATCGGCACA TCGAGGCACG CACCCTCAAC
CAGCTCGGCG AAGCGCAGCT GCGCCTGGCG AACCTCGACG AGGCCCGCCG ATGCCTGACG
GCAAGCGCCA GCCTTCGGCG GGCCGTGCCC GACCTGTACG AGCAGGCACA CGTGCAGCGC
AACCTCGGCG ACCTTGCGGA GCTGACGGGT AGCCGAGGTG CCGCAGAACG CCACTGGTCA
ACGGCTGTTC GCCTCTACCA CGAGGCGAGC GCGACCGATG AGGCGGAGCA GCTCGCCGGC
AAGCTGACCG ACGAAACCGA CCTGGGTGCC GTCCCCGTTC CCCCACGATC GCGTCAGTCG
TCGTCGACGA TGCCCATGGC CACCTGA
 
Protein sequence
MMSGVEVNIL GGVEVIGPRG RTELVGRRHR ALAAALALRP GAVLPLWRLV EALWGERPPR 
TAVRSLHSHV ARLRLALDAS GLGGVLQTRE PGYLLAIDAT VVDACRFEQQ TRAARDGLAA
GMASWAADAM EQALALWRGD ALADAEPIGW TAAEAARLED LRLAARMDLC EVLTCLGRTG
EALGEVERLL AADPTRERLV GLRMLALAGS GRPTEALNVY QRLRVRLADE LGVDPSPELA
DLHTALLRGA APGELRAPGT VLGRNSVAAT GRNSVAATPP RPAQLPAPVG YFTGRVAELG
ELSSVIEIGH DDVRPVVLIS GQGGIGKTSL AVQWAASVTD RFPDGQLFVN LHGHNRADAL
APAEVVAVLL RCLGIPDDRL PTGLAERVAL YRTILADQRM LVVLDDVGST EQVLPVIPGS
AASLLVVTSR NSLVALVTHT RVHTILPELF TQDEATDLMA KMLGTERVGR ERDAVAGLAK
LCGWLPLALR IAAAKLALRP AQPIEVLVEE LSGGDRLANL SVENGSRDVS VVFASAYQSL
SVPAMRLFRL LGLHPGPHLG AALAAALCGL PADVQRHALA ELVAAHLVAE PRPGRYQFHD
LVRLFARRCA LADEPASTRA EVAEKLLDWY LVGAATATQV LDSNLDRVTA TLRHPAPELP
FSATREHTIA FLHSERDNLL PIVRYAVEHD QPAAACQLTY LLTSYFDVHG DWSERVVMCQ
HAVRAARRLG DPVLEAEMHR ALGVAYRTTH QLSQALDSHH HALALLRPLG DNRGLAYVYN
NIGGAFVEMR RFTAAIEAYQ TALRLHGHCG NRAGAATAQR NLGYVHVRMG CADLSFAPLD
AALATSRAIG LHRLEASTLN SLGEAHLQQL RHDRALDCFH EAFAVSRKAG DRRYQMVALG
DLGRTYLAHG DPASAVDHFN RALAMSRSLG HRHIEARTLN QLGEAQLRLA NLDEARRCLT
ASASLRRAVP DLYEQAHVQR NLGDLAELTG SRGAAERHWS TAVRLYHEAS ATDEAEQLAG
KLTDETDLGA VPVPPRSRQS SSTMPMAT