Gene Sare_4854 details


Gene Information

Locus tag: Sare_4854
Symbol:
ID: 5707633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5508643
End bp: 5511003
Gene Length: 2361 bp
Protein Length: 786 aa
Translation table: 11
GC content: 70%
IMG OID: 641274250
Product: SARP family transcriptional regulator
Protein accession: YP_001539595
Protein GI: 159040342
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
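
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes a terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5508643
end_bp = 5511003
gene_length_bp = 2361
protein_length_aa = 786


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 2361
print(expected_protein_length(gene_length_bp))  # 786
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.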


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0558013
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCGACGA CGATCACCAA TCGCACTCTC CAACCGCTGG CCATGCCGGC AGAGCACGCA 
GGCATCACCA TTCGCTTACT GGGACCACTA CAGGCCTGGC AGGACGACAC CGAACTCGTG
CTCGGCTCTG GCAACCGGAC CGCCGTCCTC AGCTTCCTCG CACTGCACGC GAACCATGCC
GTGACCCGGG AACAGCTGAT CGCAGCACTG TGGGGTGACG ACCCGCCGGC CAGTGCGACC
GGAAACCTCT ACACGTACAT TTCCTCGTTA CGCCAGATCC TGGATCCGTG TCGACACCGG
TGGTCGGCTG GGCAGGTGCT CACCTCCGGC GGCGGCACCT ATCGCCTGCG GGTACGGAAG
CAGGACGTCG ACGCCTTCCG GTTCGAGGCA CTGCGCGAGG AGAGCAAACG GCACCGGACC
ATCGGCGACA GCAGCACGGA ACTGGCCTCA CTCACCGCCG CACTGCGGAT GTGGCAGGGC
ACGGCGCTCG CCGGCGTACC CGGCCCCTTT GCCGAGGCCC AGCGGCAACG CCTGGCGGAG
CTCCGGTTGA CCACCGCGGA ACGGCACGCC ACGCTGCTCG TCGAAACCGG CCGTCACGAC
GAGGCCATCT CGACGCTCCG CACGCTGATC GAGGCTCATC CAACTCGAGA GAACCTCGCT
CCCCTGCTGA CAGCGGCTCT CCACGCCGCG GGCCGAGCGG ATGTGCCACG TCGTCCGCGC
CCTCCCGTAC CGACGGAGCC CGCGGGTCAC GTACCAGCGT CACCACGACG AACCGGCACC
AGGTCGCCGG TACGAGCAGC CCCTGGATCG CTGGCCGGCC GCGAGACAGA AATCCGCTGG
CTACGTCGAG CCGTCGTCGA CACGACCCGC GGCCACGGGC GCAGCATCCG CGTCGAGGGC
TTCACGGGAA TGGGCAAGTC GGCACTACTC GCCGTGGCAC TGCCGAGTAC GGCACCGTCC
ACCTGCCGGA TCGGTTGGGC CGTCGGCAGG GAGTTGTCAC AGCGCGTACC GCTGGGACTC
CTGCTGGAGT GCATGGAGTC GGCGTTGGCT GGTGAGCCCA GCAGCGAGCT GGTCCGGCAG
ATCTCGCTGG CCACGAGCCC GTCCATGGGC AACTCGGCCA CTGCGCCGGT TGCCCGGGTG
GTCAGCCTGG TACGACAGGT CGCCCAGCAA GCTCCGCTGG TCCTCGTGGT GGACAACCTG
CATTGGGCTG ATCCACACAC CGTGGAGGCC TGGACGGCCC TACACGAGGT GACGACGCAA
CTGCCGCTCC TTCTCATCGC TACCAGTTGG CCCGGCACCG CCACGCTCGG CACCGCCACG
CTCGGCACCG CCACGCTCGG CACCGCCGAG TTCGGCACCG CCGCCGACGA GGTCCACCAG
CTGGCCCCCC TGAACCGGGC GGCCAGCAGC CACCTGGTGC GTACGGCTGC TCCGGAACCG
CCGGAATCCC CGGAGCTGGA CCATCTCGTG GCGGCCGCCG GGGGTAACCC GTGCTACCTG
TGGCACCTGG CCACAGCGGG TGCCGACCTC AACGGCCAGC TTCCCGCAAG CCTCGTCCGG
GTGGTCCACA CCCACCTCGC CCCATTCACC CAGCCGACCC GGGAGGTACT GCGCGCGGCG
GCCTTCCTCA CTGCTGGACC TGCCGCGCAC GCCGGCCCCG GCTGTTCGCT CAGCGAATTG
GCCGTGGTGA CCGAGCGTCC CACCGCCGAG TTGGTCGAGC TGCTGACGCC GGCCGCACGG
GCCGGCGTCG TCGCCATCAC CGGTGACGGG CTGACATTCC GCCACCCGGT CGTCCCGCGA
GTGCTGCACG AGGGAACGGC CAGCGCGCTA CGAGTCCTGC TGCACCGATC CTTCGCCGAG
CGGATAGCAG GCGCCGGCGG GCCACCCGAG CGGGTCGTCA CCCAGTTACT CGCTGACGCG
GTGCCGTTGG ACGTCACGCT GACCCGCTGG CTCACTACTC ACGTCGAGCA GCTCACCGCC
CGCGCGCCCC GGATCACGGT GACCATCCTG CAACGGGCCC ACGCCCAGTG CACGATGCCG
CCCGCCGACC GCATGACCCT CACCATCTGG CTGGCCCGGC TGCTCCTGCG ATTGGGTCGC
AACTGCGCCG CCGAGGCCGG GTGGGTTGCC TCCCGCACCG ATGACCTCGA CGTGGAAGGC
GAGATGCGGT GGATGGTCGC CCTGACCTAT GAGAAGCGCG GCGAGCACCA GACCGCCGCG
GACATCGCCC GTTCGGTGCT GCACGATCGC CGTGTCCCGG CTCCGTGGCT CGATCGGTTT
CGCCTCATGC AGGCGCGGCT GCGCCAACAC CTGGCCGACC GGCACGACGA CTGTCACCCC
GAAATCGTCA CGTGTACGTA A
 
Protein sequence
MSTTITNRTL QPLAMPAEHA GITIRLLGPL QAWQDDTELV LGSGNRTAVL SFLALHANHA 
VTREQLIAAL WGDDPPASAT GNLYTYISSL RQILDPCRHR WSAGQVLTSG GGTYRLRVRK
QDVDAFRFEA LREESKRHRT IGDSSTELAS LTAALRMWQG TALAGVPGPF AEAQRQRLAE
LRLTTAERHA TLLVETGRHD EAISTLRTLI EAHPTRENLA PLLTAALHAA GRADVPRRPR
PPVPTEPAGH VPASPRRTGT RSPVRAAPGS LAGRETEIRW LRRAVVDTTR GHGRSIRVEG
FTGMGKSALL AVALPSTAPS TCRIGWAVGR ELSQRVPLGL LLECMESALA GEPSSELVRQ
ISLATSPSMG NSATAPVARV VSLVRQVAQQ APLVLVVDNL HWADPHTVEA WTALHEVTTQ
LPLLLIATSW PGTATLGTAT LGTATLGTAE FGTAADEVHQ LAPLNRAASS HLVRTAAPEP
PESPELDHLV AAAGGNPCYL WHLATAGADL NGQLPASLVR VVHTHLAPFT QPTREVLRAA
AFLTAGPAAH AGPGCSLSEL AVVTERPTAE LVELLTPAAR AGVVAITGDG LTFRHPVVPR
VLHEGTASAL RVLLHRSFAE RIAGAGGPPE RVVTQLLADA VPLDVTLTRW LTTHVEQLTA
RAPRITVTIL QRAHAQCTMP PADRMTLTIW LARLLLRLGR NCAAEAGWVA SRTDDLDVEG
EMRWMVALTY EKRGEHQTAA DIARSVLHDR RVPAPWLDRF RLMQARLRQH LADRHDDCHP
EIVTCT