Gene Sare_2197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2197 
Symbol 
ID5708192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2529732 
End bp2531624 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content72% 
IMG OID641271678 
Producttranscriptional regulator 
Protein accessionYP_001537049 
Protein GI159037796 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.774056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCGGA CCGAGGACGA GCTCCGCATT CAGATCCTCG GTCCTGTCCA GGCGACCATC 
CGCGGTGACG CGGTCGACCT GGGCCCTCCC AAACAGCGGG CGGTTCTCGC CCTGCTGGCC
CTGCGGGCCG GCGGGCACGT GCCGCTCGAC GATCTGATCG CCGCGCTCTG GGCGGGACAG
CCACCGGCCC GGGCGGCCAA TCTGGTGCAC ACCTACGTCG CCCGCCTGCG CCAGACGCTG
GAGCCGGACA CGCCACGTCG GCGGCGGACC AACGTCATCG CATCGGTGCC GGGCGGGTAC
CGGTTGGCTG TCGGCGCGGA GCAGCTCGAC CTGCACGCGT TCCGTGCCGG AGTCCGGGAG
GCTGCGGCCC TACGCGAGCG CGGCGAGCCG ACCAGGGCCT TCGCCCGCTT CGGGGAGAGC
GTCGGCCGGT GGCGGGATCC ACAGGTCGGC GACCTGGCCG CGCTCCTGGT CGAGCAGGAC
GACCTCGGTC CACTGCGGCA GGAGTTCCTG GCCGCGGCGC TCACCTATGT CGGCCTCGGC
CTGGATCTGG GCCGACCGGA GGCGATCCTG CACCTCGCCG AACGGCTGGC GCTGACCGAG
CCGCTCAACG AAAGGGTGCA GGCCCGGCTG TTGCAGACGC TGGCACGAAT CGGCCAGCGC
GCCTGGGCCA TCGAGCGCTA TGCCGAGGTC CGCGAGCGAC TCCGGCTCGA CCTCGGCGTG
GACCCGGGGC CCGAGCTGTC CGCGGCGTAC CGCGAGGTGT TGGATGCCGA GCTGATGTCG
TCCGATACCG CCCACCGGAC CTCGCCCCAG GTGCCACCGT GGCGGGGAAC GGTGCCCCTG
ATCGATGAAC TGACCGGTCG TTCGGCTGAC CTTACCGCGA TCAACGACCT GCTCGACGGG
TACCGGCTGG TGAGCCTTAC CGGCCCGGCT GGGGTGGGCA AGTCCGCGCT CGGCCTGGCC
GCCGCGGAGC GGCAGCGAAA GCGGCACGCC GACGGCGTGG CGGTGGTCGA CGTGACGAAC
GTCCGCACCG GACACGCCCT CACGCAGGCG GTGACCGCGG TCGTCGTCAA CGGTCCACTG
CAGGCAACGG GTACACCCGT GTCCCTGGTG CGCCAGTTGC ACGACCGAAG CCTGCTGCTT
GTGATCGACA ACGCCGAGTT GGTCACCGAC GAGGCGGCAG AGCTGACAGA CGAGTTGCTC
CGCGAATGTC CCGGACTCAC CGTCCTGTTG ACCTCCCGGG AGCTACTCGG GATGCGGTAC
GAGGCGGTGT ATCCGGTCCG GCCGTTGCGA ACCGACCCCG CACCGGGGAC GTCCGCGCCT
CCGCCCGCCC AACAGTTGTT CGCCCGGCGG GCCACGCAGG TGCAGCCGAG CTTCCGGCTC
GACGAGTCCA CCCTGCCCGG GGTCACCGCG GTGTGCCGGG CGCTCGACGG CCTCCCGCTC
GCCATCGAGC TGGCCGCGGC GTGCCTGCGC ACCCAACGGC TGGACACCCT GGTTGACGTC
GTGGCTGATT CGCTGCGCTG GCTCCAACCA CCGCGACGGG GGGTGCCGAG ACACCATCGG
TCGCTACGGG CCGCCGTGCA CCGCAGCATC GAACTGCTCG ACGCGGCGGA GCAGCGGTGT
TTCGCCGCGC TCGGGGCGAT GCCGGCCGAG TTCGACCTCG CGGCCGCTGC CAGCGCCAGC
GGCGCACTGG TCGGTGAACG CGCTGCGGTG CAGGTACTCC TGGATCGATT GGTCGACAAG
TCGGTACTGG AGGTCCGGCA CGGTCCGGCC GGGAGGCAGT ACCGCATGCT CGGGACGGTT
CGCGCAATGG CCCGACAGCT ACTCCAGGAG CAGGGCTCGA TCGGGGCATC TCCCTCGACC
CGGTGCGCGT GCGCCTGCCA CCTGGACCCC TGA
 
Protein sequence
MNRTEDELRI QILGPVQATI RGDAVDLGPP KQRAVLALLA LRAGGHVPLD DLIAALWAGQ 
PPARAANLVH TYVARLRQTL EPDTPRRRRT NVIASVPGGY RLAVGAEQLD LHAFRAGVRE
AAALRERGEP TRAFARFGES VGRWRDPQVG DLAALLVEQD DLGPLRQEFL AAALTYVGLG
LDLGRPEAIL HLAERLALTE PLNERVQARL LQTLARIGQR AWAIERYAEV RERLRLDLGV
DPGPELSAAY REVLDAELMS SDTAHRTSPQ VPPWRGTVPL IDELTGRSAD LTAINDLLDG
YRLVSLTGPA GVGKSALGLA AAERQRKRHA DGVAVVDVTN VRTGHALTQA VTAVVVNGPL
QATGTPVSLV RQLHDRSLLL VIDNAELVTD EAAELTDELL RECPGLTVLL TSRELLGMRY
EAVYPVRPLR TDPAPGTSAP PPAQQLFARR ATQVQPSFRL DESTLPGVTA VCRALDGLPL
AIELAAACLR TQRLDTLVDV VADSLRWLQP PRRGVPRHHR SLRAAVHRSI ELLDAAEQRC
FAALGAMPAE FDLAAAASAS GALVGERAAV QVLLDRLVDK SVLEVRHGPA GRQYRMLGTV
RAMARQLLQE QGSIGASPST RCACACHLDP