Gene Sare_0089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0089 
Symbol 
ID5707059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp102219 
End bp103157 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content69% 
IMG OID641269615 
ProductAraC family transcriptional regulator 
Protein accessionYP_001535015 
Protein GI159035762 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00450491 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTACACAG TCGTCGTCCT CGCGCTGCCG GATGTGATTG CCTTTGATCT GGCCACACCG 
GTCGAGACGT TCGGCCGTGT CCGCCTGCCG GACGGCCGGC CCGGATACCG GGTCCTCGTC
GCAGGGCCCG ACGATGTCGT CGACGCCGGG CCGGTGCGGC TGGCAGTCAG CGAGCAGCTG
GATGCGCTCG ATCGCGCCGA CCTGGTCGTG GTGCCTGGCC GCAACAACCC CTTACGGCCC
TCCCCACCCT CGGTGCTCGC CGCCCTGCGT GCTGCCGCGA CCAGGGGCAC ACGCGTCGCC
TCCATCTGCG TCGGAGCATT CACGCTGGCA GAAGCAGGAC TACTCGACAA CATGAGGGCC
ACGACCCACT GGCTCGCCGC CGAACACCTC GCGCACCAAC ACCCGTCGAT CCAGGTGGAC
CCCGACGTGC TCTACATCGA CAACGGCAGC ATTCTCACCT CTGCCGGTGC CGCGTCCGGG
CTGGACCTGT GCCTGCACGT GATCCACACC GACTACGGTG CGGCGGTGGC CGCGGATGCC
GCACGCCTCG CCGTGGCCCC ACTGCACCGA GCCGGTGGGC AGGCGCAGTA CATCCTGCGG
AACCGGCCGC CCCTGCGGAC CTCAGTCCTC GAACCCGTCC TCGCCTGGAT CGAGACCAAC
GCGCATCGGG CCCTCACGCT CGCCGACCTC GCCGCCGCCG CGAACCTGAG CACACGCACC
CTGACCAGGC GATTCGCTGT CGAGACCGGA CAGAGCCCGA TGCAATGGGT CGCCGGCGTC
CGGATTCGTC ACGCCCAGGA GCTCCTGGAG ACCACCGACT ACACGATCGA CCGCATCGCA
AACCAGACCG GATTCACCAC CACGAGCAAC TTCCGTGCGC AGTTCCAGGA GGTCGTCGGC
ACCACACCAG GCGCCTATCG CACCACCTTC CGGCTGTGA
 
Protein sequence
MYTVVVLALP DVIAFDLATP VETFGRVRLP DGRPGYRVLV AGPDDVVDAG PVRLAVSEQL 
DALDRADLVV VPGRNNPLRP SPPSVLAALR AAATRGTRVA SICVGAFTLA EAGLLDNMRA
TTHWLAAEHL AHQHPSIQVD PDVLYIDNGS ILTSAGAASG LDLCLHVIHT DYGAAVAADA
ARLAVAPLHR AGGQAQYILR NRPPLRTSVL EPVLAWIETN AHRALTLADL AAAANLSTRT
LTRRFAVETG QSPMQWVAGV RIRHAQELLE TTDYTIDRIA NQTGFTTTSN FRAQFQEVVG
TTPGAYRTTF RL