Gene Sare_4999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4999 
Symbol 
ID5705739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5667833 
End bp5668927 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content69% 
IMG OID641274392 
ProductGntR family transcriptional regulator 
Protein accessionYP_001539733 
Protein GI159040480 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000160838 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGCCG AGCAGTTGAT CTCCTTCGCC CGTGGCGCTC CCTCGCTGGA CATCGTCGAT 
ATCGAGGGGC TGAAGGCCGC CGCCGTCCGC GCCTTCGACG CCGACCCCGC CGGTGTGACG
GCGTACGGTT CCTCCGCCGG GTACCTTCCG TTGCGCGAGT GGATCGCGAA CAAACACGGG
GTCCAGGCCG ACCAGATCCT GGTGACCAAC GGATCGCTAC AGGCCGACGC CTTCCTCTTC
GACCACCTGA TCCGACCCGG CGACGCGGTG GTGGTGGAGC GCCCGACCTA CGACCGAACT
CTGCTGAATC TGCGGCGGAT GGGTGGTGAG CTGCACGGGA TCACCATCCA GCCGGACGGA
CTGGACACCA CCGAGCTGCG TAAGTTGCTG GAGTCCGGGG TGCGCCCACG GGTGGCGCAC
GTCATCCCGA ACTACCAGAA CCCGGCCGGC GTGACGCTCA GCCTCGACAA GCGGCGCGAG
CTTCTCGAAC TCGCCGCCGA GTTCGAGTTC ACTGTCTTCG AGGACGACCC GTACGCCGAC
ATCCGGTTCC GCGGCGAGGC GCTGCCGTCG ATGCTCTCGT TGGACAGCCA CAACCTGGTG
GTGCACGCGT CCAGCTTCAC CAAGACGGTC TGCCCGGGGG TGCGGGTCGG CTACCTGGTC
GGGCCCTCGG ACCTGATTGC CGACATCGCG AAGAAAGCGA CAAGTCTCTA CATCTCGCCG
GGCGTGGTGT CCGAGGCGAT CGTCCACCAG TTCTGCGTCT CCGGGGACAT CGACCGCTCG
ATCGCCACGG TCCGTCGGGC CCTCGGCGAG CGGGCCCGGG TGCTGGCCGA GTCGTTGCGG
CGGCACATCC CGCAGGCCCA GTTCGTCGAG CCGGACGGCG GCTACTTCCT CTGGGTGGAG
TTGCCGGAGG ACGTCCGGGT GGACCGGCTG GCCCCGGCCG CGGCGGAGCG AGGAGTCGCG
GTGGTGAAGG GCAGCGACTT CGTCCTCGAC GGTGGGCAGC ATGCGCTGCG GCTGGCGTAC
TCGGCGGTGA CCGCGGACCA GATCGATGAG GGTGTGCGCC GGCTCGCGGC GGCGATGGCG
GCCGTGCGCG GCTGA
 
Protein sequence
MTAEQLISFA RGAPSLDIVD IEGLKAAAVR AFDADPAGVT AYGSSAGYLP LREWIANKHG 
VQADQILVTN GSLQADAFLF DHLIRPGDAV VVERPTYDRT LLNLRRMGGE LHGITIQPDG
LDTTELRKLL ESGVRPRVAH VIPNYQNPAG VTLSLDKRRE LLELAAEFEF TVFEDDPYAD
IRFRGEALPS MLSLDSHNLV VHASSFTKTV CPGVRVGYLV GPSDLIADIA KKATSLYISP
GVVSEAIVHQ FCVSGDIDRS IATVRRALGE RARVLAESLR RHIPQAQFVE PDGGYFLWVE
LPEDVRVDRL APAAAERGVA VVKGSDFVLD GGQHALRLAY SAVTADQIDE GVRRLAAAMA
AVRG