Gene Sare_5101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_5101 
Symbol 
ID5704069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5775454 
End bp5776767 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content70% 
IMG OID641274493 
ProductGntR family transcriptional regulator 
Protein accessionYP_001539834 
Protein GI159040581 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000307944 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACCGGTA CGACACTCGA CGACTACACC GACCGCTACG CCCGACGGGT CCGGGGCATG 
ACCGCCTCGG AGATCCGGGC GCTCTTCGCG GTGGCCAACA GACCAGAGGT GGTCTCGCTC
GCCGGTGGCG CGCCGTACAT CGCGGCTCTC CCCCTGGACG CGGTCGGCGA GATGCTCGGC
CGGCTCGGCA CCGACCACGG TGTCACCACC CTCCAGTACG GCATCGGCCA GGGCACCCTG
GAACTACGTG AGCGGATCTG CGAGGTGATG GCGCTCTCGG GCATCGACGC CGCCTGCGGA
GCCTCCCCCG ACGACGTCGT CGTCACCGTC GGCGGCCAGC AGGCGCTGGA CCTCGTCGCG
CGACTCTTTC TCGACCCGGG CGACGTGGTA CTCGCCGAGG GACCGACCTA TGTCGGGGCA
CTCGGCGTGT TCCAGGCCGC CCAGGCACAG GTCGTACACG TCCCGATGGA CAGCGACGGG
CTGGTCCCGG AGGCGCTGGA GGCGGCGATC GCCGAGCAGG CACGTGCCGG GCGTCGGATC
AAGTTCCTCT ACACCATCCC CACCTACCAG AACCCGACCG GTGTGACGCT GACGGAGCAG
CGACGCGAAC AGGTGCTCGA CATCTGTGAA CGCGCCGGTC TGCTGGTGGT GGAGGACGAC
CCGTACGGCC AGCTCGGCTT CGAGGGCGAT GCCCCGGCCC CGCTGCGTGC CCGCCGCCGG
GACGGCGTCT TCTACCTGGG GACGTTCTCG AAGACCTTCG CGCCGGGGCT CCGGGTCGGA
TGGATCCTCG CCCCACACGC GGTGCGGGAC AAGCTCGTCA TCGCCAGTGA GGCGCAGATC
CTCTGCCCCA GCGGCTACGC TCAGGCGGCC GTGTCCACCT ACCTCGGCAC CATGCCGTGG
CGCGAACAGC TCAAGGTCTA CCAGGAGATC TACCGGGAAC GGCGGGACGC GTTACTCACC
GCCATGGCGG ACCTGATGCC GGACGGCACG ACCTGGACCC GGCCCGGAGG CGGCCTCTTC
GTCTGGGCCA CCCTGCCGGA CGGCCTGGAC TCGAAGGCGA TGATGCCCCG CGCCATCGCC
GCCCGGGTGG CATACGTGCC CGGCACCGGC TTCTACGCCG ACGGCACCGG TAACGGCGCC
ATGCGACTCA ACTTCTCCTT CCCGCCGCCG GATCGGATCC GGGAGGGTGT TCGGCGGTTG
GCCAGCGTCA TGGAGCAGGA CATCGCCATG CGCAGGGTCT TTGGCACCGT TGGCCATCCC
GGCTCGCGGC GGGGGCAGGC CGGTTCGGAC ACACCAGGAC CGGACTTGGC ATGA
 
Protein sequence
MTGTTLDDYT DRYARRVRGM TASEIRALFA VANRPEVVSL AGGAPYIAAL PLDAVGEMLG 
RLGTDHGVTT LQYGIGQGTL ELRERICEVM ALSGIDAACG ASPDDVVVTV GGQQALDLVA
RLFLDPGDVV LAEGPTYVGA LGVFQAAQAQ VVHVPMDSDG LVPEALEAAI AEQARAGRRI
KFLYTIPTYQ NPTGVTLTEQ RREQVLDICE RAGLLVVEDD PYGQLGFEGD APAPLRARRR
DGVFYLGTFS KTFAPGLRVG WILAPHAVRD KLVIASEAQI LCPSGYAQAA VSTYLGTMPW
REQLKVYQEI YRERRDALLT AMADLMPDGT TWTRPGGGLF VWATLPDGLD SKAMMPRAIA
ARVAYVPGTG FYADGTGNGA MRLNFSFPPP DRIREGVRRL ASVMEQDIAM RRVFGTVGHP
GSRRGQAGSD TPGPDLA