Gene Sare_3410 details


Gene Information

Locus tag: Sare_3410
Symbol:
ID: 5704019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3939500
End bp: 3940957
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 72%
IMG OID: 641272837
Product: transcriptional regulator
Protein accession: YP_001538203
Protein GI: 159038950
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
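
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS includes its terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3939500, 3940957)
print(length)                     # 1458
print(protein_length_aa(length))  # 485
```

Both results agree with the Gene Length and Protein Length values listed in the record.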


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.723571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0095682
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACGAGTC AGATACGCGG AAACCAATTG GCCCGGCTAC TCGGCCAGTG GCATGCGCTT 
CCCGGTCGTC GGCGCAGCCC CGACTATGCG GCGCTGGCCG CGGCCATCCG GGGGCTGCTC
GCCGACGGGC GGCTTCCGCT GGGCGCCCGC CTGCCGGCCG AGCGCGAGCT GGCCGAGGCC
CTGCGGATCA GCCGCACCAC GGTCACCGCG GCGTACCGGG AACTGCGGGA CAGCGGTCAC
CTCGCCAGCC GGCGGGGCGC GGGCAGCTGG ACCATGCTTC CCGGCAACCA CCGGATGGCG
GGTAACGGCC TGTGGACTCC GCTGGACGAT CGCGACATGC TTGATCTTGG GGTCGCCGCC
CTGGCCGCCC CGCCCGAGCT GTTGCCCGCC GCCCAGGCCG CCGCCGAGGA CCTGCCCCGC
TACCTGGGTG GGGCGGGATA TCACCCGACC GGCATCAGCG AACTGCGGGA GGCGGTCGCC
CGCACGTACA GCAACCGGGG ACTACCCACC TCACCCGAAC AGATCATGGT CACCAACGGC
ACCCAGCATG CTCTGGACCT GGTGCTGCGC CTGACGCACA ACCCTGGTGG CAGCGTCCTG
GTGGAGGCCC CCAGCTATCC GAACGCGCTG GCCGCGCTGC ACGCCCGTCG CGCCCGGATC
GCCACCCACG GCCTCGCCCC GGACGAACCC GGGTGGGACG CGGACCTGTT GCTCGGCACC
CTACGCCAGG GGCGACCCAA ACTGGCCTAC CTGATCCCCG ACTTCCAGAA CCCGACCGGC
CACCTGATGT CGGCGGAACT GCGGGAACGG CTGGTCGCCA CCGCCCACGC CGTCGGCGCC
GACCTGGTGA TCGATGAGTC CTTCGTGGAT CTGTCCCTGG ACGGTACGGT GATGCCCCCG
CCGACGGCCA GTTTCGATCG GCACTCCCGC GTGGTCACCG TCGGGGGGAT GAGCAAGGCG
TACTGGGGCG GGCTGCGGAT CGGCTGGATC CGCGCGTCCG CGCCGCAGGT GCAGCGGCTG
GCCGCCGCCC GGGTCGGCGT GGACATGGCG AGTCCGGTGC TGGACCAACT GGTCGCCGTT
CACCTGCTGG CGCAGAGTCC GACGATCGTC GCGGCCCGGC GGGCGCAGCT CACTGCGCAG
CGTGACGTGC TGCTCGGCGC CCTCGCCGAC CGCCTGCCCG AATGGCGGGT GACTGTGCCA
CACGGTGGGG TGACTCTCTG GGCCGAACTC GATGGTCCGA TCTCCAGCGC GCTCGCCCGG
GCCGCCGAGC AGGCCGGCGT ACGTCTTGCC CCCGGCCCCC GTTTCGGCCT CGACGGAACG
TTGGAGCGGT TCCTGCGGTT GCCGTTCACC CTGCCCGTGG CCGACCTGGT GGAGGCGGTC
GACCGGATCG CCGCCATCCG CTACGACCTT GACCGCGTTG GCCGGCCGGG CTGGTCGGAG
CCCGCTGTCA TCGCCTGA
 
Protein sequence
MTSQIRGNQL ARLLGQWHAL PGRRRSPDYA ALAAAIRGLL ADGRLPLGAR LPAERELAEA 
LRISRTTVTA AYRELRDSGH LASRRGAGSW TMLPGNHRMA GNGLWTPLDD RDMLDLGVAA
LAAPPELLPA AQAAAEDLPR YLGGAGYHPT GISELREAVA RTYSNRGLPT SPEQIMVTNG
TQHALDLVLR LTHNPGGSVL VEAPSYPNAL AALHARRARI ATHGLAPDEP GWDADLLLGT
LRQGRPKLAY LIPDFQNPTG HLMSAELRER LVATAHAVGA DLVIDESFVD LSLDGTVMPP
PTASFDRHSR VVTVGGMSKA YWGGLRIGWI RASAPQVQRL AAARVGVDMA SPVLDQLVAV
HLLAQSPTIV AARRAQLTAQ RDVLLGALAD RLPEWRVTVP HGGVTLWAEL DGPISSALAR
AAEQAGVRLA PGPRFGLDGT LERFLRLPFT LPVADLVEAV DRIAAIRYDL DRVGRPGWSE
PAVIA
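
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any analysis. A minimal sketch, assuming the text is pasted in as a plain string, joins the blocks and computes the G+C fraction of a nucleotide sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage with the first two blocks of the gene sequence above.
fragment = clean_sequence("ATGACGAGTC AGATACGCGG")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.55
```

Applying the same two calls to the full gene sequence gives a value that can be compared with the GC content field in the record.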