Gene Sare_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1079 
Symbol 
ID5704347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1211122 
End bp1212165 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content70% 
IMG OID641270594 
ProductLacI family transcription regulator 
Protein accessionYP_001535978 
Protein GI159036725 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000248753 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCAGCGAC CGACAATCGC CGACATCGCC CGGCAGGCCG GGGTGTCCAA GGGTGCGGTC 
TCGTTCGCGC TCAACGGCCG ACCCGGCGTC AGCGCGACCA CCCGAGCACG CATCCTCCGC
GTCGCCGAAG AGATCAACTG GCGCCCGCAC AGTGCGGCCC GGGCCCTCGG TGGAGCCCGA
GCCGGCGCCG TCGGCCTGGT GATCGCACGC CCCGCGCGCA CCCTCGGGGT GGAGCCCTTC
TTCGCCCGGC TACTCTCCGG ACTACAGGCA GAACTCTCCA GCCAGTCGAT CGCGTTGCAC
CTGATGATCG TCGAGGACAC CCAGGCGGAG ATCGACACCT ATCAGCGTTG GGTGTCCGAA
CGCCGGGTCG ACGGCCTCGT GCTGATCGAC CTGAAGGTTC GCGACCCACG GATCGCCGCC
GTGGAGCAAC TCGGGATCCC GACGGTGGTG GTCGGTGGGC CGGGCAGGCA TGGCCGGGTA
TCCTCCGTGT GGGCCGACGA CCGGGCGGCG ATGCTGTCCG CGGTGGAATA CCTCGCCGTG
CTCGGACACA CCCGGATCGC GCACGTGTCC GGACTGCCCG AGTTCCAGCA CACCCAGCGC
CGGGCCAGGG CGTTGCGGGA CGCCGCCTCG CGGCTCTCCC TGCCCCGCGC ACGGTCCCTG
CACACCGACT TCAGCGATGC CGAGGGCGCG GCGGCCACCC GCAAGCTGCT CGCTGGGGCC
GACCGCCCGA CGGCGATCAT CTACGACAGC GACCTGATGG CGGTCGCCGG GCTGGGCGTC
GCGCTGGAGA TGGGGGTCGG CGTCCCGGAC GAGTTGTCGC TCATCTCCTT CGACGACTCG
GTGTTGGCCC AACTCACCCA TCCCTCACTC ACCGCACTGT CCCGCGACAC CTACCGGTTC
GGGGTGCAGG CGGCGCAGGC CATGGTGGCG GTCTTGGCCG ACCCCACGGC GACAAAGAAC
CACAAGACCG AGACACCACG GCTGATCACG CGGGAAAGCA CCGCACCGAC ACGGGGCAAC
CGGCTGTCGG ATAAATCGGT TTAG
 
Protein sequence
MQRPTIADIA RQAGVSKGAV SFALNGRPGV SATTRARILR VAEEINWRPH SAARALGGAR 
AGAVGLVIAR PARTLGVEPF FARLLSGLQA ELSSQSIALH LMIVEDTQAE IDTYQRWVSE
RRVDGLVLID LKVRDPRIAA VEQLGIPTVV VGGPGRHGRV SSVWADDRAA MLSAVEYLAV
LGHTRIAHVS GLPEFQHTQR RARALRDAAS RLSLPRARSL HTDFSDAEGA AATRKLLAGA
DRPTAIIYDS DLMAVAGLGV ALEMGVGVPD ELSLISFDDS VLAQLTHPSL TALSRDTYRF
GVQAAQAMVA VLADPTATKN HKTETPRLIT RESTAPTRGN RLSDKSV