Gene Sare_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1947 
Symbol 
ID5705766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2241682 
End bp2243205 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content69% 
IMG OID641271452 
ProductXRE family transcriptional regulator 
Protein accessionYP_001536823 
Protein GI159037570 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.505699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0597402 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGA CTCCACGAGA ACTCGCACCC CACATCTCCG CCAAGCACTA CTTCGGCGCC 
CAACTGCGAT CATGGCGAGA ACAGCGCGGC TGGTCACAAG CGCGGCTCGG CGAGCACCTG
CACGTCAGTT CGGATCTCAT CGCGAAGATC GAGAAGGCCC TGCGCTGGCC GACCCCGGAA
TTCGCAACCG CCTGCGACAC CGCGCTCGCC GCAGGAGGCG CCCTCACCAA CCTGCTTCCG
CTGGTCGAAC TGGAGCGACA CCAGGAGCGT GCCGCCGTGG CCACCACCGC CCGCGCGGTG
CGCGCCGCCG TCTCCCAGCA GCGGCGCGGC ACGCCAACGG CATCCGCGTT CGCCGCCTCA
CCGGGTGAGA TCGCCACTGG TGCGGCAGCG GATTGGGCAT CAGCGCTGCC CGGTGTGCAA
CTCACGGCGC CCCCACCGGC ATGGCCGATC GACCGACTGC TGACCCTGCC ATACGGCCGA
GCCGTACCCG CTACGACCCT GACCCTCACC AGCATTCCCG TCCAAGACCG CTGCCACGCC
GCCCGCTATC CACGGACGCT CAACGGTGGC CACGCCGCCG CCTTCGGCCT GCGCGATCTC
GTGGCGACGG AACTGCTGGA CGGGGACGAG CCCATCCTCG CCGTGGCCAC AACGCCCACC
CTTGGATTGG CGGTACCGGC ATACCGCCTC GACGCCTTCA CCATCGGCAT CCTGTGGGCC
CTGTGCGGCA TGGACGACGC TCTCCTGGCC GATGATGCCG CGCTCGCCGA CAGCGTCCCC
CAACTCCGCC GCTACGCCCA CCTACCCGGC TCCGCAGTCA GCCGCAACGT CGCCGCCGAC
CTCAGCACCT TGAGCCAGAT GTGGCTTGGC TCAGACTTCT GCGCCCGCTA CATCAGCCAC
CGCCTCGACG CCGCGACCGA CCAGCCCGTC TTCTGGACCC GCGAGCAGTA CGGCGAAGAA
GCCACTACCT GGCTGCTCTT CCGCCACAAG ATCGATTATC TGCACGCCAC AAGCGAACGC
TTCACCACAC CCACCGCCCC GGCCGCCCGC GTCTTCTGCA TCCCAGAAAC CGCCGTGCGG
GGCAGCCCCC ACGCGGAACG CATCCTGCTG CTGCTCTCGG CAGCGTTCAT GGAATCCCTG
CGCATCGCCG TGCACGTGAG TCCAGACCCG GCGTACGCCA CCGTCGAAGG CTTCGTGCTG
ACCCCACACA CGCAGGTCAT TCTCGCCAAC TGGGTGCGCG CCGATGGACT CTGGCACGTC
GACGCCCTCG ATCGCCGTGT CGCGCTACGC CGCTACGACG ACGTTGCCCG CAGCGGCCAA
GCCGGCTCCA TCACCGCATC GGGCCGGTCC ATTCGCCGCC TACGAGTCCT CGCCGAGTAC
CTCGGCCTGG AATGGCCCTG GCTGCGTCGA CGCTGCACCG AATTGGCCGC CGTCGGCATC
GACGGGATGA TCCGACCGCG CAGCCGGTTA CTGACCACTG ACGGGATCAA CGCAGCCTGC
CGCTACCTGG CCAGCCTCCC TTGA
 
Protein sequence
MAQTPRELAP HISAKHYFGA QLRSWREQRG WSQARLGEHL HVSSDLIAKI EKALRWPTPE 
FATACDTALA AGGALTNLLP LVELERHQER AAVATTARAV RAAVSQQRRG TPTASAFAAS
PGEIATGAAA DWASALPGVQ LTAPPPAWPI DRLLTLPYGR AVPATTLTLT SIPVQDRCHA
ARYPRTLNGG HAAAFGLRDL VATELLDGDE PILAVATTPT LGLAVPAYRL DAFTIGILWA
LCGMDDALLA DDAALADSVP QLRRYAHLPG SAVSRNVAAD LSTLSQMWLG SDFCARYISH
RLDAATDQPV FWTREQYGEE ATTWLLFRHK IDYLHATSER FTTPTAPAAR VFCIPETAVR
GSPHAERILL LLSAAFMESL RIAVHVSPDP AYATVEGFVL TPHTQVILAN WVRADGLWHV
DALDRRVALR RYDDVARSGQ AGSITASGRS IRRLRVLAEY LGLEWPWLRR RCTELAAVGI
DGMIRPRSRL LTTDGINAAC RYLASLP