Gene Sare_4979 details


Gene Information

Locus tag: Sare_4979
Symbol:
ID: 5706127
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5650028
End bp: 5651788
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 58%
IMG OID: 641274374
Product: putative transcriptional regulator
Protein accession: YP_001539716
Protein GI: 159040463
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
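
The coordinate and length fields above agree with each other, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5650028
end_bp = 5651788

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1761 0 586
```

A remainder of 0 indicates the gene length is a whole number of codons, as expected for an unspliced CDS.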


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0011144
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTCCCACG CCGTACAACA GGATCTTGAG GCCATCCTCG CCGGCACTGC GGCGCGGAAG 
CGTGAGAGCA GCATCCTCGA CTTCAAGGTG GCCAAATCCG ACCTTAAGGA CGCATGGGCA
GACCTGGCCG AAGCGGCTGT GTGCTTCGCG AACGCATCTG GCGGAACCAT CGTCGTCGGA
GTGTCCGACA CCCCGGGCGG CCACGGGGCG TTCATTGGGT GCGATCTAGA CGAGAATATG
TTGCGTCAGC GCATCTACCA TCTTACGATG CCGGGCCTTC TTGTCGAGGT AGACGTCGTT
CGCTTCGCTA ACAAGCGCCT CCTAGCAATC AGGGTGCCGG AAGGGTTGGA GGTGTACTCC
ACTACTCGCG GATACACCTA TCACCGAGTC AATGACGAAT GCCTCCCTAT GCGTCCCGCC
GAGGTAAGCC GCCTAGCAGA GGAGCGCAGA GGAGTTGACT GGTCTGCCGC GTCTAGTTCA
CGCTCTTTAG AGGACGTAGA TCCTCTTGCC CTTAGGCAGT GCAGGCGACT TCTTTCCAAT
TCCGTCGACT CTCGACGCCA GTCGTACGCA CGCCTTTCGG ACCATGACCT ACTCCGGTCA
CTGAAAGCAG TAGGCGACGA CGAAAGACTG ACTCGCGCGG GTGAACTGTT ACTATGCACG
GCTGCTTCGT CCGGCCCAGA AGATGCAGTA GTTTATCAAC ATAGAAAGAC TCAGGCAGGG
GAGCCCGACG CGATCATGCG TCTGGGCACC CCACTCGTTC TCGCATTTGA TGAGCTCCTA
CAGGCCATCC GCGCACGCCA GGGAATAACC CCAGTAACGC TGGCCGATGG ACAACAGCTT
CAAATTGAAG ACTATCCGAT GGCCGCAGTT CGGGAAGCCG TCGCTAACGC ACTAATCCAC
GGCGACTGGC GCGCCCGACT GCCTGTGTCA GTCGAACACT CGTCGCAGTA CTTGAAAGTA
ACATCCCCCG GTCCACTGGT AAGCGGCATT ACTGTCGACA ACATTCTGAC CAAGGGATCC
AGAGCCCGCC ACCCCGCCTT GGCCTCCGCC TTTCGCCTGC TTGGCCTGGC AGAAGAAGTT
GGACAGGGCG TTGATCGCAT GTACCGGGAG ATGATCCGGT CCGGGCGAGA CACGCCCCTC
ATCTCCGATA ACAACGACCA GGTAACGGTT CTTTTTCGCG GGCAGTCGCC CAATACCCGC
ATTACCAAAT TTCTAGCGAC GCTTCCTCCA GAAGAGCAGG ACGACACGGA CGCCCTCCTG
ATTGTCCTCG TCCTTTGCTC GAAGCGAACA ATCACAGCGA AGCAGCTAGC ACCGATCATT
CAGCGCTCCG AACTGGAAGG GCAGACTGTC TTGCGACGCC TGTCAAATGA TCCTTCCATG
CTATTGGAAC CGACTAGGGG AACCGCGAAT CGGACTCAAC CAACCTACCG ACTAACGGCT
GACGCCCTTA CCCGCCTAGG GAACGCCGTC GCTTACCACG GTCGCACAAG CGATGAGGTA
GACCGGAAAG TCATCGAACA TATGCGAGAC TACGGCGAGA TCAACAACCG AACCGTTCAA
CGTCTGTTCG ATGTCGACGT ATACGCTGCT CGCGACATTC TAAAAGACCT GGTCGAGAGG
CAAATCATTA CCCGAACCTC GGAGCAGACA CGCGGTGTCG CTGTCCGATA CGGACCGGGT
TCGCTCTTTC CAGCGGCAGG AAAGAAGGGA AAACCCCCCA AGAACAAGAG GGTTACCGAC
TTGGAAGATA AGCTGTTCTA G
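
The GC content reported in the Gene Information section is presumably computed from this sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases and that the spaces and line breaks in the listing are formatting only. The helper name `gc_content` is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    # Drop the spaces and newlines used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return gc / len(seq)


# The first two ten-base blocks of the gene sequence above.
print(gc_content("GTGTCCCACG CCGTACAACA"))  # 0.6
```

Passing the full gene sequence to the same function gives the value for the whole gene.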
 
Protein sequence
MSHAVQQDLE AILAGTAARK RESSILDFKV AKSDLKDAWA DLAEAAVCFA NASGGTIVVG 
VSDTPGGHGA FIGCDLDENM LRQRIYHLTM PGLLVEVDVV RFANKRLLAI RVPEGLEVYS
TTRGYTYHRV NDECLPMRPA EVSRLAEERR GVDWSAASSS RSLEDVDPLA LRQCRRLLSN
SVDSRRQSYA RLSDHDLLRS LKAVGDDERL TRAGELLLCT AASSGPEDAV VYQHRKTQAG
EPDAIMRLGT PLVLAFDELL QAIRARQGIT PVTLADGQQL QIEDYPMAAV REAVANALIH
GDWRARLPVS VEHSSQYLKV TSPGPLVSGI TVDNILTKGS RARHPALASA FRLLGLAEEV
GQGVDRMYRE MIRSGRDTPL ISDNNDQVTV LFRGQSPNTR ITKFLATLPP EEQDDTDALL
IVLVLCSKRT ITAKQLAPII QRSELEGQTV LRRLSNDPSM LLEPTRGTAN RTQPTYRLTA
DALTRLGNAV AYHGRTSDEV DRKVIEHMRD YGEINNRTVQ RLFDVDVYAA RDILKDLVER
QIITRTSEQT RGVAVRYGPG SLFPAAGKKG KPPKNKRVTD LEDKLF
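
The protein sequence corresponds to the gene sequence translated with translation table 11, in which GTG can act as a start codon and is then read as methionine. The following is a minimal sketch, assuming the input is a complete coding sequence that begins at its start codon. The names `CODON_TABLE` and `translate_cds` are hypothetical, and the table is built by hand rather than taken from any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and newlines used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is the initiator and is read as Met,
        # even when it is an alternative start codon such as GTG.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# The first three ten-base blocks of the gene sequence.
print(translate_cds("GTGTCCCACG CCGTACAACA GGATCTTGAG"))  # MSHAVQQDLE
```

The output matches the first ten residues of the protein sequence above. The start-codon override applies only to the first codon, so the function should be given the sequence from its beginning rather than an internal fragment.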