Gene Snas_0994 details


Gene Information

Locus tag: Snas_0994
Symbol:
ID: 8882179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 1050840
End bp: 1053827
Gene Length: 2988 bp
Protein Length: 995 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003509797
Protein GI: 291298519
COG category:
COG ID:
TIGRFAM ID:
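
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1050840
end_bp = 1053827

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2988
print(protein_length)  # 995
```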


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCTAC TCGTACTCGG GCCTCTCGAG GTACGCCACA ACGACCGTAC CGTGGCGATC 
CGTGGACGCG TTCATCCACG GTTGTTGGCC ATACTGGCTC TCAACGCGGG GAAGGTGGTG
TCACTGACCA CCCTGATCGA CACGGTGTGG GATGACAATC CACCGGCCAC CGCCAAGCGA
CAGGTGCAGA ACGCTCTAGC GCAACTGCGC AATCAGCTGA GTCAACGGCT CATCGAATCG
GTGGGGCAGG ACTATCGCTT GAACCTCGAC ATCGCCGAGG TCGACGCGCA CCAGTTCAAC
ATGATGGTCA AGCAGGCTCA GCAGGAACGC GTACGAGGCC ATCACGCATC GGCTCTAGCT
CGGCTCCGCG AAGCCCTGGG GCTGTGGCAC GGCCCAGCTT TGGCCGGGCT GACGGGACAC
GCACTCCAGC TCAAGGCCCG CCACCTGGAC AACGCTCGGC TTGCCGCCAC AGAGGATCGC
ATCGAACTGG AACTACAGCT CGGAAAGACT CTCGACATAG CCGAACTCGG CACACTAACC
GCACAACACC CACTCAATCA GCGGCTCGCC TCCCACTTGA TGCTGGCGCT CTACCGCGAT
CACCGCACTG CCGAAGCGCT GGCCGTGTAC ACAGACATTC AGCAACGCCT CGCTGATGAG
CTCGGCATCG ACCCCGGCAA AGCACTACGC GAACAACGCA CGGCGATCCT TCGGGAGGAT
CCCTCCCTCG CGGCCCCGGC GGTGGCGGAG AGGTCCGTCT CAAGGCCGGT TCCGGCGCAG
CTTCCGGCTG ATATCGCGGG ATTCACCGGA CGGCATGATC AACTGGCACA GTTGGACAGT
CTGCCGAAAG CGGGCGCGAC CAGCGCGATT CTGTCCACCA TCGGCGGCAT CGGAGGCGTG
GGCAAGACAG CACTGGCGAT CCACTGGGCA CACCGCAACC GGCGCCGTTT CCCCGACGGT
CAGCTCTATG TCAACCTGCG CGGCTTCGAC CGCGAAGAGC CGCTCGCGCC CCTAAAGGCA
TTGACGCGCT TCCTGCGAGC ATTCGACGTT CCCGCCGACA CCATTCCCTC CGACACGGAG
TCAGCGGCGG CACTCTTCCG GTCCCTGGTG ATCGACAAAC GCCTTCTGGT TGTTCTCGAC
AACGCGCGCG ATGTCGAACA GGTTCGCCCA CTTGTGCCCG GGGGACCGGA AACTCTGACC
CTGGTCACCA GTCGCAATCG GCTGGTCGGA CTCACGGCGC TTCACGGCGC CGTCCCCATA
ACGGTGGGCG CCATGTCCCG AACGGAATCC CTTGACGTCC TGAATAACCT GGTCGGCAAA
GACCGCCTTC ACGCGGAATC CTCGGCTTCC CGGCAGCTAG CGCGACTCTG CGCTGACCTC
CCGCTGGCGC TGCGGATAGC GGGTGCAAAT CTCGGCACCA CGTCTGAATT GAGTGTCGCT
GAATACGTCC AGGAACTCGA AGGCCCACAA CGGCTTGAGC GCCTATCCAT TGAGGGGGAG
CCTCAAACCG CAGTCAGCGC CGCGCTCAGC CTGTCGGTTC AGGCACTGCC TGTCGCTGCC
CAACAGCTCT TCATGCGAGT GGGTCTGATT CCCGGTGAGG ACTTCCACCA GGATCTCGTC
ACCGTCATCG GACAAGAACC CCCGACGGAG GCACTCCGGC TGTTGCGAAC CCTGGTGTCC
GGCAACTTGC TTGAGCCGTA TCGTACGAAC CGCTTCCGCT TTCACGACCT CGTCCGTGAA
TACGCCGCGA CAATCGCCAA GGACTCTTTG GATGCTTCAG AACATGAGGC CACAGCCGAT
CGTATTGTTC AGTGGTACTA CGACACCAGA GCGGAGACCG CTGCGTCGGA GTACGGGAAT
GTGGTCGCCG CTTTCAAAGC CTGGCAAGAC CATCGCCGTT CCCTGTCGCT GATCCCCGTA
CTCCAGATCA ATCTGCACAA CGGACTTCAC TTGTCCGAGG TCTTGTCGCA TCTAGACACA
GCGCATCAAT TGGCCCTGCG AGTGAACGAT CAGCTCAGCC TGCAACGCAC GACAACAGCG
CTGACCGCGT ACAGCTGGGC GACAGGCGAC TACACCGCCG CATTCGCATA TGGACGTCAG
GCCGTCACGC ACGCTCTCGC CTTCGACGAC GACGCTGATG GCATCGCTCG CGGTAACCTC
GGCACTCTCT ACAGCCATGA CGGCAACTTC AGGCAGGCCC AACCCCTGCT CGAAGAGGCC
CTGGAAGCCG CGACACACTC CGACCAACCT GCTTTCGCTG TGCCTGTCGG CATCGCTCTA
GCCCATTTGC TATTGGACCG TGGAGAATAC CTCCGTGCCG GCGAGGTCAT ACGACAACTC
GATGCCATTG AGTGCCCTCC GGCTTCGACG GTGTTTCTCA TGACGGCTCG GGTAGATCTC
GAGGCAGCCC GTGGCGAACT GCAAGTGGCG CTTGAACTTG CCACTCGCAC GCTCAGAACC
GCTCGCGAAC ACTCACATCT ACGTGCGGAG CTGTATGCGC TCCAGAAACG GTCCCGGATC
CGCCGGAGAT TGAACGATCT CAGTGGTGCC CGCGCCGATA CCGCGCTGGC GCTCGAACTC
GCTGCCGAAA ACGGGTATCC GCTCCCTGAG TCGATCATGC GATCGGAGCA CGCGCTGTCC
TTGTGTGATA CCGATTCCCC GCACGAGGCC CGTACACAGC TCGCCAGGCT CGACGAGACC
TCCGTGTACT CCGGCGCCAA GTCCCTTCAG GCGATGGCCG CAGCGACGTT GAGCAATGTG
TACAACAAAC TGCGTGAATA CGCTGACTCG ATCAAGCATG GAACGCGAGC CCTGGAGTTC
TTCAGCGCTA TGCCCTACCC ATTGGCCCAG GCGCGAGTAC TCCGCACGCT CGCCGACTCA
CATGACGCTT TGGGGGATTC CGCCATCGCA CGCCAGCAAC GTGAGGAAGC TCTGGACATC
TTCACCCGAT TGGGCGTGCC CGTAAACGAC ACATCATGCC CAGAATGA
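
A GC content figure such as the one reported for this gene can be recomputed from the nucleotide sequence. The sketch below is one plausible way to do it; the helper name `gc_percent` is hypothetical, and how the source database rounds its value is not stated in the record, so the result for the full sequence may differ slightly from the reported percentage.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and newlines."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First 10-base block of the gene sequence, used only as a small demonstration.
print(gc_percent("ATGGAGCTAC"))  # 50.0
```

Pasting the whole gene sequence into the function, spaces and line breaks included, gives the figure for the full gene.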
 
Protein sequence
MELLVLGPLE VRHNDRTVAI RGRVHPRLLA ILALNAGKVV SLTTLIDTVW DDNPPATAKR 
QVQNALAQLR NQLSQRLIES VGQDYRLNLD IAEVDAHQFN MMVKQAQQER VRGHHASALA
RLREALGLWH GPALAGLTGH ALQLKARHLD NARLAATEDR IELELQLGKT LDIAELGTLT
AQHPLNQRLA SHLMLALYRD HRTAEALAVY TDIQQRLADE LGIDPGKALR EQRTAILRED
PSLAAPAVAE RSVSRPVPAQ LPADIAGFTG RHDQLAQLDS LPKAGATSAI LSTIGGIGGV
GKTALAIHWA HRNRRRFPDG QLYVNLRGFD REEPLAPLKA LTRFLRAFDV PADTIPSDTE
SAAALFRSLV IDKRLLVVLD NARDVEQVRP LVPGGPETLT LVTSRNRLVG LTALHGAVPI
TVGAMSRTES LDVLNNLVGK DRLHAESSAS RQLARLCADL PLALRIAGAN LGTTSELSVA
EYVQELEGPQ RLERLSIEGE PQTAVSAALS LSVQALPVAA QQLFMRVGLI PGEDFHQDLV
TVIGQEPPTE ALRLLRTLVS GNLLEPYRTN RFRFHDLVRE YAATIAKDSL DASEHEATAD
RIVQWYYDTR AETAASEYGN VVAAFKAWQD HRRSLSLIPV LQINLHNGLH LSEVLSHLDT
AHQLALRVND QLSLQRTTTA LTAYSWATGD YTAAFAYGRQ AVTHALAFDD DADGIARGNL
GTLYSHDGNF RQAQPLLEEA LEAATHSDQP AFAVPVGIAL AHLLLDRGEY LRAGEVIRQL
DAIECPPAST VFLMTARVDL EAARGELQVA LELATRTLRT AREHSHLRAE LYALQKRSRI
RRRLNDLSGA RADTALALEL AAENGYPLPE SIMRSEHALS LCDTDSPHEA RTQLARLDET
SVYSGAKSLQ AMAAATLSNV YNKLREYADS IKHGTRALEF FSAMPYPLAQ ARVLRTLADS
HDALGDSAIA RQQREEALDI FTRLGVPVND TSCPE
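
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this under stated assumptions: it uses the standard amino acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record or of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 assigns
# the same amino acids as the standard code; it differs in allowed start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence.
print(translate("ATGGAGCTAC TCGTACTCGG GCCTCTCGAG"))  # MELLVLGPLE
```

This gene begins with ATG, so the special handling of alternative start codons does not come into play here. A sequence that starts with GTG or TTG would need its first residue set to M separately.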