Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_0994 |
Symbol | |
ID | 8882179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 1050840 |
End bp | 1053827 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003509797 |
Protein GI | 291298519 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTAC TCGTACTCGG GCCTCTCGAG GTACGCCACA ACGACCGTAC CGTGGCGATC CGTGGACGCG TTCATCCACG GTTGTTGGCC ATACTGGCTC TCAACGCGGG GAAGGTGGTG TCACTGACCA CCCTGATCGA CACGGTGTGG GATGACAATC CACCGGCCAC CGCCAAGCGA CAGGTGCAGA ACGCTCTAGC GCAACTGCGC AATCAGCTGA GTCAACGGCT CATCGAATCG GTGGGGCAGG ACTATCGCTT GAACCTCGAC ATCGCCGAGG TCGACGCGCA CCAGTTCAAC ATGATGGTCA AGCAGGCTCA GCAGGAACGC GTACGAGGCC ATCACGCATC GGCTCTAGCT CGGCTCCGCG AAGCCCTGGG GCTGTGGCAC GGCCCAGCTT TGGCCGGGCT GACGGGACAC GCACTCCAGC TCAAGGCCCG CCACCTGGAC AACGCTCGGC TTGCCGCCAC AGAGGATCGC ATCGAACTGG AACTACAGCT CGGAAAGACT CTCGACATAG CCGAACTCGG CACACTAACC GCACAACACC CACTCAATCA GCGGCTCGCC TCCCACTTGA TGCTGGCGCT CTACCGCGAT CACCGCACTG CCGAAGCGCT GGCCGTGTAC ACAGACATTC AGCAACGCCT CGCTGATGAG CTCGGCATCG ACCCCGGCAA AGCACTACGC GAACAACGCA CGGCGATCCT TCGGGAGGAT CCCTCCCTCG CGGCCCCGGC GGTGGCGGAG AGGTCCGTCT CAAGGCCGGT TCCGGCGCAG CTTCCGGCTG ATATCGCGGG ATTCACCGGA CGGCATGATC AACTGGCACA GTTGGACAGT CTGCCGAAAG CGGGCGCGAC CAGCGCGATT CTGTCCACCA TCGGCGGCAT CGGAGGCGTG GGCAAGACAG CACTGGCGAT CCACTGGGCA CACCGCAACC GGCGCCGTTT CCCCGACGGT CAGCTCTATG TCAACCTGCG CGGCTTCGAC CGCGAAGAGC CGCTCGCGCC CCTAAAGGCA TTGACGCGCT TCCTGCGAGC ATTCGACGTT CCCGCCGACA CCATTCCCTC CGACACGGAG TCAGCGGCGG CACTCTTCCG GTCCCTGGTG ATCGACAAAC GCCTTCTGGT TGTTCTCGAC AACGCGCGCG ATGTCGAACA GGTTCGCCCA CTTGTGCCCG GGGGACCGGA AACTCTGACC CTGGTCACCA GTCGCAATCG GCTGGTCGGA CTCACGGCGC TTCACGGCGC CGTCCCCATA ACGGTGGGCG CCATGTCCCG AACGGAATCC CTTGACGTCC TGAATAACCT GGTCGGCAAA GACCGCCTTC ACGCGGAATC CTCGGCTTCC CGGCAGCTAG CGCGACTCTG CGCTGACCTC CCGCTGGCGC TGCGGATAGC GGGTGCAAAT CTCGGCACCA CGTCTGAATT GAGTGTCGCT GAATACGTCC AGGAACTCGA AGGCCCACAA CGGCTTGAGC GCCTATCCAT TGAGGGGGAG CCTCAAACCG CAGTCAGCGC CGCGCTCAGC CTGTCGGTTC AGGCACTGCC TGTCGCTGCC CAACAGCTCT TCATGCGAGT GGGTCTGATT CCCGGTGAGG ACTTCCACCA GGATCTCGTC ACCGTCATCG GACAAGAACC CCCGACGGAG GCACTCCGGC TGTTGCGAAC CCTGGTGTCC GGCAACTTGC TTGAGCCGTA TCGTACGAAC CGCTTCCGCT TTCACGACCT CGTCCGTGAA TACGCCGCGA CAATCGCCAA GGACTCTTTG GATGCTTCAG AACATGAGGC CACAGCCGAT CGTATTGTTC AGTGGTACTA CGACACCAGA GCGGAGACCG CTGCGTCGGA GTACGGGAAT GTGGTCGCCG CTTTCAAAGC CTGGCAAGAC CATCGCCGTT CCCTGTCGCT GATCCCCGTA CTCCAGATCA ATCTGCACAA CGGACTTCAC TTGTCCGAGG TCTTGTCGCA TCTAGACACA GCGCATCAAT TGGCCCTGCG AGTGAACGAT CAGCTCAGCC TGCAACGCAC GACAACAGCG CTGACCGCGT ACAGCTGGGC GACAGGCGAC TACACCGCCG CATTCGCATA TGGACGTCAG GCCGTCACGC ACGCTCTCGC CTTCGACGAC GACGCTGATG GCATCGCTCG CGGTAACCTC GGCACTCTCT ACAGCCATGA CGGCAACTTC AGGCAGGCCC AACCCCTGCT CGAAGAGGCC CTGGAAGCCG CGACACACTC CGACCAACCT GCTTTCGCTG TGCCTGTCGG CATCGCTCTA GCCCATTTGC TATTGGACCG TGGAGAATAC CTCCGTGCCG GCGAGGTCAT ACGACAACTC GATGCCATTG AGTGCCCTCC GGCTTCGACG GTGTTTCTCA TGACGGCTCG GGTAGATCTC GAGGCAGCCC GTGGCGAACT GCAAGTGGCG CTTGAACTTG CCACTCGCAC GCTCAGAACC GCTCGCGAAC ACTCACATCT ACGTGCGGAG CTGTATGCGC TCCAGAAACG GTCCCGGATC CGCCGGAGAT TGAACGATCT CAGTGGTGCC CGCGCCGATA CCGCGCTGGC GCTCGAACTC GCTGCCGAAA ACGGGTATCC GCTCCCTGAG TCGATCATGC GATCGGAGCA CGCGCTGTCC TTGTGTGATA CCGATTCCCC GCACGAGGCC CGTACACAGC TCGCCAGGCT CGACGAGACC TCCGTGTACT CCGGCGCCAA GTCCCTTCAG GCGATGGCCG CAGCGACGTT GAGCAATGTG TACAACAAAC TGCGTGAATA CGCTGACTCG ATCAAGCATG GAACGCGAGC CCTGGAGTTC TTCAGCGCTA TGCCCTACCC ATTGGCCCAG GCGCGAGTAC TCCGCACGCT CGCCGACTCA CATGACGCTT TGGGGGATTC CGCCATCGCA CGCCAGCAAC GTGAGGAAGC TCTGGACATC TTCACCCGAT TGGGCGTGCC CGTAAACGAC ACATCATGCC CAGAATGA
|
Protein sequence | MELLVLGPLE VRHNDRTVAI RGRVHPRLLA ILALNAGKVV SLTTLIDTVW DDNPPATAKR QVQNALAQLR NQLSQRLIES VGQDYRLNLD IAEVDAHQFN MMVKQAQQER VRGHHASALA RLREALGLWH GPALAGLTGH ALQLKARHLD NARLAATEDR IELELQLGKT LDIAELGTLT AQHPLNQRLA SHLMLALYRD HRTAEALAVY TDIQQRLADE LGIDPGKALR EQRTAILRED PSLAAPAVAE RSVSRPVPAQ LPADIAGFTG RHDQLAQLDS LPKAGATSAI LSTIGGIGGV GKTALAIHWA HRNRRRFPDG QLYVNLRGFD REEPLAPLKA LTRFLRAFDV PADTIPSDTE SAAALFRSLV IDKRLLVVLD NARDVEQVRP LVPGGPETLT LVTSRNRLVG LTALHGAVPI TVGAMSRTES LDVLNNLVGK DRLHAESSAS RQLARLCADL PLALRIAGAN LGTTSELSVA EYVQELEGPQ RLERLSIEGE PQTAVSAALS LSVQALPVAA QQLFMRVGLI PGEDFHQDLV TVIGQEPPTE ALRLLRTLVS GNLLEPYRTN RFRFHDLVRE YAATIAKDSL DASEHEATAD RIVQWYYDTR AETAASEYGN VVAAFKAWQD HRRSLSLIPV LQINLHNGLH LSEVLSHLDT AHQLALRVND QLSLQRTTTA LTAYSWATGD YTAAFAYGRQ AVTHALAFDD DADGIARGNL GTLYSHDGNF RQAQPLLEEA LEAATHSDQP AFAVPVGIAL AHLLLDRGEY LRAGEVIRQL DAIECPPAST VFLMTARVDL EAARGELQVA LELATRTLRT AREHSHLRAE LYALQKRSRI RRRLNDLSGA RADTALALEL AAENGYPLPE SIMRSEHALS LCDTDSPHEA RTQLARLDET SVYSGAKSLQ AMAAATLSNV YNKLREYADS IKHGTRALEF FSAMPYPLAQ ARVLRTLADS HDALGDSAIA RQQREEALDI FTRLGVPVND TSCPE
|
| |