Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_3484 |
Symbol | |
ID | 8884683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 3685572 |
End bp | 3688517 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003512240 |
Protein GI | 291300962 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATTCT CCGTTTTGGG TCCACTTCGG GTAGAGGCGG GAGACCGCAT TGTAGACATC GACCGGCCAC GCCGACGATC CGTCCTGGCC TACCTGCTGA TGCGCTCCAA CCGCCAGGTC TCCACCGACC AACTCGTCAC CGCCATCTGG GGTGACCGCC CCGTCCCCAC CGCCACCACC CAGGTGCACA CCGCGATCTC CGTGCTGCGC AGGACATTCC GGGACGCGGG CCACCCGGAC CTGATCCAGA GCCGATCCTC CGGCTACACC CTCAACGTCG ACCCCGAGCA CCTCGACCTC GCCGTGTTCG CCGCCGCGGT GTCCGCCGCC GGAACCGCCA ACGCCGGTGA CACCGACATC ATCACCGCCC TGCGACGAGC CCTGGACCTG TGGCGGGGCC AAGCCCTGGA GGGAATCAGC GGAGCCTTCG TCGACACGGC GAGTGCCCGC CTGGAAGAAC AGCGGTTGTC GGTCTTCGAG ACCCTGATGG ACCTGGAGAT CGCACGGGGA AACCACCGCG ACATCACCCC GGAACTGATG GGCTACATCG AATCCCATCC GCTCCGAGAA TCCCTGGTCG AACGCCTCAT GCGCGCCCTC TACCACAGTG GACGCAAGTC GGACGCGCTG CGCGAGTACG TGCGCATCCG CGAGCGCCTC ACGTCCGAAC TCGGTGTGGA GCCGGGCTCG GCACTGCGCG GCCTGCACCG GGAGATCCTC CAGGACGCAC CCGTAGCGCG CCCCCGCGAG TCAACCCGAG TGGAAGCACC CCGCGAACCC GCCGGCCCCC GGCAACTGCC GCCACCGTCC CGAATGTTCA CCGGCCGAGC CGCCGAACGC GCCCAACTGC GCCACGCGCT CACCCCAACG CGCGGTCAGG AGCGCCCCAT CGTCGCCCTG CACGGCTCCG GCGGCGTCGG AAAATCCACA CTGGCCATCC AGGTCGCCCA CGACCTCAAC CCCACCTTCC CCGACGGCCA GCTCTACGTC GACCTGCAAG GCTCCACACC CGGCCTGCCA CCGCTGACCC CGCTGGAAAT ACTGCGGCGC CTGCTGTCCG CGCTGGGACA GCCCGACGGC GAGATCCCCA CCGACGCCAC CGAAGCCGCC CGCCGCTACA CCGACCTCAG CGACGGTTCC CAACACCTCA TCCTGCTCGA CAACGCGACC GACCCGCGCC AGGTCGAACC CGTCATCCGG GCCAGCCGCA GCGGCGGCCT CCTCATCACC GGCCGGGCCC CACTGGCACT GTCCGACGTC CAACTGTCCC TGCGCCTGGA CGTGCTGCCC CCGGCCGACG CGATCACCCT GCTCGACCGG ATCGCCGGAC GCACCGGCGC CGACTGGTCC GACTTCAGTC AGATCGCCGC CTACTGCGAC TACCTGCCGC TGGCCCTGTG CATCGCCGGT GGCCGACTCG CGCGCGAACC CGACCTGTCC GGCAAACGCC TGGCCGCCAG TCTGTCCGAC CACCGCGACC GGCTCGACAC CCTCGAAGTG GACGGAGTCG GGGTCCGCTC CAGCATCCGG GTCGGCTACG ACCTCATCGC CTCCGGCAGC AGTCCCGCCG ACCGCGTCGC CGCCGACGCC TTCCGCGCCC TGGGTCTGCT GCCACTACCC ACCATCGACG CCGGGGTCAT CGCCGCCATG CTGTGTCCCG ACGACCCTCC CATGGCCACC ACCGCCCTGG CCCGCCTGAC CCGCGCCCAA CTGATCGCCT CCGACGGCGA CACCCGATAC CAGCCACACG ACATCGTCCG CGCCGTCGCG ACCGGCTACG CCGAGGAAAC CATGAGCGCC CAACAGCGCA CCCAACTGCG CCACCGGGGC ATCGCCTTCT ACGCCGCCTG CGCCATGCTC GCCGACAACC TGCTGCGCCC GAGTCGCAAA GGCATCGCCT CCCAACCCGA CCCCACCGAA CTGGCCCCGC AGCAAGTCCT CCGACTCGCC CTGCGCGGCC CCGACGACGT CGGCCCCTGG CTCGACGAAA CCCTGCCCAA CCTCGTCGCG GCAGCCCAAC TCGCGGCCAC CGATTCCCCC GAAGCCGCCC GCCACGCCAT CACCATCGCC CACTCCCTGT CCTGGGCCCT GCGCAAACGC GGCGAATTCC ACCGCGAACA CGTCCTCGCC ACCAGCGCCG TAGCCGCCGC CGAACGCCTT GACGAACCGA ACACCCTGCG TCGAGCCCTC ATCTACCTGG GCCGCGTCGA GATCTACCTC GCCGAGTACG ACAGCGCCCT GGCCCACATC AACCGAGCCC TCGACTCGGC CATCGCCGAC GACGACCTCT ACCTCCAGAT CGCGGCACTC AACGACCTCT GCCTGGTCGC CATCAAACAA GACGACCTAC CCCAAGCCCG GCGACTCCTC CTGGACTGCC TGGAACGCGG CACCCCCATC GAAGGCTGGC AACGCGTCGG CGCCACCCCT CGACACAACC TCGCCGCGGT CCAAGCCCTA CTGGGCGAGT GGAAGGAAGG CGCCCGACTG CTGCGCCACA ACCTCCCCCT GCGCCGCGAC ACCAACGACC GCGCCGGAGA AGGCACCGAC CTCGTCCTGC TCGGCACCAT CTGCTGTGGA CTGGACCAAC TCGACGAAGC CGCCCTCCAC CTCACCGAGG GCATCGACAT CTGCGAAGAG CTCGGCGACC GCCTCGACAA ATGGTTCGGC CTGGCCGCCC TGACCCTGGT ACACCTGCGC CAAGGCCGTC ACCTGGAAGC CACAACGGTC GGCCAGCAAT GCCTCCTCGT CGCCCGAGGC ATCAACCAAC CCTTCGCCGA GAAGTGCTCC CACCGCCTAC TCGCACTGGC CCACCAGGCA TCCCAGGAAC CCAGCGACGC CAGAAGCCAC GCCCGCAACG CCGAAGCGAT CGACGCCGGT TCGTCCAGCC TCGAAGGCCG CATCATCGAC ACCCTGCTGG ACACTTACGA GTCCACCAGA CCCTGA
|
Protein sequence | MRFSVLGPLR VEAGDRIVDI DRPRRRSVLA YLLMRSNRQV STDQLVTAIW GDRPVPTATT QVHTAISVLR RTFRDAGHPD LIQSRSSGYT LNVDPEHLDL AVFAAAVSAA GTANAGDTDI ITALRRALDL WRGQALEGIS GAFVDTASAR LEEQRLSVFE TLMDLEIARG NHRDITPELM GYIESHPLRE SLVERLMRAL YHSGRKSDAL REYVRIRERL TSELGVEPGS ALRGLHREIL QDAPVARPRE STRVEAPREP AGPRQLPPPS RMFTGRAAER AQLRHALTPT RGQERPIVAL HGSGGVGKST LAIQVAHDLN PTFPDGQLYV DLQGSTPGLP PLTPLEILRR LLSALGQPDG EIPTDATEAA RRYTDLSDGS QHLILLDNAT DPRQVEPVIR ASRSGGLLIT GRAPLALSDV QLSLRLDVLP PADAITLLDR IAGRTGADWS DFSQIAAYCD YLPLALCIAG GRLAREPDLS GKRLAASLSD HRDRLDTLEV DGVGVRSSIR VGYDLIASGS SPADRVAADA FRALGLLPLP TIDAGVIAAM LCPDDPPMAT TALARLTRAQ LIASDGDTRY QPHDIVRAVA TGYAEETMSA QQRTQLRHRG IAFYAACAML ADNLLRPSRK GIASQPDPTE LAPQQVLRLA LRGPDDVGPW LDETLPNLVA AAQLAATDSP EAARHAITIA HSLSWALRKR GEFHREHVLA TSAVAAAERL DEPNTLRRAL IYLGRVEIYL AEYDSALAHI NRALDSAIAD DDLYLQIAAL NDLCLVAIKQ DDLPQARRLL LDCLERGTPI EGWQRVGATP RHNLAAVQAL LGEWKEGARL LRHNLPLRRD TNDRAGEGTD LVLLGTICCG LDQLDEAALH LTEGIDICEE LGDRLDKWFG LAALTLVHLR QGRHLEATTV GQQCLLVARG INQPFAEKCS HRLLALAHQA SQEPSDARSH ARNAEAIDAG SSSLEGRIID TLLDTYESTR P
|
| |