Gene Snas_3484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3484 
Symbol 
ID8884683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3685572 
End bp3688517 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content69% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003512240 
Protein GI291300962 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTCT CCGTTTTGGG TCCACTTCGG GTAGAGGCGG GAGACCGCAT TGTAGACATC 
GACCGGCCAC GCCGACGATC CGTCCTGGCC TACCTGCTGA TGCGCTCCAA CCGCCAGGTC
TCCACCGACC AACTCGTCAC CGCCATCTGG GGTGACCGCC CCGTCCCCAC CGCCACCACC
CAGGTGCACA CCGCGATCTC CGTGCTGCGC AGGACATTCC GGGACGCGGG CCACCCGGAC
CTGATCCAGA GCCGATCCTC CGGCTACACC CTCAACGTCG ACCCCGAGCA CCTCGACCTC
GCCGTGTTCG CCGCCGCGGT GTCCGCCGCC GGAACCGCCA ACGCCGGTGA CACCGACATC
ATCACCGCCC TGCGACGAGC CCTGGACCTG TGGCGGGGCC AAGCCCTGGA GGGAATCAGC
GGAGCCTTCG TCGACACGGC GAGTGCCCGC CTGGAAGAAC AGCGGTTGTC GGTCTTCGAG
ACCCTGATGG ACCTGGAGAT CGCACGGGGA AACCACCGCG ACATCACCCC GGAACTGATG
GGCTACATCG AATCCCATCC GCTCCGAGAA TCCCTGGTCG AACGCCTCAT GCGCGCCCTC
TACCACAGTG GACGCAAGTC GGACGCGCTG CGCGAGTACG TGCGCATCCG CGAGCGCCTC
ACGTCCGAAC TCGGTGTGGA GCCGGGCTCG GCACTGCGCG GCCTGCACCG GGAGATCCTC
CAGGACGCAC CCGTAGCGCG CCCCCGCGAG TCAACCCGAG TGGAAGCACC CCGCGAACCC
GCCGGCCCCC GGCAACTGCC GCCACCGTCC CGAATGTTCA CCGGCCGAGC CGCCGAACGC
GCCCAACTGC GCCACGCGCT CACCCCAACG CGCGGTCAGG AGCGCCCCAT CGTCGCCCTG
CACGGCTCCG GCGGCGTCGG AAAATCCACA CTGGCCATCC AGGTCGCCCA CGACCTCAAC
CCCACCTTCC CCGACGGCCA GCTCTACGTC GACCTGCAAG GCTCCACACC CGGCCTGCCA
CCGCTGACCC CGCTGGAAAT ACTGCGGCGC CTGCTGTCCG CGCTGGGACA GCCCGACGGC
GAGATCCCCA CCGACGCCAC CGAAGCCGCC CGCCGCTACA CCGACCTCAG CGACGGTTCC
CAACACCTCA TCCTGCTCGA CAACGCGACC GACCCGCGCC AGGTCGAACC CGTCATCCGG
GCCAGCCGCA GCGGCGGCCT CCTCATCACC GGCCGGGCCC CACTGGCACT GTCCGACGTC
CAACTGTCCC TGCGCCTGGA CGTGCTGCCC CCGGCCGACG CGATCACCCT GCTCGACCGG
ATCGCCGGAC GCACCGGCGC CGACTGGTCC GACTTCAGTC AGATCGCCGC CTACTGCGAC
TACCTGCCGC TGGCCCTGTG CATCGCCGGT GGCCGACTCG CGCGCGAACC CGACCTGTCC
GGCAAACGCC TGGCCGCCAG TCTGTCCGAC CACCGCGACC GGCTCGACAC CCTCGAAGTG
GACGGAGTCG GGGTCCGCTC CAGCATCCGG GTCGGCTACG ACCTCATCGC CTCCGGCAGC
AGTCCCGCCG ACCGCGTCGC CGCCGACGCC TTCCGCGCCC TGGGTCTGCT GCCACTACCC
ACCATCGACG CCGGGGTCAT CGCCGCCATG CTGTGTCCCG ACGACCCTCC CATGGCCACC
ACCGCCCTGG CCCGCCTGAC CCGCGCCCAA CTGATCGCCT CCGACGGCGA CACCCGATAC
CAGCCACACG ACATCGTCCG CGCCGTCGCG ACCGGCTACG CCGAGGAAAC CATGAGCGCC
CAACAGCGCA CCCAACTGCG CCACCGGGGC ATCGCCTTCT ACGCCGCCTG CGCCATGCTC
GCCGACAACC TGCTGCGCCC GAGTCGCAAA GGCATCGCCT CCCAACCCGA CCCCACCGAA
CTGGCCCCGC AGCAAGTCCT CCGACTCGCC CTGCGCGGCC CCGACGACGT CGGCCCCTGG
CTCGACGAAA CCCTGCCCAA CCTCGTCGCG GCAGCCCAAC TCGCGGCCAC CGATTCCCCC
GAAGCCGCCC GCCACGCCAT CACCATCGCC CACTCCCTGT CCTGGGCCCT GCGCAAACGC
GGCGAATTCC ACCGCGAACA CGTCCTCGCC ACCAGCGCCG TAGCCGCCGC CGAACGCCTT
GACGAACCGA ACACCCTGCG TCGAGCCCTC ATCTACCTGG GCCGCGTCGA GATCTACCTC
GCCGAGTACG ACAGCGCCCT GGCCCACATC AACCGAGCCC TCGACTCGGC CATCGCCGAC
GACGACCTCT ACCTCCAGAT CGCGGCACTC AACGACCTCT GCCTGGTCGC CATCAAACAA
GACGACCTAC CCCAAGCCCG GCGACTCCTC CTGGACTGCC TGGAACGCGG CACCCCCATC
GAAGGCTGGC AACGCGTCGG CGCCACCCCT CGACACAACC TCGCCGCGGT CCAAGCCCTA
CTGGGCGAGT GGAAGGAAGG CGCCCGACTG CTGCGCCACA ACCTCCCCCT GCGCCGCGAC
ACCAACGACC GCGCCGGAGA AGGCACCGAC CTCGTCCTGC TCGGCACCAT CTGCTGTGGA
CTGGACCAAC TCGACGAAGC CGCCCTCCAC CTCACCGAGG GCATCGACAT CTGCGAAGAG
CTCGGCGACC GCCTCGACAA ATGGTTCGGC CTGGCCGCCC TGACCCTGGT ACACCTGCGC
CAAGGCCGTC ACCTGGAAGC CACAACGGTC GGCCAGCAAT GCCTCCTCGT CGCCCGAGGC
ATCAACCAAC CCTTCGCCGA GAAGTGCTCC CACCGCCTAC TCGCACTGGC CCACCAGGCA
TCCCAGGAAC CCAGCGACGC CAGAAGCCAC GCCCGCAACG CCGAAGCGAT CGACGCCGGT
TCGTCCAGCC TCGAAGGCCG CATCATCGAC ACCCTGCTGG ACACTTACGA GTCCACCAGA
CCCTGA
 
Protein sequence
MRFSVLGPLR VEAGDRIVDI DRPRRRSVLA YLLMRSNRQV STDQLVTAIW GDRPVPTATT 
QVHTAISVLR RTFRDAGHPD LIQSRSSGYT LNVDPEHLDL AVFAAAVSAA GTANAGDTDI
ITALRRALDL WRGQALEGIS GAFVDTASAR LEEQRLSVFE TLMDLEIARG NHRDITPELM
GYIESHPLRE SLVERLMRAL YHSGRKSDAL REYVRIRERL TSELGVEPGS ALRGLHREIL
QDAPVARPRE STRVEAPREP AGPRQLPPPS RMFTGRAAER AQLRHALTPT RGQERPIVAL
HGSGGVGKST LAIQVAHDLN PTFPDGQLYV DLQGSTPGLP PLTPLEILRR LLSALGQPDG
EIPTDATEAA RRYTDLSDGS QHLILLDNAT DPRQVEPVIR ASRSGGLLIT GRAPLALSDV
QLSLRLDVLP PADAITLLDR IAGRTGADWS DFSQIAAYCD YLPLALCIAG GRLAREPDLS
GKRLAASLSD HRDRLDTLEV DGVGVRSSIR VGYDLIASGS SPADRVAADA FRALGLLPLP
TIDAGVIAAM LCPDDPPMAT TALARLTRAQ LIASDGDTRY QPHDIVRAVA TGYAEETMSA
QQRTQLRHRG IAFYAACAML ADNLLRPSRK GIASQPDPTE LAPQQVLRLA LRGPDDVGPW
LDETLPNLVA AAQLAATDSP EAARHAITIA HSLSWALRKR GEFHREHVLA TSAVAAAERL
DEPNTLRRAL IYLGRVEIYL AEYDSALAHI NRALDSAIAD DDLYLQIAAL NDLCLVAIKQ
DDLPQARRLL LDCLERGTPI EGWQRVGATP RHNLAAVQAL LGEWKEGARL LRHNLPLRRD
TNDRAGEGTD LVLLGTICCG LDQLDEAALH LTEGIDICEE LGDRLDKWFG LAALTLVHLR
QGRHLEATTV GQQCLLVARG INQPFAEKCS HRLLALAHQA SQEPSDARSH ARNAEAIDAG
SSSLEGRIID TLLDTYESTR P