Gene Snas_3332 details


Gene Information

Locus tag: Snas_3332
Symbol:
ID: 8884531
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 3525878
End bp: 3528955
Gene Length: 3078 bp
Protein Length: 1025 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003512091
Protein GI: 291300813
COG category:
COG ID:
TIGRFAM ID:
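
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3525878
end_bp = 3528955
reported_gene_length = 3078      # bp
reported_protein_length = 1025   # aa

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 3078 0 1025
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.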


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00786892
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.331119
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACA CCGCCAGCCG CAGCCGGAAC GAGCCCGACG CGACGCCCAC CCCGGCGTCC 
GGCGGCGACA CCGGCTTCCG GCTGCTGGGC CCACTCGAGG TGACCCACCA CGGCGAGGTC
CTCGAGGTTC CGCCCGGACG GCAGCAGGCG CTGCTCGCGG CACTGCTGCT GCGGCCGGGC
GAGATCGTGT CGGCACCGAC GCTCATCGAG CAGGTCTGGG GCGGCGACGC CGATCGCACC
GTCCTCAACG TGTGCGTGAT GCGACTGCGA CGGTCCCTCA AGGACCCCAA CCGCGAACTG
GTTCGCGCCG AGGGCAACGG TTACCGCATC GCGGCCGCAC CCGACACCGT CGATCTGCAC
CGGTTCCGGA AACTGTGCGA GCGGGCCGGA ACGGCCCGCG ACCGAGGCGA CCCCGCGGCC
GAACGCGACG CCCTCCACCA GGCACTCGCG TTGTGGCGCG GCGACGCCCT GTCCGGGATC
GCCTCGTCGG TGCTGCAAGC CGAGGTCGTG CCGATCCTCG AAGAGGAGCG GCTGGCCGCG
ACCGAACGAC GCGTGGACGC CGACCTCGAC GGCGAGGCCC ACGCCGAGGT GATCGGCGAA
CTGCGCCAGC TGGTCGCCGA TCACCCGTTC CGGGAACGGT TCTGGTGCCA GCTGCTGCTG
GCCCTGTACC GGTCGGGCCG CCAGTCCGAG GCGCTGGACG CCTACCGCAC GGCACGGGAA
CGACTCGCCG ACGAGCTCGG CATCGAGCCG AGCCAGGAGT TGCGCGAGCT CCACCACCGC
ATCCTCACTT CCGATCCGAC GCTGCTGACG CCCCGGCCCG CCGAGCCGTC GGTCCCACGT
GTCGTCCCGG CCCAACTGCC CCCCGCCACA GCCGGATTCA GCGGCCGCAG AGACCAACTG
CGGCAGCTGG ATCGCCTACT TGACGACACC GAACCCGCAC CAACCACATT GATCACCGGG
CCGCCGGGAG CCGGTAAGAC GACCCTGGCG GTGCACTGGG CCGGTCGCCA CCGTGACCGG
TGGCCCGACG GTCAGCTCTA CATCGACCTG CGCGGATACG GGCCCGAGCC GTTGGTCCAG
CCGATCGAGG CCCTGGCGTA CTTCCTGCGC GCGATCGGAC TGCCTACCGA CCAGGTGCCA
CCTCAGCAGG CGGAGGCGTC GGCTTTGTTC CGGTCACGGA TCGCCGGCCG CAGGCTGCTG
ATCGTGCTCG ACAATGCCGC CACCGTCGAT CAAGTGCGTC CACTGCTGCC CGGCGGTGGC
GACTGTCTGG CCATAGTGAC CAGCCGAGAC CGGTTGACCG GCCTGGTCGC CAAGGAGGGC
GCCCGTGTCC TGAGTGTCGA CGTGATGGCC GACGCCGAAG CCCAGACACT GCTGTCCGAT
GCCGTCGGTG ACGCCCGGCT CGCCGCCGAA CCCGATGCGG CCGCCGCACT GGCACGCCTG
TGTGGCAACC TGCCGCTGGC GCTGCGGATC ACGGCCGCAG ACCTGATCAA CCATCCACAG
CGGAGTCTCA CCCGGCACGT GGACCGGCTG CGCACCGGCA ACCGCCTGGA CGCGCTTCAG
GTCGCCGACG ACGACGGTAC CGCGGTGCGA GCGGCCTTCC GATCCTCCTA CGCTCGACTG
CCCGAACCAG TGAGGCGACT GTTCCGGTTG CTGGGACTGT TCCCCGGTTC GGAGATCGGC
CTGGAGTCCG CCGCCTCGCT GGCGGGCGTC GAGGCCGCCG CGACCGAACC GCTGCTGGAT
CGTCTGATCT CCGCCCATCT GGTGACGCCG CGGGGAGCCG AACGGTTCGC GTTGCACGAT
CTGCTGCGCT TGTTCGCCCA GGAGCTGGGG CAGGAGGAGG ACGCCGACGC CGAACGCGAG
GCCGCCACGC GGCGGCTCTA CGACCACTAT TCACGCGTCG CCGTCGCCGC GGCGAATCGG
ATGTACTCGT TCGTGGTGAC GCTGCCGTTG TCGGAGGAGG CGGCGAGGAT CCCGGTGAGC
TTCGACGACG ACACCACGGC GTCGGCATGG TTCGACACCG AACACCCCAA CCTCGTCGCA
CTGGTTCGGT ACGCCGCCGA TCGCGGCGAC CACCTCGACG CCTGCCGGCT GGCCGCCACG
ATGCAAAGCT ATCTGCACAG CAAGATGCAC GCCGTCGACT GGCTCAACGT CGCTCGCGCC
TATCTGTTCG CCGCCCGCCA GATTGTGGAT TTCAGGGCCC AGATCCTGGC TCATCTGAGC
TTGGCCAACC TGCACCGCTC CCGCGCTCAG CGTAACGCCG CCATCAAACA CTTCGAGCAG
GCGCTGGCGT TGAGCCGTCA CAGCGGCTGG GTCGACGGAC AGTCCATAAC CCACAACAAT
CTCAGTGGTG TCTACGACTA CGAGGGAAAA CTGCGACTGA CGGTTCATCA CCTGCATCAG
GCGCTGGAAC TGCGCTCGCC AACGCGGGAC ATCACGTCGA AGGCGGTCGT TCTCAGCAAC
CTCGGCAGGG CGTTGCTGCG GTTGGGGCAC GTGGAGGACG CGGTCGACCA CCTGGAACGT
GCGACGAACA TTCACCAACG GCGCGGTGCC AGGGTGAGCG AGTCCCGCAG CCGCGCCAGT
CTCGGCGAGG CACTGTGCGA GCTGGGACAG CACCACCGTG CGCTGGCATT GCTGAACCAC
GCGGTAGCGG TGCAACGGGA GCTGGACGAC AGAATGTTCC TACCCAACAC CCTGTGCCGA
CTGGCGAACG CCCACCTCGA CGTCGGCCAG ACCGATCTCG CCGCCGAACT GGTGCCGACC
GCGATGGCAC TGGTACGCGA AACCGGACAT CGCCGTTCAG AGGCATGGGT CACCTACGTG
ACCGCCCGGG TGCATGAGCA CACCGGCTGC CCTGATGCCG CGCTCGATGG CTACGCCCAG
GCGCTACGGC TGAGCCGGGA ACGCGGTAAC GGATATCTTC AGGTGCAGGC GATGATCGGC
ATGAGCTCCG CCCATCACCA ACTCGGTCAG CGGGAGTCGG CCCGCCGCTG TATCGACAAG
GCCCTCGCGA GAGCTCGCCG TTTCGAGTAC GCGCTTCTGG AACGACAGGC CGAGGCCGTC
CGGGTCGCCC TGAGCTGA
 
Protein sequence
MTDTASRSRN EPDATPTPAS GGDTGFRLLG PLEVTHHGEV LEVPPGRQQA LLAALLLRPG 
EIVSAPTLIE QVWGGDADRT VLNVCVMRLR RSLKDPNREL VRAEGNGYRI AAAPDTVDLH
RFRKLCERAG TARDRGDPAA ERDALHQALA LWRGDALSGI ASSVLQAEVV PILEEERLAA
TERRVDADLD GEAHAEVIGE LRQLVADHPF RERFWCQLLL ALYRSGRQSE ALDAYRTARE
RLADELGIEP SQELRELHHR ILTSDPTLLT PRPAEPSVPR VVPAQLPPAT AGFSGRRDQL
RQLDRLLDDT EPAPTTLITG PPGAGKTTLA VHWAGRHRDR WPDGQLYIDL RGYGPEPLVQ
PIEALAYFLR AIGLPTDQVP PQQAEASALF RSRIAGRRLL IVLDNAATVD QVRPLLPGGG
DCLAIVTSRD RLTGLVAKEG ARVLSVDVMA DAEAQTLLSD AVGDARLAAE PDAAAALARL
CGNLPLALRI TAADLINHPQ RSLTRHVDRL RTGNRLDALQ VADDDGTAVR AAFRSSYARL
PEPVRRLFRL LGLFPGSEIG LESAASLAGV EAAATEPLLD RLISAHLVTP RGAERFALHD
LLRLFAQELG QEEDADAERE AATRRLYDHY SRVAVAAANR MYSFVVTLPL SEEAARIPVS
FDDDTTASAW FDTEHPNLVA LVRYAADRGD HLDACRLAAT MQSYLHSKMH AVDWLNVARA
YLFAARQIVD FRAQILAHLS LANLHRSRAQ RNAAIKHFEQ ALALSRHSGW VDGQSITHNN
LSGVYDYEGK LRLTVHHLHQ ALELRSPTRD ITSKAVVLSN LGRALLRLGH VEDAVDHLER
ATNIHQRRGA RVSESRSRAS LGEALCELGQ HHRALALLNH AVAVQRELDD RMFLPNTLCR
LANAHLDVGQ TDLAAELVPT AMALVRETGH RRSEAWVTYV TARVHEHTGC PDAALDGYAQ
ALRLSRERGN GYLQVQAMIG MSSAHHQLGQ RESARRCIDK ALARARRFEY ALLERQAEAV
RVALS
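
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), which uses the standard codon assignments but allows alternative initiation codons such as GTG, read as methionine at the start of a coding sequence. The following is a minimal sketch of that translation, not the method used to produce this record; the function and parameter names (`translate_cds`, `initiator`) are hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by translation tables 1 and 11),
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str, initiator: bool = True) -> str:
    """Translate a coding sequence written in space-separated 10-base blocks.

    If `initiator` is true, the first codon is treated as a start codon and
    reported as M, which is how a GTG start is read under table 11.
    A single trailing stop codon is dropped from the result.
    """
    # Remove the spaces and line breaks used for display grouping.
    dna = "".join(dna.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence must be a non-empty multiple of 3 bases")

    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if initiator:
        residues[0] = "M"

    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGACCGACA CCGCCAGCCG CAGCCGGAAC"))        # MTDTASRSRN
# Last 18 bases; not a start, so the first codon is translated normally.
print(translate_cds("CGGGTCGCCC TGAGCTGA", initiator=False))    # RVALS
```

Both fragments reproduce the corresponding ends of the protein sequence listed above.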