Gene Snas_3059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3059 
Symbol 
ID8884258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3225960 
End bp3229004 
Gene Length3045 bp 
Protein Length1014 aa 
Translation table11 
GC content68% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003511823 
Protein GI291300545 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.163733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCGGCG TGAAGTTTCG GATCTTGGGA CCACTGGAGG TGTCCCGCGA CGGCGAGCCG 
GTGACCATAT CGGGCAGACA CCATCCGAAA CTGCTCGCCC TGCTGCTCCT CGAGACCGGC
CGCGTCGTCA CCGTCTCCCG ACTCGTCGAC GCCCTCTGGG AAAACGACCC GCCCGCCACG
GCCCGCCGCC AGATCCAGAA CACCATGGCC AGCCTGCGCC GCCAACTGGC GGGCGACGAC
TCCCCCACCC TGGAAGCCAC CGGCGAAGGC TACCGACTCC TGGTACCCGC CAAAACCGTT
GACGCCCAAT GCTTCACCGA CCTGGTCCGC CAAGCCCGCA CCGCCCGCGA CACCAACGAC
CTCCCCACGG CGTCCCGCCT GTTCGCCGAA GCCCTCGCCC TGTGGCGAGG CGAAGCCCTC
GCGGGCCTGT CCGGCCGCGT CGTCGAAGCG GCAGTGGTGC GCCTCAACGA ATCCCGCCTG
TCCGCGATCG AAGACCGCTG CGACATCGAC ATCGCGCTGG GCCGCCACGG CCAGGTCGTC
GGCGAACTGC GCGAACTCCT GGAACACCAC CCCTACCGCC AACGCGTCGC CGGTCTGCTC
ATGACCGCCC TGCACCACAG CGGACGAACC CCCGAAGCCC TCGAGGTCTT CACCGAAGTC
CGGGCCCGCC TGTCCGACGA ACTCGGCCTC GACCCCGACC CCGACCTGAA CCGCCTCCAC
GGCGAGATCC TGCGCGGCGA CCTCGACACC CCCACCACCG AACCCACCCC ACCCTCGGCA
CCCCGCCCGG CCCAACTCCC CGCCGACACC GCCACCTTCA CCGGCCGCGA AGAACAACTC
GCAGCCCTCG ACAACCTCCT GGCCGACGGC CGCACCGCCA CGGTCGTGTC CGCCATCGCG
GGCATGGGCG GCGCGGGCAA AACCGCCCTC GCCGTCCACT GGGCCCACCA CGTCCGCGAC
CGATTCCCCG ACGGCCAGCT CTACATCAAC CTGCGCGGCT ACGACGAAGC CGCCCCGGTC
TCCCCCGCAG ACGCCCTCAC CCGCTTCCTC AACGCCCTGG GCCAACCCGG CGCCGCGATC
CCCACCGACC CCGACGAAGC CGGAGCCATG TACCGCTCCC TGCTCGCCGA CCAACGCATG
CTCATCCTCC TCGACAACGC CCGCGACGCC GCCCAGGTCC GCCCCCTCCT CCCCGGCGGC
GGCGGAAACT TCGCCCTCAT CACCAGCCGC GACCGCCTCA CCAGCCTGGT AGCCCTCGAC
GACGTCGCCC CGCTGCGCAT CGACACGCTG AGCCACGAGG AGTCGGTGGA TCTGCTGTCC
AACCTCGTCG ACCCCGTCCG TCTCCACTCC GAACCCGAAG CCACCCATCA GCTGGCGCGA
CTGTGCGGCC ACCTCCCCCT GGCGTTGCGC ATCGCCGGAG CCAACCTTGC CGACCGGCCC
GAAACCAACG TCACCCAGTT CGTGGCCGAA CTCGAAGGGC CGCAACGACT CCAGAAGCTG
ACCGCCCCCG ACGACCCCGC CGTCGCCATC ACCCGGACCC TCCACCTGTC CGTCAGCGCC
CTCACCCCGG CCGCGCGGCA GTTGTTCACC CTGCTGGGAA TCCTGCCGGG CGAGGACTTC
TCACACGACC TGGCCGCCCA CCTGGCCGGA ACCGTCACCG ACGACGCTCC GCGAGCGATC
AACGAACTGG AAGCTGCCCA CCTCGTCGAG AGCCACCACG ACAACCGGCT GCGCTTCCAC
GACCTGGTTC GCGAATACGC CAACGCCCGA GCATCGCAGT GGGACGACGC CGACCGCGGC
GAGGCCGTCA CCCGCGTCAT CGGCTGGTAC GACCACAACA AAGCCACCCT GCCCACCGAC
GAACGCGACA ACGTCCTGCG AATGCTCTCC GCCTGGAACC ACCGCCCCGA CTCCTGGCGC
CTGGCGGCGG TTCTCGGCAG GTTCGTCCAC TACGGACCCG ACCTGCCCCG GCAACTCGAA
CTACTGAGAC ACGAACTGAG CAAAGCCGAA CACAACCAGG ACCACCCCGG CCGCTGCCAG
ATGAACTCAA CCCTGGCCAT CGTCCACCGG GAAATGGGCC ACCGGACCAC CGCCATCGAG
TATGCGCGGG AAGCCGTCCA GATCATGCGC GATCACGACG TGGACGACCC GGTGGGCAAG
TACGTGGGCA ACCTCGGTCT CTACCTGGGC GACATGGGCC GCGTCGCCGA GGCCGTCCCC
CTCGTCCTCG AGTCGTACAA CGCCGCGGTG GCCACCGGCG ACGACCTCTT CGCGACCATC
CGCGCCTCCA CCCTTGGAAC CCTCTACGCC GAGTTGGGCG ATTACGGAGA AGGCGAGAAG
TGGACAACGA TGGCGCTGAA GCTGACTGAG CAACCATCCC TGCGATTCTT CCGCCAAGGC
ATCAGCTACT GCCTGTGCGA ACAATACGTC AGCTCCCGTC GCTTCAGTGA CGCCGAACCA
CTGATCACCG ACATCCTGAA CCAACCGGAT TCCAGCGGCG CCAAGTACCA TGCCCTCACC
CTGATCCTCC GCGCCGAGAT CAACCGGGCG CGCGGCCGCT ACGACGCCGC GCATGAGGAC
CTGGCCGAGG CGCTGCGCCA CGCGTCCCAG ACCGACCGCT CCGGCTTGCG CGACATGGCT
GAATGCGGGA TGGCCGAACT CGAAATCCAA ACCGGCCATC CCCGCCAGGC CATTGACCGG
CTCATGTCCT CCCACCCGGC CAACACCGAT CAGATGGGAG CATTGCAGCG AGCTCAGGCT
GACCGTCTAC TTTGCCTCGC GCACGCGCGC CTTGGCGACG GTGATACGGC GATCAACTAT
GGCGACGCGG CGCTGGCGGC GTTCCGTTCG ATGCCACGCC CCCTGTTGGA GGCCAGAACC
TTGGTGGCCC TCGCCGAAGC GCACGACACG GGAGGCGACA AACTTTCTGC CCAACGCGAC
CGTGAGTCGG CGCTCGAAAT CTTCACTCGC CTCGGAATTC CGGTCGAAGA AAGTCGCGAG
GTGCCGCTTG GCCATGGCCA GCGTCCACCC CATTGGGATA CGTAG
 
Protein sequence
MGGVKFRILG PLEVSRDGEP VTISGRHHPK LLALLLLETG RVVTVSRLVD ALWENDPPAT 
ARRQIQNTMA SLRRQLAGDD SPTLEATGEG YRLLVPAKTV DAQCFTDLVR QARTARDTND
LPTASRLFAE ALALWRGEAL AGLSGRVVEA AVVRLNESRL SAIEDRCDID IALGRHGQVV
GELRELLEHH PYRQRVAGLL MTALHHSGRT PEALEVFTEV RARLSDELGL DPDPDLNRLH
GEILRGDLDT PTTEPTPPSA PRPAQLPADT ATFTGREEQL AALDNLLADG RTATVVSAIA
GMGGAGKTAL AVHWAHHVRD RFPDGQLYIN LRGYDEAAPV SPADALTRFL NALGQPGAAI
PTDPDEAGAM YRSLLADQRM LILLDNARDA AQVRPLLPGG GGNFALITSR DRLTSLVALD
DVAPLRIDTL SHEESVDLLS NLVDPVRLHS EPEATHQLAR LCGHLPLALR IAGANLADRP
ETNVTQFVAE LEGPQRLQKL TAPDDPAVAI TRTLHLSVSA LTPAARQLFT LLGILPGEDF
SHDLAAHLAG TVTDDAPRAI NELEAAHLVE SHHDNRLRFH DLVREYANAR ASQWDDADRG
EAVTRVIGWY DHNKATLPTD ERDNVLRMLS AWNHRPDSWR LAAVLGRFVH YGPDLPRQLE
LLRHELSKAE HNQDHPGRCQ MNSTLAIVHR EMGHRTTAIE YAREAVQIMR DHDVDDPVGK
YVGNLGLYLG DMGRVAEAVP LVLESYNAAV ATGDDLFATI RASTLGTLYA ELGDYGEGEK
WTTMALKLTE QPSLRFFRQG ISYCLCEQYV SSRRFSDAEP LITDILNQPD SSGAKYHALT
LILRAEINRA RGRYDAAHED LAEALRHASQ TDRSGLRDMA ECGMAELEIQ TGHPRQAIDR
LMSSHPANTD QMGALQRAQA DRLLCLAHAR LGDGDTAINY GDAALAAFRS MPRPLLEART
LVALAEAHDT GGDKLSAQRD RESALEIFTR LGIPVEESRE VPLGHGQRPP HWDT