Gene Snas_6374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_6374 
Symbol 
ID8887600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6722911 
End bp6726039 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranscriptional regulator, winged helix family 
Protein accessionYP_003515083 
Protein GI291303805 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGGGC CGTTCGAGGT TCGCCGGGAA GACGGCAGTA CCGCCGACGT ACCGGGCGCC 
CGGTTGCGGG GCCTGTTGAT CGCGTTGGCG CTGAACCCGG GACGGGTGGT CCAGAAGGGG
ACGCTCGTCG ACTGGATCTG GGGCGAACAG CCGCCCTCCG ACGCCGCGAA CGCCTTGCAG
CGCTTGGTGT CGCGGTTGCG GAAGGTGCTG CCGCCGGGCT CGGTCGAGGG CCAGACCGAC
GGGTACCGGC TGAACCTGGA TCCCGAAGCC GTTGACGCGG TGCGGTTCGA GCGGCTGGTG
AGCCGGTCCC GCGACGACGA CAACCCGCTG CGGGTGCGGC GGCTGCGAGA GGCGCTCGAC
CTGTGGCGGG GCGCGGCCAT GCAGGACGTC GGGCTGCCCG ACAGTGCCTC CTTCGACGCG
GCCGTCACCC GGCTGGAGCG GCTGCGGTTG AGCGCCGTCG AGGAGCGCTT CGACGCCGAG
GTCGGGCTCG GCCACGGCGC GGACCTGGTC GAGGAACTGA CCGATCTGGT GGCCGCGAAC
CCGGGCCGGG AGCGGCTGGC CGCCGCGTTG ATGCGGGCGC TGGTCATGGC GGGACGCGAC
AACGAGGCGC TGCTGGTCTA CGAGCGCACC AAGGAGGCGC TGGCCGACGC GCTGGGTGTC
GACCCCTCGC CGGAGCTGTC GGAACTGCAC GTCGCGCTGC TGCGGGGCGA GCTGGGACGG
CAGGAGGAGA ACCGTGACAC CAACCTGCGC GAGGAGCTCA CCAACTACAT CGGCAAGGAC
GCCGACGTCA CCGCGGTCGG CAAGCTCATC GGCGAACACC GGCTCACCAC CCTGACCGGA
CCGGGCGGCT CCGGGAAGAC CCGGATGGCG ACCGAGACCG CGCGCACGCT GCTGACCGAC
CTGCCCGACG GGGCGTGGCT GGTGGAACTG GCGGCCATCG GCGCCGACGG CGACGTGGCG
CAGGCGGCGC TCACCGCGCT GGGGCTGCGG GACGCGCTGC TGGGCGAGCC GTCCACTCTG
GATCCGACCG AACGGCTCGT CGCCGCGTTG CGCGACCGGG ACGCGCTGTT GATCCTGGAC
AACTGCGAGC ACGTCATCGA GTCGGCCGCC GCCCTGGCGC ACCGGCTGCT GGGCGAGTGC
CGACGGCTGC GGATCCTCGC CACCAGCCGG GAACCGCTCG GCATCACCGG TGAGGCACTG
TGGCCGGTCG GGCCGCTGGA GCTGCCGGAC GAGGACGCCG ACCTCGACGC GATCGAGGCC
GCGCCGGCGG TCCAACTGCT GCGGGACCGG GCCCGGGCGG TGCGCAAGGA CTTCGCCGTC
GACGCCCCCA CCCTGGCGAC GATGGCCCGG ATCTGCCGGA CGTTGGACGG GATGCCGCTG
GCGATCGAAC TGGCCGCGGC CCGGTTGCGC ATCATGACCA TCGAACAGCT CGCCACCCGG
CTCGACGACC GGTTCCGGGT GTTGACCGGC GGCAGCCGCA CCGCGCTGCC CCGGCACCGG
ACGCTGCGCG CGATGGTCGA CTGGAGCTGG GAACTGCTGT CCGACGCCGA ACGGACGGTG
CTGCGTCGGC TGTCGGTGTT CGCGGGCGGA GCCAGCCTGG AGGCCGCCGA ACGCGTCTGC
GGCGGCGACA CGGTCGAAGT CGACGAGGTG CTGGAACTGC TCGCCGCGCT GACCGAGAAG
TCGCTGCTGG TCACCGGAGG CGAAGGCGCG CCGCGCTACC GGATGCTCGG CACCATCAAG
GAGTACGCCG CGCAGCGGCT CGCCGAGGCC GGGGAGTCGG AACTGGCGCG CCACGCGCAC
CTGGCCTATG TCACCGAGCT CGCCGAGACC GCCGACCCGC AGCTTCGCCT CGGCGACCAG
CTGAAGTGGA TCGCCGTGCT GGAGGCCGAA CGCGACGACA TCAGCGCCGC GATGCGCGGC
GCGATCGCCG CCGGGGAAGC CCAGGCCGCG ATGCGGCTGG CTGCGGGCGC CGGTTTCTAC
TGGTGGCTGT GCGGACGCCG CACCGAGGGA ACGGAACTGG TCACCGCCGC CAGCGAACTG
CCCGGCGACG TCGACGACGA GACCCGGGCC ATCGTGTACG GACTGGCCGT GTCGTTCGCC
AGCGCCGGAC GGGCGTCCGA CGAGAACCAC GTCGAGGAAC TGATCCACAA GGCGTACCAC
TACGCGCAGC GCAGCGATTC CCGCAATCCG ATGCTGGCCA TCGCCGTCCC GCTGGAACGC
ATGATGCAGG GGCCCGACCA GATCCTGCCC GCCTGGGAAA CGTTGCTGGA CAACGAGGAC
CCGTGGGTGC GCGCGCTGGC CCGGTTGCAT CTGGGCAAGT CGCGGATCCT GCTCGGCCAC
GGTGGGCCGG AAGCCGACGA GAGCCTCGCA CAGGCGCTCA CCGAGTTCCG GGCGCTGGGC
GAACGGTTCG GCATCTCCTT CGCCCTGACC GAACTGGCCG ACCGCGTCGC CATGCGCGGC
GAGTTCGCCG CCGCCTGTGA GTACTACGAG CAGGCGGCCG TGGTCGTCAC CGAGGTCGGC
GCCTTCGAGG ACGTCACCCG GATGCGGTCG CGGCAGGCGC TGCTGTACTG GCTGCTGGGC
GACACCGAGG CCAGCGCCGC CGCGATGGCC GAGGCCGAAC GGCTCGCCGA GCGGGTCACC
TGGCCGGAGG CGCTGACGGA GCTGACCTTC GCGAAGGCCG AACTGGCGCG GTGGCGTGGC
GACACCGACG AGGTGCGCCG ACAGCTCGAC GTCGTGACGT CGCTGCTGAC CGGAATGACG
GAACGGCCGA CCGTCCGGGT ACTGACACAC AGCCTGCTGG CCTACCTCGC CGAGGATCTC
GACGAGGCCC GGGAACACTG CGCCGTCGTC TACCGGTCGA TCGTCGAACT GGGGCACCCC
GCCCTGATCG CGCACGGGCT GCTCATGATC GCGGGCCTGG CGCTGCGGCG CGGACAGTAC
GAGCAGGCCG CGCGGCTGCT GGCGGCGAGC GAGGCCGTGC GCGGTCTGCC GGACCGCTCG
CAGCCGGACA CCGACCGCAT CGAGCGGGAA ACGCGAGACC GCCTCGGCGA CAAGGAGTAC
GCCGAGGCGG TTCGAGAGGG AACTGAGACG AGCTGGACTC AGCTGGTCGA GGTCACGCTC
GCTTCTTGA
 
Protein sequence
MLGPFEVRRE DGSTADVPGA RLRGLLIALA LNPGRVVQKG TLVDWIWGEQ PPSDAANALQ 
RLVSRLRKVL PPGSVEGQTD GYRLNLDPEA VDAVRFERLV SRSRDDDNPL RVRRLREALD
LWRGAAMQDV GLPDSASFDA AVTRLERLRL SAVEERFDAE VGLGHGADLV EELTDLVAAN
PGRERLAAAL MRALVMAGRD NEALLVYERT KEALADALGV DPSPELSELH VALLRGELGR
QEENRDTNLR EELTNYIGKD ADVTAVGKLI GEHRLTTLTG PGGSGKTRMA TETARTLLTD
LPDGAWLVEL AAIGADGDVA QAALTALGLR DALLGEPSTL DPTERLVAAL RDRDALLILD
NCEHVIESAA ALAHRLLGEC RRLRILATSR EPLGITGEAL WPVGPLELPD EDADLDAIEA
APAVQLLRDR ARAVRKDFAV DAPTLATMAR ICRTLDGMPL AIELAAARLR IMTIEQLATR
LDDRFRVLTG GSRTALPRHR TLRAMVDWSW ELLSDAERTV LRRLSVFAGG ASLEAAERVC
GGDTVEVDEV LELLAALTEK SLLVTGGEGA PRYRMLGTIK EYAAQRLAEA GESELARHAH
LAYVTELAET ADPQLRLGDQ LKWIAVLEAE RDDISAAMRG AIAAGEAQAA MRLAAGAGFY
WWLCGRRTEG TELVTAASEL PGDVDDETRA IVYGLAVSFA SAGRASDENH VEELIHKAYH
YAQRSDSRNP MLAIAVPLER MMQGPDQILP AWETLLDNED PWVRALARLH LGKSRILLGH
GGPEADESLA QALTEFRALG ERFGISFALT ELADRVAMRG EFAAACEYYE QAAVVVTEVG
AFEDVTRMRS RQALLYWLLG DTEASAAAMA EAERLAERVT WPEALTELTF AKAELARWRG
DTDEVRRQLD VVTSLLTGMT ERPTVRVLTH SLLAYLAEDL DEAREHCAVV YRSIVELGHP
ALIAHGLLMI AGLALRRGQY EQAARLLAAS EAVRGLPDRS QPDTDRIERE TRDRLGDKEY
AEAVREGTET SWTQLVEVTL AS