Gene Snas_4166 details

Gene Information

Locus tag: Snas_4166
Symbol:
ID: 8885367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 4459760
End bp: 4462909
Gene Length: 3150 bp
Protein Length: 1049 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: transcriptional regulator, winged helix family
Protein accession: YP_003512910
Protein GI: 291301632
COG category:
COG ID:
TIGRFAM ID:
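
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the coding sequence ends with a stop codon; neither is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4459760, 4462909)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the Gene Length (3150 bp) and Protein Length (1049 aa) fields.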


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.497343
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGATCG ACCTGTTGGG CCCGTTGCGA ATCCTCACCG GTGACCGGGA ACTGCCGATC 
TCGGGCGCCC GGCTGCGGGG CCTGCTGACG CTGCTGGCGC TGCACGCCGG GAGACCGGTG
ACGGCCGAAC GGATCGCGGA CGCGCTGTGG ACCGGCGAGG CGCCCAGCGC CAACACCGTC
CAGTCGCTGG TGTCGCGGCT GCGTGGCGTG CTCGGCGACC GCGAACTCAT CGAATCCGGG
CCCGGCGGCT ACCGGCTGGC GATCGATCAA TCGCATGTGG ACATCCACGC CTTCGAGGAG
CTCGCGGCCA CCGGCCGGGG CGCCTTGTCC GGCAAGGACT TCGCGACGGC GGCTCCCGCG
CTGCGACAGG CACTGGCGTT GTGGCGTGGC GAACTGGGCG AACTGTCCGC CGTCGACGAC
CGGGCCGCGG TGCGGGCGCT GACCCTGCGC GAGGAGGCCC GCGACGACCT CGCCGACGCC
GAACTCGCGC TGGGAAACGC CGCCACGATC CTGGCCGAAC TGCGCGAACG CGCCGCCGCG
CAGCCGTTCC GGGAACGCGT GCAGGCACAG TTGCTGCGGG CGCTGGCCGC CACCGGCGCG
TACGCCGAAG CCCTGTCCCG CTTCGAGACC GTCCGGGAAC GGCTCGCCGA CGAACTCGGC
GTCGACCCCG GCCCGGCGCT GACCGAGGCG CACCTGGCGG TGCTGCGCGC CGAGAGCCAA
CCGGCCAAAC CCGGCAACAA CCTCCCGGCG CCGCTGACCA GCTTCATCGG CCGCGACGCC
GACGTGGTGG CGCTGGAGGA CGCGCTGTCG ACCGACCGCC TGGTCATGGT GACCGGTCCC
GGCGGTTGCG GCAAGACCCG GCTGGCCGTC CACGTCGCGC ACCGGCTCGC CGACACCATG
GACGTCCGGA TGGCGGAACT GGCGCCGGTC ACCGAGGGCA GCGAGGTGCC GCACACGGTC
GTGACCGCGA TGGGCCTGCA CGAGGCCGCG GCCGGTTTCG GCGCCACCGT CACCGGCTTC
ACCGACCCGA TCGACCGGAT CGCCGCGGCC ATCCGCGACC AGCCCTGGCT GCTGGTCATC
GACAACTGCG AACACCTGCT GGACGCGGTG GCCGGGCTGG CGTCGCGGCT GTTGACCCGC
TGCCCCCGGT TGCGGATCCT CGCCACCAGC CGGGAACCGC TGCGCATGAC CGGCGAGGTC
ATCCGCCCGG TCCGGGCACT GCCCTACCCC TACGAGGACG TCAGTCTCGC CGAGGCCGCG
AAGTACCCGG CGGTGCGGCT GCTGGCCGAA CGCGGCCACG CCGCCAACGC CGCGTTCACA
CTGGACACCG GCAACCTGAC CGACGTCGTG GAGATCTGCC GCCGCCTCGA CGGCCTGCCG
TTGGCCATCG AACTGGCCGC GGCCCGACTG CGGGCCCTGA CCCCGCACCA GTTGGCGGTG
CGGCTGGGGG AGCGGTTCCG GCTGTTGACC GGCGGCGACC GCACCGCGCT GCCCCGGCAG
CGGACCCTGC GCGCGGTCGT CGACTGGAGC TGGGACATCC TCGACAAACC CGAGCGGCTG
CTGCTGGCCC GCATGTCGGT GTTCGTCGGC ACCATGACCC TGGAGTCCGT CGAGGCGGTG
TGCGGCGACG AACTCGACGA GACCGCCTAC ACGCTGGCCT CGCTCGTCGA CAAGTCGCTG
GTCTCGCTCG TCGGCGAGCG CTACCGGATC CTGGAGACCA TCCGCGAGTA CGGCTCCGAG
AAACTCGCCG CGATGGGGGA GTCGACGGCG CTGCGCCGTC GGCACGCCGA ACACTTCACC
GTCCTGGCCG AACAGGCCGA CGCGAACCTG CGCGGCCACG ACCAGGTCCG CTGGCTGGCC
CGGTTGACCA CCGACCACGA CAACATCATC GCCGCGGCCC GGCGCGACAT CGGCGACGGG
AACGTCGACC GGCCCGCCCG GGTGGTGTCG GCGATGCTGT GGTTCTGGTG GCTGCGCGGC
CAGCACGCCG AGGCCATCGA CCTGGCCGAA CAGGTGCTGG CGATGCCGGG CGAACCGGAG
CCGAGGCAGG CCGCCCTGGT CCGGGTCGCG TCCACCTTCG GGCTGTTCGA CATGGACATC
TCGCTGGCCG AGGCCCGCGG CCGGGTCCTG GAGGCCCTGG CGATCCGCGA CGCCAACGGC
GTCACCGACC CGCACCCGTT CCTGCGGATG CTGGAGCTGA TGGCCGAGAC GATGGGCGGC
AGCCCGCTCC GCATGATCCG GCTGGTGCAC CGACTGAACG AAGACCCGGA CCCCTGGATC
CGCGCCAGCG CGATCAGCTT CCGCGCCAAC GTCCTGCTCA ACTCCGGCCG CGTCGCGCGC
GCCGAACGGC TGTTCCGTCG CGCCGTCGAG CTGTACCGGC GGGTGGGGGA CCGCTGGGGC
CTGGCGGTGG CCGCTTCCGC GCACGTCGAG GTCGCGGCGC TGCGCGAGGA CCCGACCGAG
GCGCGCGAGC TCATCGAACT GGCGGTGCGC ACCGAATCCG AGTTCGGCAT CCACCCCGGC
AAGTCCCACA TCAGCACCCG GCTCGAGTAC TTCGTGCTGC GCGACACCGA CCCGTGGGAG
AAACTGCGGG AACTGGAAGC CGAGATCGAA GCCTGCCACC GGATCGGCAA CTTCGAGATG
ACCGGGTACA TGCACCTGCT CGCGGCACAG TACCTGCGTC TGACCGGCGA CCCCGACGCC
GCGCTGGCCC ACCTGAACAC CAGTAGGACC ATCCTGGCCC CGCATCTGGC GGCGATGAAG
GACCGGGGCG GCGGGCCACC GGACCTGCCG TCGCTGCTGC GCATCGTCGA AGCCCGGTTC
AGCCTGGACC ACGACCGGCT CGACGAGACC GAGACCCTGC TGGCCGAAAC ACTGTCGATG
GCCCTGCGTG TCGGCGAGGC GCAACTGATC GGCCGGATCC TGGAGACCGA GGCCGAACTG
CTGCTGCGAC GCGGCGACCA CGACACCGCC GCACGGCGAC TGGGTCAGGC CGAGCTGGCC
CGCGGCACCA GGAACGCCTC GTCCCCGGAC GTCGCCCGCA CCGAATCCCG GCTGCGGGAA
GTACTGGGCG ACAACAGGTT CCGCGAGCTC TACGACCGGA GCCGCACGGG TTCGCGCACC
GGGTTGTTCG CGGAACTGCG GGAATCCTGA
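
The gene sequence is printed in 10-base blocks, so the spaces and line breaks have to be stripped before any computation. A minimal sketch of how the GC content field could be recomputed from the sequence text is shown below; the helper names are hypothetical, and the record does not say how its 72% figure was rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a small example input.
first_line = "GTGCGGATCG ACCTGTTGGG CCCGTTGCGA ATCCTCACCG GTGACCGGGA ACTGCCGATC"
print(len(clean_sequence(first_line)), round(gc_percent(first_line), 1))
```

Passing the full gene sequence text to gc_percent gives the value to compare against the GC content field.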
 
Protein sequence
MRIDLLGPLR ILTGDRELPI SGARLRGLLT LLALHAGRPV TAERIADALW TGEAPSANTV 
QSLVSRLRGV LGDRELIESG PGGYRLAIDQ SHVDIHAFEE LAATGRGALS GKDFATAAPA
LRQALALWRG ELGELSAVDD RAAVRALTLR EEARDDLADA ELALGNAATI LAELRERAAA
QPFRERVQAQ LLRALAATGA YAEALSRFET VRERLADELG VDPGPALTEA HLAVLRAESQ
PAKPGNNLPA PLTSFIGRDA DVVALEDALS TDRLVMVTGP GGCGKTRLAV HVAHRLADTM
DVRMAELAPV TEGSEVPHTV VTAMGLHEAA AGFGATVTGF TDPIDRIAAA IRDQPWLLVI
DNCEHLLDAV AGLASRLLTR CPRLRILATS REPLRMTGEV IRPVRALPYP YEDVSLAEAA
KYPAVRLLAE RGHAANAAFT LDTGNLTDVV EICRRLDGLP LAIELAAARL RALTPHQLAV
RLGERFRLLT GGDRTALPRQ RTLRAVVDWS WDILDKPERL LLARMSVFVG TMTLESVEAV
CGDELDETAY TLASLVDKSL VSLVGERYRI LETIREYGSE KLAAMGESTA LRRRHAEHFT
VLAEQADANL RGHDQVRWLA RLTTDHDNII AAARRDIGDG NVDRPARVVS AMLWFWWLRG
QHAEAIDLAE QVLAMPGEPE PRQAALVRVA STFGLFDMDI SLAEARGRVL EALAIRDANG
VTDPHPFLRM LELMAETMGG SPLRMIRLVH RLNEDPDPWI RASAISFRAN VLLNSGRVAR
AERLFRRAVE LYRRVGDRWG LAVAASAHVE VAALREDPTE ARELIELAVR TESEFGIHPG
KSHISTRLEY FVLRDTDPWE KLRELEAEIE ACHRIGNFEM TGYMHLLAAQ YLRLTGDPDA
ALAHLNTSRT ILAPHLAAMK DRGGGPPDLP SLLRIVEARF SLDHDRLDET ETLLAETLSM
ALRVGEAQLI GRILETEAEL LLRRGDHDTA ARRLGQAELA RGTRNASSPD VARTESRLRE
VLGDNRFREL YDRSRTGSRT GLFAELRES
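
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted start codon and an initiating codon is read as methionine. The sketch below illustrates that relationship with a self-contained codon table; the function name is hypothetical, and this is not necessarily how the database produced its translation.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; table 11 uses the same
# codon assignments as the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(raw: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is translated as methionine
        if amino_acid == "*":
            break  # stop codon ends the protein
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGCGGATCG ACCTGTTGGG CCCGTTGCGA"))  # prints MRIDLLGPLR
```

The output matches the first block of the protein sequence. Translating the full gene sequence the same way should give a string that can be compared with the protein sequence once its spaces are removed.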