Gene Snas_5616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5616 
Symbol 
ID8886831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5969141 
End bp5972434 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table11 
GC content71% 
IMG OID 
Producttranscriptional regulator, winged helix family 
Protein accessionYP_003514339 
Protein GI291303061 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00878101 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0410106 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCATCA ACCTGCTGGG ACCCTTCGAG GTCCTCGACG CCGACGGCCG CACCGTCGAC 
GTCGGCGGCC CCCGGGTGCG CGCACTGGCC GCCCGTCTCG CCCTCGCCGA CGGCCGTACC
GTCTCCGCCG CACTGCTGAT CGACGACCTG TGGGGCGACA ACCCGCCCGC CGGAGCCAGC
GGCACCCTGC ACCGGCTCGT CTCCCGGCTG CGGAGTGCGT TGCCGGACAC CGGAACGGGC
CACCCCCTGC GCTCCGAACC CGGCGGCTAC CGACTGGAGG CCACGGTCGA CGCCCGCCGC
TTCGAGGACC TCGCGGCGTC CGGACGCCAA GCACTCGCCG ACACCGACCC CGCCACAGCG
GCCCGGCTAC TGCGCGATGC CGAACGACTC TGGCGCGGCC CCGCGCTCGG CGGCCTGGGC
GAAGCCCCGT ACCTGACCGG CGCCACCGCC CGACTGTCGG ACCTGCGACT GCGGGTCTCG
GAGCATCGGT TCGAGGCCGA ACTCGCCCTG GGGCGACACG CCGCCGTCGC CACCGAGGTC
GAACAGCTGG CGGCGAACCA CCCGCTGCGG GAACGCTTGC AGGGCTTGCT GATGCGGGTG
CGATACGCCA CCGGACGGCA AGCCGAGGCG CTGGCCGTCT TCGAGCGGGT CCGCGCCGAA
CTGGCGGACC GGCTGGGAGC GGACCCGTCG CCGGAGCTGG CCGCTGTCCA CGTCGCGGTG
CTTCGCCAGG ACCCCGACCT GGCCCCGAAC CCACCCCACC CGACGCCGTC GGCCGCGCTG
ACGAGCTTCG TCGGCCGCGA GTCCGAACTA GACCAACTGC GTTCGCTGCT CGGCCGCGAG
CGACTGGTGA CGATCCTCGG CCCCGGCGGA GCGGGCAAGA CCCGACTGGC CCGCGAGCTG
CTCGTCCAGC TGTCCCCGGA CGGCGGCGAA ACCCGCTTCG TTGAACTGTC CATGGTCGAC
GGTACTGCGG GGCTGATCCC CGCGATCCTG GACAGCCTCG GCACCCGCTC GCCGTTCCCG
GGCGGCCCGG CCGATACCGA CGGCCACAGC CGACGTGACT CGGGCGCCGG TGAGTCCATT
ACCGCCGACT CGCCCACCGA GTTCGAAGCG CTCACCGCCA CACTGCACGG CCGCGCGCTG
CTTCTGGTCC TCGACAACTG CGAACACCTC GCCACCGACG TCGCCATCCT CGTCGAGCGA
CTGCTCGACA CCAACCCCCG GCTACGGATC CTGTGCACCG GTCGGCAACC GCTGGACATC
GCGGGCGAGC AACGCTTCCC CTTGCCGCCG TTGGGTTTGC CAAACTCAGA CCGACCCACC
ACCGAAGCCG CCGCCGTCCG GCTGTTCGCC GACCGCGCCG CGAAGGTGCG CCCCGGCTTC
ACCGTCACCG ACGCCAACGC CGAAACCGTC GCGGAGATCT GCCGCCGCCT CGACGGTCTG
CCGCTGGCCA TCGAACTGGC CGCCGCCCGG ACCCGCATCA TGACGCCGTC CCAGCTCGCC
CACCGTCTCG ACGACCGGTT CCAGCTGCTG ACCGGCGGCG TTCGCACCGC CGATGCCCGG
CACCAGACGC TGCGAGCCGT CGTGGAGTGG AGCTGGGATC TGCTGGACGA GCCGGAGCGG
CGGTTGGCGC GGCGGTTCTC GGTATTCGCG GGTGGGGCCA CGCTGGCCGC CGTCGAGGCT
GTCTGCGGTG GACCGGATCT GCCCGCCGAC CGGATCCTCG ACGTCGTCGC CGCTCTTGCC
GACAAGTCGC TGCTCGAAGC CACCGACACC GACGGTCCCG AGCCGCGCTT TCGCATGCTG
GACACCATCC GCGCCTTCGC CACCGATCAG CTAGCCGAGG CGGGCGAGAG GTCCGGCCCC
GGCCATGCCG CCGCCCCTGA CGGCCCCGCC AGCCCGCACC CCACCGAAGC CACCGCCACC
CGCACCAAGC ACGCCCACTA CTTCCGCCAA CTCGCCGAGC ACGCCGATCC CCAACTACGT
GGCCCCGACC AGCCCACGGC CCTGGCCATG TTCCAAGCCG AGCAGCCAAA CCTGTCGTCC
GCCTTGCGAT GGGCCCTCAG CACCGCCGAC ACTGAACTCG CGCTGCGACT GGCCGCCTCC
CAAGTCTGGT ACCGACTGCT ACGCGGCTCC CGATACGACA CCGCCATCAC CGACGACGTC
CTCAACCTGC CCGGCGACGA CTTCCCCACC GAACGCTCCA CAGTCGCCAC CGCGCTCGCC
ATGGCGGGCG TCGGCTTCGC CGGAGTCGAC GTCCCCGTCG CACAGCGAGC CCTGACCGTC
GCACGCCACC ACATCGACCA AGCCGACCGC TCCCGACACC CGCTGCTGTC CCTGTGGGAA
CCGCTGCTCG CCTTGCACGA CAAAGACCTG CACGGCACCC GCCTCGAACT CGAAAAGCTG
CTGAACCCGA ACGACGCGTG GACCCAAGCC ACCGCGTCGC TGTTCCTCGG CTTCGTCCAC
AACCTCGACG GCGACACCTC GACCGCCCGC CACCATCTGG AGCAAGCCGC CGACCACTTC
GAACGCCTGG GTGACCGCTG GGGCCGGTTC CTCACCGCGC AGGCCCTAGC CCCGATACGT
TCCCTCAACG GCGATCCCAC CTCCGCCGCG ACCACCTACC GTGAAGCACT CGCCCACCTC
ACAGCACTGG GCACCACCGA AGACGTCCCG ATGCTACTGG CCCAAGTGGG CCACGAACTA
CTGCGCGCCA ACGACACCAA CGCCGCCCGC TCCGAACTGG AGTCCGCGCT GCGACTCGCC
GACCGCCACG GCAACCGCGA GGCCCGAATC TGGAGCCACT GCGGCCTAGG CGATCTCGCC
GTCGCAACCG CCAACCTCCC CGAAGCCGAA CACCACTACC GGCTCGCCCG GGAAGCGATA
GCCGCCGACT CTCCCACCCG CCGACTCATA CCCGTCATCG AAAGCCGCAC CGCCCGACTC
CTGCACGCCA ACGGATCCCC AGTAGCGGCA AGGTCCCGAC TCCAGACCGC CATCACCGCA
GCCCTCGCCG CCGACGACCT GCCGAGCCTC GCCTACGCCG CCAACACCCT CGCCGAAATC
GTCCTGGCCG ACGGCGATCC CGCCACCGCC GCCGAAATCC TCGGCCTGGC CGAAACCATC
CGCGGCGCCC CCGACCAAGG CGACCCCAAC GTCGCCCGCA CCACCACTGC GGCCCGTACC
GCTCTCGGCA CCGCTGACTT CACCACCGCT CACCAGCACG GCGCACACCG TACCCGCGCC
GAGTCCCTGA AGCGACTGAC CGACCTCGCC GAACAGACCC CCGCCTCCGG CTGA
 
Protein sequence
MRINLLGPFE VLDADGRTVD VGGPRVRALA ARLALADGRT VSAALLIDDL WGDNPPAGAS 
GTLHRLVSRL RSALPDTGTG HPLRSEPGGY RLEATVDARR FEDLAASGRQ ALADTDPATA
ARLLRDAERL WRGPALGGLG EAPYLTGATA RLSDLRLRVS EHRFEAELAL GRHAAVATEV
EQLAANHPLR ERLQGLLMRV RYATGRQAEA LAVFERVRAE LADRLGADPS PELAAVHVAV
LRQDPDLAPN PPHPTPSAAL TSFVGRESEL DQLRSLLGRE RLVTILGPGG AGKTRLAREL
LVQLSPDGGE TRFVELSMVD GTAGLIPAIL DSLGTRSPFP GGPADTDGHS RRDSGAGESI
TADSPTEFEA LTATLHGRAL LLVLDNCEHL ATDVAILVER LLDTNPRLRI LCTGRQPLDI
AGEQRFPLPP LGLPNSDRPT TEAAAVRLFA DRAAKVRPGF TVTDANAETV AEICRRLDGL
PLAIELAAAR TRIMTPSQLA HRLDDRFQLL TGGVRTADAR HQTLRAVVEW SWDLLDEPER
RLARRFSVFA GGATLAAVEA VCGGPDLPAD RILDVVAALA DKSLLEATDT DGPEPRFRML
DTIRAFATDQ LAEAGERSGP GHAAAPDGPA SPHPTEATAT RTKHAHYFRQ LAEHADPQLR
GPDQPTALAM FQAEQPNLSS ALRWALSTAD TELALRLAAS QVWYRLLRGS RYDTAITDDV
LNLPGDDFPT ERSTVATALA MAGVGFAGVD VPVAQRALTV ARHHIDQADR SRHPLLSLWE
PLLALHDKDL HGTRLELEKL LNPNDAWTQA TASLFLGFVH NLDGDTSTAR HHLEQAADHF
ERLGDRWGRF LTAQALAPIR SLNGDPTSAA TTYREALAHL TALGTTEDVP MLLAQVGHEL
LRANDTNAAR SELESALRLA DRHGNREARI WSHCGLGDLA VATANLPEAE HHYRLAREAI
AADSPTRRLI PVIESRTARL LHANGSPVAA RSRLQTAITA ALAADDLPSL AYAANTLAEI
VLADGDPATA AEILGLAETI RGAPDQGDPN VARTTTAART ALGTADFTTA HQHGAHRTRA
ESLKRLTDLA EQTPASG