Gene Snas_4139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4139 
Symbol 
ID8885340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4426906 
End bp4429857 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content61% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003512883 
Protein GI291301605 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.180514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGACTTTC GGGTTCTTGG CCCACTGGAA GTGACGAGAG CTGAGTCTCG TGTCGACGTG 
CGAGGCCGCA TCGGGAAGCG GCTACTGGCG GTGCTTCTGA TTGATGCCGG ACACGGTGTC
TCCATGGCCC GCCTCATCGA GGGCATATGG GACGGTGATC CACCGGAAAC GGCGCTTCGC
CAAGTCAGAC ACGCCGCTTC TCGGCTGCGA AAAGTCTTGG GGGCTGAGCG CTTGGAATCG
GCCGGGGACG GTTATCGCCT CAATATGGAT GGCGCCACCT GCGACGCGAT TCTCTTCGAA
CAGAAGGTAC GTGAAGCCCG TGAACTGTTC CTGCGGCGGG ACAACGCGCA GCGGCTAGCG
GTCTTGCGCA CGGCACTCGG ATTGTGGCAG GGCAGCGCGT ACAGCGGCCT GGAAGGCAGG
CTCATCAACA CCGAGGTAGC CAGGCTCGGA GGAATATGGC TGGCCGCGCT GGAGGAACGC
CTCGACGCCG AACTCGCACA CGGCGACCAT TCGCAGGTGA TCGGTGAGCT TCGGCTGCTG
GTCCTGGAAC ACCCGTTCCG GCAACGCTTG ACCGACTTGC TGATGCTGGC GCTGTATCGA
GACGGCAGGA CACCTGAGGC ACTCGAAGCG TACGAGCGTT TGCGGACAGA CCTCGCCGAG
AACCTGGGCC TCGATCCGGA CGTCTCGTTG CGGGAAAGAC ACGCTGCGAT ACTGCGACGC
GATCCGGTAC TGGACCTGTC CGACAACGCC TCGACTCGTC CGGACCGGCC CGTGCGGCCG
TCGTATGTTC AGCCCGGAAA GTCCGTGCCC GCTCAACTGC CGATCAAACC CGCGGGATTT
GTCGGACGCA ACATCGCGCT TACGGCATTG ACCAGGCAGT TTCACGACAG CGACAAGTCT
CGTTCCTGTG TGGTTGTTGG AGCCGCCGGG GTCGGCAAGA CCGCTCTGGC CATTCACTGG
GCACATGAGA ACCGCGACCA GTTCCCCGAC GGGCAGTTCT TCATCAATCT GCGCGGATTC
GATTCCGGTG CCCCCGTCTC GGCACATGAA GCACTCGGCC GTTTCATCCG AGCGCTGCGG
GATCCGAGCG TTTCGATACC ATCCGATGTC GACGAAGCGG CAGCCTTGTA TCGGTCATTG
ACCGACGGAA AACGCATACT GGTCGTACTC GACAATGCCA AGAACGCCGA CCAGATTCGA
CCTCTCATAC CGTCGTCGCC GAACGCCTTC ACGGTTGCCA CCAGCCGAAA TCGGCTCACA
GGACTCACCG CCGTTGATGA CACGGTGCCG CTTTTCCTTT CGCCCTTGAA TCACCTTGAG
TCCGTCGATC TTCTGGCCAA ATCCGCGAGT ACGGCTGGCT TGAGCATCGA CCGTGGCTCC
ACGCGTCGTA TTGCCGAACT TTGTGGACAT CTGCCGTTGG CGTTGCGGAT CTCGGCCGCG
CTACTCGTGG ATGGCTCGGG ACGCACCGCC AAGCAGTTGG CCGATGAACT CGGCAACCCG
GACCGCCTTG ACCTGCTTTC CGTCGACGGC GACTCGACGA TTGCCACTGC CCTGGACCAG
TCTTTCCAGA GTTTGACAGC AGAAGCCCAA CAGCTGCTGT GCCAGCTCGC TCTCATCCCC
GGTGATGACT TTCCGCAAGC GTTGGCATTG GAACTGGGCA CTGGCAAGGA CTTCGACCGC
TTGACCACCG CGAGCCTCAT CGATGAGCAT CGCCCAGGTC GATACCGGTT CCATGACCTC
ACCCGCGAGT ACGCCAAGAA AAAGGCTCAA GCGACCGGCA GCGAGATTCA CGTCGCTAAT
CAGGTAATCG GTTGGTACCA CGACAACTGC CGCGTGCTCA ACAGTCACGA CTACAACAAT
GTGATCGCCG CGGTTTCGGC TTGGCGTGCT CACTCAAGCT TTTGGCGTCT GGTTCACACG
GTGGTGGCGT TTGTGCGGTT GGGTGAGAAC ATATCGGTGG CCGCCCCGCT GGTCGATGAC
CAGCTCGCGG CCGCCCTGGA ACGTGGGGAC ACCGAAGCAG CACAAGCCAT GTTTGATGTG
AAGGCGCAGC TGTGCGCCGA AGCGGGTGAC AACCCAGGGG ATGTGAAGTT CGGACAGAAG
GCGTTGGACA TACACCGCGA CGAAACGGGT TTCTACCATT TCACTCACGG AGATGCCCGC
TTCAGCAATG GTGAGATGAC CCTGGCCGAA GACCACTTTC GAACGGCATT GCGGCTAGCC
AACGAGGCGC AACACCAGTA CACGATTCTC GCGAGCACTG CCGCCTTGGC CAATCTTTAC
CGCGTAACAG GTCGCTATGC CGAGGCCGAG ACGCTGTTCG ACGGCGCGCG CAAGTACTTC
GACCACAATC CCACCGGTGC AATGGAAATA GACGCAAGGA TGGCCTTTGT GACCTTCTTG
TGCGAGACGG GGCGTGTTGA TGAAGCCGAA GGATTGCTGC GACCCATCCT TGATTCTGAA
GTGGAGCTGG ACCCCGGTTC GTGGACGCAG GTGATCATGT TCCGCGCTGA TATACGCCAC
GCGCGTGGTC AATACCGTGA GGCGATACAG GATTTTGAAC ATGCCCTGGA ACTGACAGCT
GACAACAGCC GGATACAGTC GCGCCTCATC AGGACCAGCA TCGCCGACAT CTATTGCGAT
CTCGAGGATT ACGACAACGC CCTGAAGCAG TTGGAACTCA TTGACCTGGA CGCCGCGAGT
ACGAAACTGC GCGCCATCGG AATGCTTCAA CTCGCACGTG CCCACAACGG CAAGAAAGAC
CATGCGCGGG CGAAGGCCGC TGCCACCTAC TCGGCGGAGG TCTTCGCCGA GACCAGACTC
CGCTCGCACG CGCTCGCCCT CACAGTCCTC GCGGACGCAC ACGAAGGACT GGGCGAAGTC
AAAACGGCCC GCACAACGCG AGAACGAGCT CTGCGGATAC TGAACGAGCT GGGGCTACCC
GAGCCCGGCT AG
 
Protein sequence
MDFRVLGPLE VTRAESRVDV RGRIGKRLLA VLLIDAGHGV SMARLIEGIW DGDPPETALR 
QVRHAASRLR KVLGAERLES AGDGYRLNMD GATCDAILFE QKVREARELF LRRDNAQRLA
VLRTALGLWQ GSAYSGLEGR LINTEVARLG GIWLAALEER LDAELAHGDH SQVIGELRLL
VLEHPFRQRL TDLLMLALYR DGRTPEALEA YERLRTDLAE NLGLDPDVSL RERHAAILRR
DPVLDLSDNA STRPDRPVRP SYVQPGKSVP AQLPIKPAGF VGRNIALTAL TRQFHDSDKS
RSCVVVGAAG VGKTALAIHW AHENRDQFPD GQFFINLRGF DSGAPVSAHE ALGRFIRALR
DPSVSIPSDV DEAAALYRSL TDGKRILVVL DNAKNADQIR PLIPSSPNAF TVATSRNRLT
GLTAVDDTVP LFLSPLNHLE SVDLLAKSAS TAGLSIDRGS TRRIAELCGH LPLALRISAA
LLVDGSGRTA KQLADELGNP DRLDLLSVDG DSTIATALDQ SFQSLTAEAQ QLLCQLALIP
GDDFPQALAL ELGTGKDFDR LTTASLIDEH RPGRYRFHDL TREYAKKKAQ ATGSEIHVAN
QVIGWYHDNC RVLNSHDYNN VIAAVSAWRA HSSFWRLVHT VVAFVRLGEN ISVAAPLVDD
QLAAALERGD TEAAQAMFDV KAQLCAEAGD NPGDVKFGQK ALDIHRDETG FYHFTHGDAR
FSNGEMTLAE DHFRTALRLA NEAQHQYTIL ASTAALANLY RVTGRYAEAE TLFDGARKYF
DHNPTGAMEI DARMAFVTFL CETGRVDEAE GLLRPILDSE VELDPGSWTQ VIMFRADIRH
ARGQYREAIQ DFEHALELTA DNSRIQSRLI RTSIADIYCD LEDYDNALKQ LELIDLDAAS
TKLRAIGMLQ LARAHNGKKD HARAKAAATY SAEVFAETRL RSHALALTVL ADAHEGLGEV
KTARTTRERA LRILNELGLP EPG