Gene Snas_0687 details


Gene Information

Locus tag: Snas_0687
Symbol:
ID: 8881871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 725043
End bp: 728054
Gene Length: 3012 bp
Protein Length: 1003 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003509493
Protein GI: 291298215
COG category:
COG ID:
TIGRFAM ID:
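
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it matches the reported gene length) and that the final codon is an untranslated stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 725043
end_bp = 728054

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is a stop
# codon and contributes no amino acid to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 3012 bp
print(protein_length, "aa")   # 1003 aa
```

Both values agree with the Gene Length and Protein Length fields of the record.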


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.622837
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTTTC GGATTCTCGG ACCTCTAGAC GTCCGGCTCG ACGGCTCGAC CGTCCCCATT 
CTCGGCCAGC ATCAGCCGAA GCTGCTGGCG CTGTTGCTGC TGGAGAACGA CAGGACGGTC
TCACTGGGGC GGATGGTGGA CGCGCTGTGG GATGACGACC CCCCGGCGAC AGCCAAGCGT
CAGGTCCAAA ACGCCATGGC GGCGCTGCGG CGTTCACTGA CCGAAGCCGA GCTGGACCCG
GTAGCGCGAG TCGGCGAAGG CTACCGATTG ACCACGTCGG AACTGGACCA CAGCGAATTC
ACCACATTGG TCCGGCGTGG ACGCGGCGCC GCTGAGGCGG GCCGGTTCGA CACCGCCTTC
ATGACCTTGA CCGACGCTTT GGGTGTGTGG CGAGGGCCCG CGTTGGCGGG TATCCCCGGT
CGGATCTTCG AAGCAGCCGC CAATCGGTTG GAAGAGGAGC GGCTGTCGGT GATGGAGGAC
AGATTCGCCG CGGCGCTGGC CTCGGACAGA AGCGCGGAGT TCGTCGGCGA GCTGCGAGAG
CTGGTCGCCG AACACCCATA CCGGCAGCGA TTCACCCAGC ACCTCATGAC CGCGTTGCAC
CGCGTTGGGC AGACCGAGGT GGCATTGGCG GCCTTTGATC ACCTCGGCTC ACGGCTTCGC
GACGATCTGG GACTGGATCC CGACCCGGAC CTGCGACGAC TGCGTGACAG CATCCGCGAC
GGCGAGGACT CCACGATCCA CTCTTCCGCC GTCGTGTCGA CGCAGGCACC GCCACCAGCT
CAGCTTCCGG CGAGCATTCG CGGGTTCATT GGCCGCGGTG AACAGCTTTC CGAACTCGAC
GCATTGCTGG CGGAGAATCC TCAGCATTCG GTCGCGGTGC TTAGCGGAGT CGGCGGTTCG
GGGAAGACCG CGTTGGCTAT ACATTGGGCC GCCGAAAATC GAGAACAGTT CCCCGATGGA
CAGCTCTATG TGAACCTTCG CGGATTCGAC ACGACGGAAC CGGTCAAACC CGTCGACGCG
CTGCATGCCT TCATCCGGGC ATTGGGGCAC AACGGCGATT CACCAGCCAG TATCGATGAC
GCGGTGACGC TGTATCGCTC CCTCCTTGCC CGGCGGCGCG TCCTTGTCGT CTTGGACAAC
GCGCTGAATG CCGACCAGGT ACGGCCGCTG CTGCCAATCG GCGTCAACGT AACGCTGGTG
ACCAGCCGGG AACGCCTCAC CCCGCTGACC ACGACAGAGT CTGCCCAATC GGTTTCACTC
GACGCGTTGA GCCGGTCGGA GGCGTTCGAC TTGCTGACCG TGATGATCGA TACACGGCGA
CTGCACGAGG ATGAGCTCGC GGTCTATCGA CTTACCGACC TGTGTGGACA TCTTCCACTG
GCTTTGCGCA TGGCTGGCGC GAACCTTGCC AATCGTCCAC ACACCTCCGT CGCGACGTTC
GTCGACGAAC TGGACAGTTC CTCAGACCGG CTCGAACTGC TGGCCGTGGA GGGTGATCCA
AAAGCGGCAC TGACATCGGT GTTCGATCGG TCGATCGCCG CGATCAGCGA CGAAGCTCGA
CTCTTGCTGC TTCGGCTGGG CTTCATCCCG ACTGCCGACT ACGCCGAAGA GCTCGTCATC
GAACTCAGCA CGGAGGAAGC CGACAAGACA CGGGAGTTGT TGACGCGGCT CGCCGACGCG
CATCTGCTTG AACGCCATCG GCCCAACCGA TACCGTTTTC ACGACTTGAT CAAGGCGTAT
GTCAGCACAC GGCTGACCGT CGAACTGGAT GAAACCACCC GTGATGAGCT GGTGCGACGT
TTCACCGACT GGCAAGTATC AACGCCTCGC GGTGAAGAGT TCGACAACAT CGTGGAAGCT
TGCCGAGCCT GGCGTCGACG GCCACGTATC TGGAAACTGG CCGCCGGGCT CCGCCGACTG
CTGCATTGGA CGGTGAACGT CCACACCGTC ACGCAACTCG CGACCCAGCT GTTGGCGGAA
ACGATCGAGG CCGGCGACCC TTACGGCGAA ATCGAGATGC GTCATCTGCT AGGAATGACG
GCTTGGGTGG CCGGTCATGC AGACGAGGCC GAGCGTTACT GCCGAGAGGC CATCGCCTTG
GCCGCCGTGC AAGGCGATGG GGACGCGAAC GGAACGATCC GAACCAACCT GTCTACAGTG
CTTTCCACTC GGGGCGGCTT CGACGAAGCG GCGCAACTGC TGGCCGAGAC CCTCGAACTC
GCCGAACAGA CGGGCGCGAC GGTGGACCGC GAGGTGCGAG CCCTCAATCT GGGAGCTCTG
TATCGCCAAC TCGGACGATA CGACGACGCT TGGCGCATCG TCGTCAGTTC AGAAGTCGAG
ACTCGTACGA CCGTGCCGCA AAGTACCGCC TTGCCCGCGG GCATGCTGTA CTTCGACATG
GGCCGCTACG CGGACCTCAA AGCGTGGCTG AGCTGGGTCC TGGCCGAAGA GTATTCCCAG
CTCGTGCCGC CCCAGGTCCG CACCATCGCG CTGGGGCTGC GAGGTGAGGT GCACCGTATC
GAGGGCGAAT ACGAAGCAGC CCGGACCGAT CTCGATCAGG CCATCGCCAT CGCGGAACGC
TCCGGCAGGA CCGCGATCGC GTACGAGGCG CGCTGCACAC TGGCCCAGCT GCACTGTGAC
ACGGGTGAGG CCGAACGCGG GCTGGCTCTC GTCGAGCACG TTCTCGCCAC ACCCGACGCC
GCCAGAAAAC CTGATCTACT CGCGCTGGCC CGGTTGACAG CTGCCCGAAT AAGTCGGCGA
ATCGGCGAGT TCCCGACAGC GAACCACCAT GTGTCGGCCG CGTTGGAGAT ATCCGAGAGC
ATCTCCCAGC CGCTGCGGAT CGGCCGATGC CTGCTGGAGA AGGCCCTGCT GTGTGCCGAC
CTCGGCGACG GCTCCCAAGC CGAAACCCTC GGTCGCCAGG CACGAGACAT CTTCGACGAT
CTCGGCGTCC CGGAGGCGGA GCACGCGCGC TCGCTGCTCG ACACGCTGCG CGCCGCTCCC
CTGCAGCGGT GA
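
The gene sequence is printed in blocks of ten bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of that step, together with a GC fraction calculation of the kind that could be compared against the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked, multi-line sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGGAGTTTC GGATTCTCGG ACCTCTAGAC GTCCGGCTCG ACGGCTCGAC CGTCCCCATT"

seq = clean_sequence(first_line)
print(len(seq))                     # 60
print(f"{gc_fraction(seq):.0%}")    # GC fraction of this first line only
```

Passing the complete listing to `clean_sequence` instead of a single line would give the GC fraction of the whole gene.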
 
Protein sequence
MEFRILGPLD VRLDGSTVPI LGQHQPKLLA LLLLENDRTV SLGRMVDALW DDDPPATAKR 
QVQNAMAALR RSLTEAELDP VARVGEGYRL TTSELDHSEF TTLVRRGRGA AEAGRFDTAF
MTLTDALGVW RGPALAGIPG RIFEAAANRL EEERLSVMED RFAAALASDR SAEFVGELRE
LVAEHPYRQR FTQHLMTALH RVGQTEVALA AFDHLGSRLR DDLGLDPDPD LRRLRDSIRD
GEDSTIHSSA VVSTQAPPPA QLPASIRGFI GRGEQLSELD ALLAENPQHS VAVLSGVGGS
GKTALAIHWA AENREQFPDG QLYVNLRGFD TTEPVKPVDA LHAFIRALGH NGDSPASIDD
AVTLYRSLLA RRRVLVVLDN ALNADQVRPL LPIGVNVTLV TSRERLTPLT TTESAQSVSL
DALSRSEAFD LLTVMIDTRR LHEDELAVYR LTDLCGHLPL ALRMAGANLA NRPHTSVATF
VDELDSSSDR LELLAVEGDP KAALTSVFDR SIAAISDEAR LLLLRLGFIP TADYAEELVI
ELSTEEADKT RELLTRLADA HLLERHRPNR YRFHDLIKAY VSTRLTVELD ETTRDELVRR
FTDWQVSTPR GEEFDNIVEA CRAWRRRPRI WKLAAGLRRL LHWTVNVHTV TQLATQLLAE
TIEAGDPYGE IEMRHLLGMT AWVAGHADEA ERYCREAIAL AAVQGDGDAN GTIRTNLSTV
LSTRGGFDEA AQLLAETLEL AEQTGATVDR EVRALNLGAL YRQLGRYDDA WRIVVSSEVE
TRTTVPQSTA LPAGMLYFDM GRYADLKAWL SWVLAEEYSQ LVPPQVRTIA LGLRGEVHRI
EGEYEAARTD LDQAIAIAER SGRTAIAYEA RCTLAQLHCD TGEAERGLAL VEHVLATPDA
ARKPDLLALA RLTAARISRR IGEFPTANHH VSAALEISES ISQPLRIGRC LLEKALLCAD
LGDGSQAETL GRQARDIFDD LGVPEAEHAR SLLDTLRAAP LQR
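
The protein sequence corresponds to the gene sequence read codon by codon. As a rough illustration, the sketch below translates the first and last few codons of the gene with a codon table built from the compact NCBI layout (bases ordered T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, so the standard assignments are used here. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the compact NCBI layout: codons enumerated with
# bases in the order T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 12 bases of the gene sequence, copied from the record.
print(translate("ATGGAGTTTCGGATTCTCGGACCTCTAGAC"))  # MEFRILGPLD
print(translate("CTGCAGCGGTGA"))                    # LQR*
```

The output matches the first ten and last three residues of the protein sequence above, followed by the stop codon.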