Gene Snas_1496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1496 
Symbol 
ID8882684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1577359 
End bp1580442 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content56% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003510293 
Protein GI291299015 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.905196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACTTTC GGATTCTTGG CGCGCTTGAG GTTTGGAACG GTTCCGAGCG AATCAAGATC 
ACAGGTCGAT TGCATCCAAA GATCCTTGCC GGCTTGCTGT TGAGCACTGG ACGAACAGTG
TCGTTGCCGT GGCTGGTAGA CCTGCTTTGG AACGACGATC CCCCAGTGAC AGCACGACGG
CAGGTACAAA ACGCCGTAGC GGCACTGCGA CGACAACTGG AGGTCTTTCA CCCCGGCCTC
GTTCAGCAGG TTGGCCAGGA GTACCGCATC AACATCGCTG AAGAAAATCT CGACCTGCGC
CGATTCGAAT CGGCAACGCG AAAAGCTCGG CAACTTATAG AAACCGGCAG GTGGCAAGAA
GGCTTCGATG GCTTTCGTAC CGCCTTGGCG CTATGGCGCG GCCCAGCACT GAGCGGATTC
AAGGGCCAGG TTGTGGACTC TGCCGCCGCT CGCCTGAATG ACATGTATCT GTCCACCTAC
GAGGAATACG CCCAAGCTGC GTTGGTACTC GGTGAACCCG AACGCGTCAT CCCCAAACTT
CAGAAACTTG TTGGCGAGCA CCCTCTACGA CAGCGGCTTA CTGCTCGGTT GATGCATGCC
CTTCACCAAG CGCATCGCAC CCCGGAAGCA TTGCGCCTTT TCGATGGATT CCGTAAAGCT
TACGCCGATG AGCTGGGACT CGACCCTGGA ACCGAGCTCA TGAATCTACA CGTCCGGATA
GTGCAGGACG ACCCAGTGCT TCTCCCTCAG CCAGCATCCT CCAATTCCTT GCTGGACAGT
GCTTCGCGGC AAGGCACCGC CGTGTCCGCA GCCAAGCCTC AGAGCCCTGT TCCCGCTCAG
TTGCCTACGG ACGTCGCGAC GTTCACCGGT CGTAATGCGC ATCTGGCAGC TCTTGACCGC
TTGCGGGACA GTGGGGTGCG TACGGGGATC GTCACTGCGA TCACAGGAAT CGGCGGAGTA
GGCAAGACCG CGCTGGCCGT TCACTGGGGC CATGCGAGGC GGGAGAACTT TCCGGACGGG
CAGTTGTACA TCAACCTGCG GGGATTCGAT GAACGCAAAC CCTTGACTCC GCATGAGGCG
ATTTCTCGGT TGCTGCGAAC ATTGGGACAA CCCGCCAACA CCATTCCTTC CGATCTGGAG
GAAGCCGCTG GTCTTTACCG GTCGTTGTTG GCTGACAAAC GCATGCTGGT GGTGTTGGAC
AACGCACGCT CGCCCGAACA GGTCGGGCAG TTGTTGCCCG AAGGGTCCGG AAGCCTGGCG
CTTGTTACCA GCAGAAACCG ACTAGCCAGC CTCGCCACCA CTCACGGCGC AGAGCACGTG
AATCTGGATA CTCTGAGCCC CAACGAATCT CTTGACTTGT TCACGAACAT TCTCGGCCCA
CGTGCGCTGG AGGACATCGA GTCGACTCGT CGCGTGTGTG CACTATGTGG GCAACTACCG
TTGGCGCTGA GGGTGGTGGC CGCAAACCTC ATTCAATATC CGAATAAGTC ACTCGCACAG
CTTGCAAACG AGCTTGAAGG TGGTTCGCGG CTATCGCAGC TAAGTATTGA AGGCGACAAC
ACCACAAATC TCACCGCCGT CTTCAATCTT TCGTATAGCG CGCTAAGTGA TGCCTCGCAA
AGCGTCTTCC AATACTTGGG CGTTATTCCT GGAGACGACT TCACATCCTC CCTCGCCGCG
GCAATTACGA AGACCTCAGA AACCACGGTC CAGTCTGCAT TCCAAGAACT ACAGTCAGCT
CATTTGCTTG AACAACCTCA AGCAGGTCGT TTCCGATTTC ACGACCTCGT GCGCGAATAC
ACACAGACTC TTGCATCCGT GAGACTTCCG GAGCAAGCTC GCAACGACGC CGTAGGCCGA
CTTGTCAATT GGTACGGCGA CTCATCCAAA CCTCGCTTTC ACGAGTTTAA CAATGTCGTC
GCGGCGTGCG CGGCTTTTCA ACACCATGGC GAACTGTGGA TTCTAGTGAG AGAGCTTGCG
CGTTTTGCGA ACGAAGGAGC CAATACAGAC ATAGCTCGCC AGATCGCGGA AAGCGCTCTT
CAAACTGAAG AACAAGATCG CAACATAGTG GCCAAGCTAA GCATACTTAA CTCCCTTGCA
GGAATATACG ATGCGGGGGG AGACACTACG GCCGCGCTTG AGACTTCCAG AACCGCAGCA
TCTCTACTAT CCCAAACGGA AGACGAAGAT CTTCATGGCC AGATCCTCGG TAACCTTGGC
AGGCTTCTCT ATGGCTATGG TGAATACTTT GAAGCCGAAC AATACCTGCG ACAAGCGCTT
CAAATTGCCG AGAAGAACAA TGATCAACAG CGAATGATGA TCCGTGCCTC CAACTTGGGA
AGAGCCTGCC GAGGAATGGG AAGGTATGAT GAAGCTGAAA GTCATTTGCT TCATGCACGT
CACATTGCCA CAAAGAACCA AAAACCGAAC CTCTTGGCAT CCATACTCAT GTCACTAGGG
AATCTATATT GCGACGCTGG TAAATACGAT GACGTCCTTG CCGTTTCGCA CCAATCTCTG
GCCATCTCGC GCGAAATTGG AGCAACAAGA ATCGAAGCCA TGGCGATGCT GTTCATTGGC
CAAGCACTTC ACGTAAAATG TAAGCCTCAG GAAGCGTTGA GCTACTTGGG AAGCGCGCTA
CAAATCTTCC GTGACGAACA TCGGGTGGGA ATGCAAATTG ATGCACTGCA GAAAGTTGCA
GAATTGCAGC TCGATATAGG TGACGTACCC AGGGCAAGAA AGCATCTGGC TATGACCAAC
ACACTCATTG ACTCGAATCG GTCAAGTAAC GCTTTGAACG GAACTCAAGC ACGAATCCTT
TGTCGAGTTC ACTGCGCAGC TAGGGAATTC GAGCAAGCAA GGGCGCACGG TGAACTGGCT
TGCCAGATCT TTCGCACGAG CAACCAAGAT CCACTCCGCC TAGCCCGCTC CCTGGACGCC
CTGGGCGAAG CCCACCAAGG CAACAACGAC CCCGCCGCCG CCCGCGACTG CTGGACCGAA
GCCCTCGCCA TCTTCACCGA CCTCGACGTC CCCGAAGCCC CCAAGCTCCG CGCCAAGCTC
GCCGCCCTCC CCACGCATCC CTGA
 
Protein sequence
MDFRILGALE VWNGSERIKI TGRLHPKILA GLLLSTGRTV SLPWLVDLLW NDDPPVTARR 
QVQNAVAALR RQLEVFHPGL VQQVGQEYRI NIAEENLDLR RFESATRKAR QLIETGRWQE
GFDGFRTALA LWRGPALSGF KGQVVDSAAA RLNDMYLSTY EEYAQAALVL GEPERVIPKL
QKLVGEHPLR QRLTARLMHA LHQAHRTPEA LRLFDGFRKA YADELGLDPG TELMNLHVRI
VQDDPVLLPQ PASSNSLLDS ASRQGTAVSA AKPQSPVPAQ LPTDVATFTG RNAHLAALDR
LRDSGVRTGI VTAITGIGGV GKTALAVHWG HARRENFPDG QLYINLRGFD ERKPLTPHEA
ISRLLRTLGQ PANTIPSDLE EAAGLYRSLL ADKRMLVVLD NARSPEQVGQ LLPEGSGSLA
LVTSRNRLAS LATTHGAEHV NLDTLSPNES LDLFTNILGP RALEDIESTR RVCALCGQLP
LALRVVAANL IQYPNKSLAQ LANELEGGSR LSQLSIEGDN TTNLTAVFNL SYSALSDASQ
SVFQYLGVIP GDDFTSSLAA AITKTSETTV QSAFQELQSA HLLEQPQAGR FRFHDLVREY
TQTLASVRLP EQARNDAVGR LVNWYGDSSK PRFHEFNNVV AACAAFQHHG ELWILVRELA
RFANEGANTD IARQIAESAL QTEEQDRNIV AKLSILNSLA GIYDAGGDTT AALETSRTAA
SLLSQTEDED LHGQILGNLG RLLYGYGEYF EAEQYLRQAL QIAEKNNDQQ RMMIRASNLG
RACRGMGRYD EAESHLLHAR HIATKNQKPN LLASILMSLG NLYCDAGKYD DVLAVSHQSL
AISREIGATR IEAMAMLFIG QALHVKCKPQ EALSYLGSAL QIFRDEHRVG MQIDALQKVA
ELQLDIGDVP RARKHLAMTN TLIDSNRSSN ALNGTQARIL CRVHCAAREF EQARAHGELA
CQIFRTSNQD PLRLARSLDA LGEAHQGNND PAAARDCWTE ALAIFTDLDV PEAPKLRAKL
AALPTHP