Gene Snas_0796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_0796 
Symbol 
ID8881980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp839868 
End bp842969 
Gene Length3102 bp 
Protein Length1033 aa 
Translation table11 
GC content71% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003509601 
Protein GI291298323 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.64254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAGTTC GGTTGCTCGG TCCCCTGGAG GTACTGCGGG ACGGCACACC GGTGCCGATC 
CGGGGCCGGA TCCACCCCCG GCTGCTGGCC GTCCTGGCGC TGCACGCCGG CAACGTCGTC
GCGCGACCGG CACTGATCAC CGGCGTGTGG GACTGTGAAC CACCGGCTTC GGCCAAGCGA
CAGATCCAGA ACGCCGTCTC GGCGTTGCGC CAGGTACTGG ACGGCACCCT CATCGAAACG
GTCGGCGACG GCTACCGGCT GCGCCTTGAC GACGTCACCG TCGACGCCCG CGACTTCGAG
ACGACCGTCG CCGAGGCGGC TCGGCAGCGT GCGGGCGGCG ACTCCGACGC GGCACTGACC
TCGTTGCGGG ACGCGTTGGC GCTGTGGCGC GGCGAGGCCC TCGCCGGGTT GCCGGGACGC
GAACTGCGCT CCCGGGCCCA GCGGCTGGAG GAAGCCTGGC TGGCCGCCCG GGAGGAACTG
ATCGATGTGG AGCTGGAACT CGGCGAACCG GTGCCGGTCG GCGAACTGGG CGAGCTGGCC
CGGCTGCATC CGTACCGGCA GCGCCTGACG GGCCTGTACA TGCGGGTGCT GCACCGGCAG
GGGCGCACGC CCGACGCGCT GCGGGTCTTC GACGACATCC GGCGACGGCT CCTGGACGAA
CTGGGTATCA CCGTCGGACC GGCACTGCGC GAGCTGCACA CCGCGATCCT GCGGGAGGAC
CCGCGGCTGG ACGCGGCGGC ACCGGTGGCC GTGCCCGCGT CCCCGGCCGC CGCCCCGAGG
CTGGTGCCCG CCCAGCTGCC CGCCGCCATC GGCGAGTTCG TCGGCCGACA GGAACAACTG
GCGCGGCTGG ACGCGCTGAT CGCCAAGGGA GACAACACCG CCCTGCTGTC CACGGTGTCG
GGAACCGGCG GCGCCGGGAA GACGGCGCTG GCGATCCACT GGGCGCACCA CAACCGGGAC
CGGTTCCCCG ACGGACAGCT GTACGTCAAC CTGCGCGCGT TCGACCGCGC CGAACCGCTG
ACCCCGTACG ACGCCCTGAC CCGGTTCCTC GCGGCACTGG GGGTGACCGG CGGCGCGGTC
CCGTCCGATG TGGAAGCCGC CGCCTCGCTG TACCGATCGC TGCTGGACGG GCGCCGGATG
CTGGTCCTGC TCGACAACGC CGTCGACCTG GAACAGGTGC GCCCGCTGCT TCCCGGCAGC
GGCGGCAACG TCACGCTGGT GACCAGCCGC AACCGGATCA CGGGTCTGAC GGCGCTGCAC
GGCGCCGAGC TGATCGGTGT GGACACCATG TCGCGGACGG AATCCCTGGA GGTGCTGGGC
AACCTGGTGG GTGCGCGGCG GTTGCACGCC GACGCTGCGG CGGCACATCG GCTCGCGGAA
CTGTGCGCGG ACCTGCCGTT GGCGTTGCGG ATCGCGGGGG CGAACCTCGC CGTGAACTCG
CATGTGGAAC TCAGCGAGTA CGTCCGGGAA CTGGCGGGTC CCAACCGGCT GGAGTTGCTG
TCGATCGAGG GGGACCCGGA CTCCGCGGTG GCGTCGGTGT TCGCGCAGTC CTTTCGGGCG
CTGTCACCCG AGGCTCAGCG GCTGTTCGCG CGGCTGGGCT GGATACCCGG TGACGATTTC
GGCGAGGAGC TCGCGATCGC CGTCGCGGAC CTGCCGGAGG CGGACTGTCG TCGGTTGACG
CGCACCCTGG AGACCGGCAA TCTCGTCGAG CGGTATCAGG CGCGGCGGTT CCGGTTCCAC
GACCTGGTCC GGGAGTACGC GCGGCAGCAG GCGGAGAACA CACTGGACGA CGCCGAGCGC
GACGCGGTCG CGGATCGGGT CGTCCAGTGG TACTACGACA ACCGCCGGAC GCCCCGGGCC
GAGGACTACC CCAATCTCGT CGCGACGTTC CAGACCTGGC GGCACCACCC CCGGTGCCTG
ATGCTGCCGA GTGTGTTCGC GCAGCACGTC AACGCCGGTC GCGACCTCGC CGGAATGCGA
GCCCACCTCG ACGCCGCGTA CGCGCTGGCC AAGAGCCTCG GCGAACCACT GTCCGTGCAT
CGGGCCGCCG ACGCCCTGGC CGTGCTGGCC TGGGCGAAGG GCGACACCGG CACCGCCGTC
GAGTTCGGGC TCGAATCCCT GGCCAACGCC ATGAGCCACG ACGGGGACGC CCTGGGCACC
GCCCGGGCGA ATCTGGGACT GCACTACGCC GCCCACGGCG ACTACCGCAA AGCCGAACCG
CTGCAACGGG AAGCGCTGGA GGTCGCCGTC AGCCAGGCGG CGGCGGGCGC GCCGAACCGG
GGCCTGAACC TGGTCAATCT CTACTGTGGC CTGGGGCGGT ACGCGGACGC CACCACGCTC
GTCGAGCGGA TCCGGCACAT GCCCGGTATC GACGCGAACG ATCTCGTCCT CGGGGCCGTC
CACCGGATCA AGGCCCAGAT CCATATGAAT CTGGGCCGGT ACGGCGAAGC GCTCGCCGAG
ATCGAGGCGG GCCTGCGACT CGCCGGTCAA CGCTCGCAGC CGCGCAGCGA GAGCATCGCG
TTGCGACTGC GGGCCGAGAT CCGGCGCCGG GCCGGTGATC TCACCAACGC CCTGGCCGAC
GCGACCCGCG CGCTGGAGCT GGCCCGGCGA CACCAGCTGT CCAAACAGGA GAAGGACGCC
GTTTTCGAGC TCGCGGCACT GCACTGCGCG TCGGGCACGG CACAGAAGGC GGAGGAACTG
GTGCCGTCGC TGGCGGAGAC GGCTCGCGAC TCCAGCGGGC CACAGCGCGC GGAGGCGTTG
GCGCGACTGG CGGAGCTCCA CTTCCGGCGG GGTCGTCACG CCGAGGCGAT CGAGTGCGGC
ATGGCGGCGA AGGAGCTGTT CGCGAGCATT CCCCGGCCGC TGGGGCTGGC TCGCGTGTTG
CGCGATCTGG CCGAGATCCA CGACGACCTC GCCGAGACCG AGACGGCGCG CGGTTTCCGG
GCCGAGGCCC TGGACCTCTT CACCCGCCTG AGAGTCCCCG AGGCCGAAGA ACTCCGCCGC
CAGCTCGACA GCCCGCCGTC CGACTTACCT CAGACCCCGC CGTCGTCGAC ACCCGTGCGG
CGGTTGCGGT CCTCCTCCAC CAGCTTGTAC AAGGTGGACT GA
 
Protein sequence
MEVRLLGPLE VLRDGTPVPI RGRIHPRLLA VLALHAGNVV ARPALITGVW DCEPPASAKR 
QIQNAVSALR QVLDGTLIET VGDGYRLRLD DVTVDARDFE TTVAEAARQR AGGDSDAALT
SLRDALALWR GEALAGLPGR ELRSRAQRLE EAWLAAREEL IDVELELGEP VPVGELGELA
RLHPYRQRLT GLYMRVLHRQ GRTPDALRVF DDIRRRLLDE LGITVGPALR ELHTAILRED
PRLDAAAPVA VPASPAAAPR LVPAQLPAAI GEFVGRQEQL ARLDALIAKG DNTALLSTVS
GTGGAGKTAL AIHWAHHNRD RFPDGQLYVN LRAFDRAEPL TPYDALTRFL AALGVTGGAV
PSDVEAAASL YRSLLDGRRM LVLLDNAVDL EQVRPLLPGS GGNVTLVTSR NRITGLTALH
GAELIGVDTM SRTESLEVLG NLVGARRLHA DAAAAHRLAE LCADLPLALR IAGANLAVNS
HVELSEYVRE LAGPNRLELL SIEGDPDSAV ASVFAQSFRA LSPEAQRLFA RLGWIPGDDF
GEELAIAVAD LPEADCRRLT RTLETGNLVE RYQARRFRFH DLVREYARQQ AENTLDDAER
DAVADRVVQW YYDNRRTPRA EDYPNLVATF QTWRHHPRCL MLPSVFAQHV NAGRDLAGMR
AHLDAAYALA KSLGEPLSVH RAADALAVLA WAKGDTGTAV EFGLESLANA MSHDGDALGT
ARANLGLHYA AHGDYRKAEP LQREALEVAV SQAAAGAPNR GLNLVNLYCG LGRYADATTL
VERIRHMPGI DANDLVLGAV HRIKAQIHMN LGRYGEALAE IEAGLRLAGQ RSQPRSESIA
LRLRAEIRRR AGDLTNALAD ATRALELARR HQLSKQEKDA VFELAALHCA SGTAQKAEEL
VPSLAETARD SSGPQRAEAL ARLAELHFRR GRHAEAIECG MAAKELFASI PRPLGLARVL
RDLAEIHDDL AETETARGFR AEALDLFTRL RVPEAEELRR QLDSPPSDLP QTPPSSTPVR
RLRSSSTSLY KVD