Gene Snas_1973 details


Gene Information

Locus tag: Snas_1973
Symbol:
ID: 8883165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 2092183
End bp: 2095209
Gene Length: 3027 bp
Protein Length: 1008 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003510762
Protein GI: 291299484
COG category:
COG ID:
TIGRFAM ID:
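
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length(2092183, 2095209)
print(length)                           # 3027
print(expected_protein_length(length))  # 1008
```

Under these assumptions both results agree with the Gene Length (3027 bp) and Protein Length (1008 aa) listed in the record.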


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.458427
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTTGTCC GGCTCCTCGG TGGGGTGGAG GTCGCGGGCG CTGAGGGCCA GTGGCGGACG 
GTGCCCGGCA AACGCGCCGG TGTCCTGGCG GTCCTGGCCG TCGCGGCGGG CGAACCGGTG
CCCGCCGACG AACTGGTGCA CCGGGTCTGG GGCGCCTCGG CCGTCGCTCT GGCCGACGGC
GCGGTCTACC CGCACATCAC CCGGCTGCGG GCGACGCTGT CACCGGCCGG ACTCACCATC
TCCCGGGCTC GCGGCGGCTA CGTCCTCGAC CTCGACGCCG ACCAGGTCGA TCTACTGACC
ATACGGGCGC TGGCCGTGCG GGCCCGCGAG GCCGCCGAGA CGGGGGAGGA CGCGAAGGCG
CTCACCGCCT GGCAACGGGC CACCGCGCTG TTGCGCGGCG AGGCGCTGGC CGGGGTCGAT
GGCGACTGGG CCGAGCGGTT CCGGCAGAGC TTCGGCGGCG AGGCGCGGTC GCTGCTGGCC
GAACGCTACG GCTGGGAGCT GCGCTGGGGC CGCCACCACG CCGTCGTCGA CGAACTGCTG
GCCGCCATGG TCCGGTACCC GACCTGCGAG AGCCTGGCCG AGCACCTGAT GCTGGCCCTG
TACCGGTGCG GTCGGCAGGC CGAGGCACTG GCGGTCTTCG ATCGCACCGC GCGGCTGCTG
CGGGACCGGC TCGGCGTCGA CCCTTCCGAG TCGCTGCGAA GGCTGCACCG CCGGATCCTC
AACCAGGATG CCGGACTCGC CGCCCCCGAA CCGATGGCGT TCAAGACAAC CACGTCAGAA
ACAACCACAG TAGACGATGC CCCGGCAGAC ACCGTTGTTC CGGCGCAATT GCCCGCCCCA
CCACGCGCCT TCGTCGGACG CGAATCCGAA CTGGCGGCCC TCAAACGCGA CGCGGACCGC
ACCTCGGTCC TGATCCTGGA CGGCATGCCC GGCGTCGGCA AGACCGCGAC CGCGGTACGT
CTCGCCACCG AACTGGCCCA GCGATACCCC GACGGACAGC TGTTCCTCGA CCTGCACGGC
TACTCCGGCG ACGTCCCGGC GGTCGAACCC GCCGAGGCAC TGGTGCGGCT GTTGCGCGGC
CTCGGCGCCG AGGCGGACCA GATCCCGACC GGCCTTGACG AACGCTCAGC GGAGCTGCGC
ACCCGGCTGG CGGGCCGTCG GGTGCTGATC CTGCTCGACA ACGCGGCCAC CAGCGCCCAG
GTCCGGCCGC TGCTGCCGGG CGGGACCGAC TGCCTCACGA TCATCACCAG CCGCCGCCGG
TTGCCCGACC TTCTGGAGGC GGCACCGGCG TCGCTCGACG TCCTGGAACC GGAGGAGGCG
GTGCGGCTGC TGGTCGCCGC GGTGGACGAC CCGAAGCGGG TCGCCGAGGA CTCCGCCGAC
ACCGCCGCGA TCGTCGAGGT CGCCGGTCGG CTGCCACTGG CGATCCGGCT GATCGCGGCC
CGGCTGCGCA ACCGCCGCAA CTGGACCGCC GGGTTCATGC TGGGACGGTT GCGGGACGAG
ACGATCCTGA GTGAACTGTC CGCACAGGAC GTCGCTGTGG CTTCGGCGTT CTCGATGTCC
TATGCCGAAC TGGACGATGG TCATCGCCGG ATGTTCCGGC TGCTGGGCCT GTTTCCCGGG
CAGGACTTCG ATGCCACCTT CGCGGCGGCC TTGGCCGACG TCGCGCCGGA GGCGGCCGAT
CGGATGCTGG AGGACCTGGT CGACGCGCAC CTGTTGCGCA GCGCCGAGCC GGGACGGTAC
CGGTTCCACG ACCTGATGCG CCACTACGCG GCGACCATCG CGACCGAGAC CGAGACGGCC
GCCGAAGTCG AGGCCGCTCG GACGCGCTTG TACGACACCG CCGCGGTCAT GCTGCGTCAC
GCTATTTCGC AGTACGACTC CTATGTGGGG TATTACCCAA GATTGATCGA AGCCGTCGAC
CCGCCCGAGT CCCCGTGGCG AACGCGGGCC GAGGCGTCCG CGTGGTTCGG CACCGAACTG
CCCAACCTGA AGGCGATGCT GCGATCCGCC AACGAGCACC GGATGGACCG TCACTGCGCG
GAGATGGCGG CGGCGATGTC GGCCTACTAC AGCCACCATC ATGCTGACCA CGACCTGGAC
CGTATCGGGG AATGGGGTCT GGGCAGTGCC CGCCGGATCG GCGATCGGGA GTGTGAGGGC
TACTTTCTCA ACAAGCTCGC AGGGGCTTAT CAGGCTTGGG GGGACGTCGG CATGGCCGAA
CGGCTCCACG AACAGGCGTT GGCCGTTCGG CGCGAGCTCG GCGACGCCCG CTATATCGTC
TCCAGCCTGT CCAACCTCGC TCTCGTGCAT GGGCACTCCG GCGAATATGA CCGGGCTGTC
GAGTTGCGCG GCGAGGCGCT CGCGCTGGCC GCCGACAATG GACTCGTCGA ACTCGAACGG
CTCATCTGCG TCTACATGGC CGGTTCACTG TGCGAGAGCG GCCGAATCGC CGAAGCCCGG
CTTCAGTTGG AGCGGGCCGG GGAACTGCTG ACCGGCTCCG ATGACGCGTT CGCTCGGATG
AGTCTCGACG CGCACTGGGG AAACGTGAAA CGCGGTGAGG GCGATCCCGT CGAGGCCCAA
CGGCTTCATG AACGGGCGTT GGCCGCGTAC GAGACCCACG GTCACTCGGT GGGACAGGTT
CAGATGCACT CCGAGATCGC CAGCGACCTG ATCGCGCGGG AACTGTGGGA CGAGGCGTGG
AAGTCCTGCG TCACGTCCAT GGAATTGCTG GGCGACACCG AACGTCCCGA GCTGCGGGCG
GAGAACCTGC TGACGATGGC AGAGGTGTGC CTGGCACGCG GTCACGACGA GGACGCGTTC
GAACAGCTCA GCGCGGTGGC GGACCTCGCC GAGAGCCGCG ATTCGGCCGT GCTGCGCGCC
AAGGCCGACT GGGGCCTGGC CCGGGTCGCG TCCGCCAAGG GGGACAACGA AAACGCCGTC
CAGCACGCCG AACGAGCCCT CGCCTACTAC TCCCGCTTCG ACACACCCCG CACCGAAGCC
ATCCGCCGCT TCCTGTCCCA GCCCTGA
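
The GC content field reported for this gene could be recomputed from the gene sequence above along the following lines. This is only a sketch: the helper name `gc_content` is hypothetical, and the rounding rule the source page uses to arrive at 70% is not stated, so the function returns an unrounded percentage.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First 10-base block of the gene sequence, used only as a small worked example.
first_block = "GTGCTTGTCC"
print(gc_content(first_block))  # 60.0
```

Passing the full 3027-base gene sequence instead of `first_block` would give the value to compare with the GC content field.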
 
Protein sequence
MLVRLLGGVE VAGAEGQWRT VPGKRAGVLA VLAVAAGEPV PADELVHRVW GASAVALADG 
AVYPHITRLR ATLSPAGLTI SRARGGYVLD LDADQVDLLT IRALAVRARE AAETGEDAKA
LTAWQRATAL LRGEALAGVD GDWAERFRQS FGGEARSLLA ERYGWELRWG RHHAVVDELL
AAMVRYPTCE SLAEHLMLAL YRCGRQAEAL AVFDRTARLL RDRLGVDPSE SLRRLHRRIL
NQDAGLAAPE PMAFKTTTSE TTTVDDAPAD TVVPAQLPAP PRAFVGRESE LAALKRDADR
TSVLILDGMP GVGKTATAVR LATELAQRYP DGQLFLDLHG YSGDVPAVEP AEALVRLLRG
LGAEADQIPT GLDERSAELR TRLAGRRVLI LLDNAATSAQ VRPLLPGGTD CLTIITSRRR
LPDLLEAAPA SLDVLEPEEA VRLLVAAVDD PKRVAEDSAD TAAIVEVAGR LPLAIRLIAA
RLRNRRNWTA GFMLGRLRDE TILSELSAQD VAVASAFSMS YAELDDGHRR MFRLLGLFPG
QDFDATFAAA LADVAPEAAD RMLEDLVDAH LLRSAEPGRY RFHDLMRHYA ATIATETETA
AEVEAARTRL YDTAAVMLRH AISQYDSYVG YYPRLIEAVD PPESPWRTRA EASAWFGTEL
PNLKAMLRSA NEHRMDRHCA EMAAAMSAYY SHHHADHDLD RIGEWGLGSA RRIGDRECEG
YFLNKLAGAY QAWGDVGMAE RLHEQALAVR RELGDARYIV SSLSNLALVH GHSGEYDRAV
ELRGEALALA ADNGLVELER LICVYMAGSL CESGRIAEAR LQLERAGELL TGSDDAFARM
SLDAHWGNVK RGEGDPVEAQ RLHERALAAY ETHGHSVGQV QMHSEIASDL IARELWDEAW
KSCVTSMELL GDTERPELRA ENLLTMAEVC LARGHDEDAF EQLSAVADLA ESRDSAVLRA
KADWGLARVA SAKGDNENAV QHAERALAYY SRFDTPRTEA IRRFLSQP
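
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method used by the source database: it uses the codon assignments of the standard code (which translation table 11 shares), treats a leading ATG, GTG, or TTG as an initiator read as methionine (a subset of the table 11 start codons), and stops at the first stop codon. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS read 5' to 3' in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
head = "GTGCTTGTCC GGCTCCTCGG TGGGGTGGAG GTCGCGGGCG CTGAGGGCCA GTGGCGGACG"
print(translate_cds(head))  # MLVRLLGGVEVAGAEGQWRT

# Last 27 bases of the gene sequence, ending in the TGA stop codon.
tail = "ATCCGCCGCT TCCTGTCCCA GCCCTGA"
print(translate_cds(tail))  # IRRFLSQP
```

The two fragments reproduce the first 20 and last 8 residues of the protein sequence listed above, including the GTG start codon being read as M.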