Gene Snas_0797 details

Gene Information

Locus tag: Snas_0797
Symbol:
ID: 8881981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 843010
End bp: 846015
Gene Length: 3006 bp
Protein Length: 1001 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: transcriptional regulator, SARP family
Protein accession: YP_003509602
Protein GI: 291298324
COG category:
COG ID:
TIGRFAM ID:
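The length fields above agree with the coordinates, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Protein length in aa, assuming one trailing stop codon (not translated)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(843010, 846015)
print(length, protein_length(length))  # prints: 3006 1001
```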


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGAGTTTC GGATTCTCGG TGCGCCCGAG GTGATACGCG ACGGTGCGCC GCTCGCTATC 
CGCGGCCGCA TCGCGCCGAG GCTGCTGGCC GTCCTGGCCC TGGAGGCCGG ACACGTGGTA
CCGATATCCA CTTTGGTGGA AGCCCTGTGG GAGGACCCGC CGGTCACCGC CCGGCGGCAG
GTGCAGAACA CCGTCTCGGC GCTGCGCGCC GTCCTGGGCG AGCGGACCGT CGAGGCCGTG
GCCGAGGGCT ACCGGCTCGC GGTTCCCGCC GAGACCGTCG ACGCCGGACG ATTCGCCGCC
GGGGTGCGGC GGGCCCGGCA GCTGCGGGAG GACGGCGACC CCGTCGCGGC CCTGGAACGA
CTGCGCGAGG CACTGGCGCT GTGGCGCGGA CACGCCCTGG CGGGCATGGG CGGGCAGGTG
CTGGAACGCG GCGCCCGGCG GCTCGGCGAG GACCGGCTGG CCGCCTTCGA GGAACGCGTC
GAACTCGAAC TCGAACTGGG ACAACGGGTC CCGGTCGGTG AACTGCGGCA GCTGCTCACC
GAGAACCCGT ACCGGCAGCG GCTGGCCGCG CTGTTGATGC TGGCGCTGTA CCGCGAGGGC
CGGGCGCCCG AGGCCCTGGA GGTCCACACC CGGATGCGGC AGCGGCTCAG CCGCGACCTG
GGCGTCGAAC CCGGTCCGGC GCTGCGGGAA CGCTACGCCG CGATCCTGCG CGAGGACCCG
GAACTGGACG TCACCGGCAC CCCGGCACCG GCGTCCCGCG GCGCCACCCG GCCGGAGTTC
GCCCCGGCGC AGCTCCCGGC GGCGCTGGCC GGGTTCACCG GCCGCACCTC ACAGCTGCGG
GCCCTGGACG CGCTCGGCGA CGACTCGGTC CTGGCGACCA TCACCGGTTG CGGCGGCAGC
GGCAAGACCG CGCTGGCCGT CCACTGGGCC CACCGCAACC GCGACCGGTT CCCCGACGGC
CAGCTGTACC TCAACCTGCG CGGGTTCGAC GCCGACGCCC CGCTGTCGCC GCAGGACGCG
CTGACCCGGC TGCTGCCCGC GCTGGGGCAA CCCGCCGACG CCGTCCCCGC CGAGCTGGAC
GCCGCCGCCG CGCTGTTCCG GTCGCTGTTG ACCGGGCGCC GGATGCTGCT GCTGTTGGAC
AACGCCCGCG ACGCGGCCCA GGTGGAACCG TTGCTGCCCA ACGAACCCGG GACCGTGACC
CTGGTGACCA GCCGCCATCG GCTCACCGAG CTGGCCGCCC ACGGCGCCAC CGCCATCTCG
CTGGACACGC TCGACGAGAC CGACGCGCTG GCCCTGTTGT CCACACTGGT CCCCGACGAC
CGGCTGGCCG AGGACCCGGC GGCCACCGCC GAACTCGTCT CCCGGTGCGG CGGTCTGCCG
CTGGCGCTGC GCATCGTCGG CGCCAACCTC GCCGGTCGTC CCTATTCGAC GGTCGCGGAG
TTCGCGGCCG AACACTCCGG TTCCGACCGG CTCGGCCTGC TCACCGTCGA CGGCGATCCC
AACGCCACCG TCGCCACCGT CTTCGAACGC TCCGCGCGCG CCCTCGACGA GGACACCCGG
CGGCTGTTCC TGCGGCTGGG CCTCATCCCC GGCGACGAGA TCCCCGAGGA CCTGTCGCGC
GGCACCGCGG ACCTGTCCGA GACCGTCAAC CGGGAACTGT TGGGACGGCT GGAAAGCGCC
CACCTCATCG AACGGCATCG CCCCGGGCTG TACCGGTTCC ACGACCTGGT GCGGCTGTAC
GCCCGCCAGC AGGCCGAATC CACACTCGAC CCGGCCGAGG CGGCCGAGGC CCGCACCGCC
GCGATCGACT GGTACGCCAG CTCCTTCGAG CAGGTCAGCG CCGACCCCGG CAACGTCATC
GCGGCGTTCA AGGCCTGGCA GGACCACCCC GACGCCTGGC GGCTGGCCCG GATGCTGCCG
GACTTCCTGC GGTACGGGCA CAGCCTGGCG GGCCTGCGTC CGCACGTCGA GACCGGACTG
GCGCTCGCCA AGGACTCCGG GGACGCCGGG GCGCTGAGCC AGATGTACGA CGCGATGGCC
TTCGTCCACT CCAACTCCGG CGACCACCTC ACCGCGCTCA CCTGTGGACG GAAAGCGGTC
GCGATCGCCC GGGGGCTTCC CGACGGCGAC GCCGACGGCC GGTTGCGCAG CGGGCTGGCG
GCGCTGCTGC TGTTCAACTC CTACTACGAG GAGTCGGCGG CCCTGTCCCG CGAGACCCTC
GCCATCGCCG AGGCCGACGG CGACATCGGC CGGATCTTCC ACGACCGCAT CTTCCTGGGC
GCCGCCTATC GCAGCAGCGG CCGGTTCGCC GATGCCGAGA CCTGCTTCGT TCAGGCGATG
CGGCTCGCCG ACACCTGCGA CGACGACGAT CCCCGCCGGG TCACCGCCCG GTTCAAGCTC
GCCCGGCTGT ACAACGACAT GGATCGCGTC GAGGCGGCCT GGCCGCTGCT GGAGGCGATC
GGGCAATGGT GGCAGCGCAC CGGTTCCGAC TTCATCCGCG AACGGGTGCT GTGGCTTCGG
GGCCAGTTGC AGCTGCGTTC CGGCCGCGTC GAGGCGGCCG AGCAGGACCT CGCCGACAGC
TGGGAGCTGT CGCACCGCAA CGGCATCCTG GTGTGGGCCT GGTACGCGCG GTTCGCGCAG
ATCGAACTGT GCTGCCAGCT CGGCGACCAC GAACGCGGAC TGGCCTACGT CGAGGAGCTC
GCCGAGTCCG GCGCCGACAC CTTCGAACGC GACGACCGGG CCCAGTGGGC GGCCCTCAGC
GCCAAGGTCC ACATCGGCCT CGGTGACTAC CGGGCGGCGA TCGCGGCGGC CGAACAGGCC
CGCACCGTCT TCACCGTCAC CAAGAACCCG CTGCGGCTGG CCCGCTGTCT GGTCACCCTC
GCCGAGGCCC ACGAACGACT GGGCGACACC GAGACCGCCG AACGGCACCG CGACGAGGCC
CGCCGCGGCT TCGCCGCGCT GGGTCTGTCC GAAAAGGCCA CCAGGATCGG CATGTCGACC
GGCTAG
 
Protein sequence
MEFRILGAPE VIRDGAPLAI RGRIAPRLLA VLALEAGHVV PISTLVEALW EDPPVTARRQ 
VQNTVSALRA VLGERTVEAV AEGYRLAVPA ETVDAGRFAA GVRRARQLRE DGDPVAALER
LREALALWRG HALAGMGGQV LERGARRLGE DRLAAFEERV ELELELGQRV PVGELRQLLT
ENPYRQRLAA LLMLALYREG RAPEALEVHT RMRQRLSRDL GVEPGPALRE RYAAILREDP
ELDVTGTPAP ASRGATRPEF APAQLPAALA GFTGRTSQLR ALDALGDDSV LATITGCGGS
GKTALAVHWA HRNRDRFPDG QLYLNLRGFD ADAPLSPQDA LTRLLPALGQ PADAVPAELD
AAAALFRSLL TGRRMLLLLD NARDAAQVEP LLPNEPGTVT LVTSRHRLTE LAAHGATAIS
LDTLDETDAL ALLSTLVPDD RLAEDPAATA ELVSRCGGLP LALRIVGANL AGRPYSTVAE
FAAEHSGSDR LGLLTVDGDP NATVATVFER SARALDEDTR RLFLRLGLIP GDEIPEDLSR
GTADLSETVN RELLGRLESA HLIERHRPGL YRFHDLVRLY ARQQAESTLD PAEAAEARTA
AIDWYASSFE QVSADPGNVI AAFKAWQDHP DAWRLARMLP DFLRYGHSLA GLRPHVETGL
ALAKDSGDAG ALSQMYDAMA FVHSNSGDHL TALTCGRKAV AIARGLPDGD ADGRLRSGLA
ALLLFNSYYE ESAALSRETL AIAEADGDIG RIFHDRIFLG AAYRSSGRFA DAETCFVQAM
RLADTCDDDD PRRVTARFKL ARLYNDMDRV EAAWPLLEAI GQWWQRTGSD FIRERVLWLR
GQLQLRSGRV EAAEQDLADS WELSHRNGIL VWAWYARFAQ IELCCQLGDH ERGLAYVEEL
AESGADTFER DDRAQWAALS AKVHIGLGDY RAAIAAAEQA RTVFTVTKNP LRLARCLVTL
AEAHERLGDT ETAERHRDEA RRGFAALGLS EKATRIGMST G
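
Both sequences are printed in blocks of 10 residues, so the spaces and line breaks need to be stripped before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input; running it on the full sequence should give the 3006 bp listed in this record.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGGAGTTTC GGATTCTCGG TGCGCCCGAG GTGATACGCG ACGGTGCGCC GCTCGCTATC"
dna = normalize_sequence(first_line)
print(len(dna))          # prints: 60
print(gc_fraction(dna))  # GC fraction of this one line only, not the whole gene
```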