Gene Snas_3989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3989 
Symbol 
ID8885190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4257990 
End bp4261001 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content64% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003512734 
Protein GI291301456 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0138619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.888144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGTTCT CGATACTCGG ACCGTTGCGA GTCCAGCACG GCGGCCGTTC CGTCCCGATC 
TCCGGTCGCC ACGCTCCCAA GCTGCTTGCG GTATTGCTGG TCGACGCCGG GCGTCTCGTC
ACCGTCACCC GTCTGATCGA AACGCTGTGG GCCGAGGATC CACCCGCCAC AGCGAAGCGA
CAGGTCCAAA ACGTTATGGC GGCGTTGCGC CGCGCGTTGC CGGAACCGGA TGCCATCGAG
GCGACCGGCA ACGGCTACCG ATTGGAGCTT GAATCGGCCA CTGTGGATCT GCGGGAGTTT
GAGCGATTGC GGTCGCTCGC CCGAGCGGCC GAACCACGGG AGGCATTGCG ATATCTGCGA
GAAGCGTTGC GACTGTGGCG GGGCGAGGCC CTCGCGGGCT TGACGGGGAG CAGCATCGAA
GCGGCGGCGG CGAAGCTCAA CGAGGCGAGG ATCTCGGCCA TCGAGGAACG CGTCGAGCTC
GAGCTGGACC TGGAACGACG TTTGGACGTC GTCGGCGAAC TGCGGGAGCT GCTGGTGGCG
CACCCGTTCC GGCAACGGTT GACGGGACAA CTGATGCGGG CGCTTTGGTA CGCGGGGAGC
AAGTCAGAGG CGTTGGAGGT CTACCGGAAA CTGCGGAAAC GATTGGCCGA GGGGCTGGGT
CTGGATCCGG ACGCCGAACT GGCGCGGTTG CACTCGGCGA TCCTGCGGGG TGAGCCCGAA
ACCCCCGCTG CCGACCGGAC CACAGCGGCG GTGCGGGAGC GGACGGTCCC GGCACAGCTG
CCCGCCGCTC CGTCGACGTT CACGGGTCGG CGTAAGCAGG TGGTGGCGCT GGACGAGCTA
CTGGAGCAGG GGCGGAACAC GGCGGTGGTG TCAGCGATCG CGGGGATGGG TGGTGCGGGG
AAGACGGCGC TGGCGTTGTA TTGGGGGCAC CGGGTGCGGG AGCGGTTTCC AGACGGGCAG
TTGTACATCA ACTTGCGCGG GTACGACGAA GCGAAACCGG TCGCCGCGAT CGACGCGCTG
GGGCGGTTTC TGGTGGCACT GGGGCAAACC AGTACGACTG TCCCATCCGA TGTGGATGAG
GCGGCGGCAT TGTTCCGGTC GCTGCTGTCC GAGCGGCGGA TGCTCGTCAT TCTCGACAAC
GCCCGCGAGG CAGCCCAGGT CCGACCGCTG CTGCCGGGCG GCGCCGGGAA CCTGGCGATC
GTGACCAGTC GGGATCGGCT GGCAAGCCTC ACCGCACTTG AAGGCGCTGA GCCGATTCGG
CTGGATACCT TGAGCCAGAC GGAGTCACTG GAGTTGCTGG CGAACATCGT GGGAGCCGGG
CGGCTGGACA CTGACCCGGA AGCGGCACAT CGGATCGCCG AGCTGTGCGG ACGGCTGCCG
TTGGCCTTGC GGATCGCGGG GGCGAGCTTG GCGGCGCAGC CAGACCTGGC ACTCGGTGAG
TTCACCGATG TACTGGGCGG GCCGGATCGG TTGCGGCGAT TGGCGCTTGA CGGGGACAAG
CTCGCCAGCG TGTCCAACGT GCTCGAATTG TCCGTGGCAG CACTCGATGA CACGAGCCGC
GAACTGCTGC TGAAGCTGGC GCAGATCCTG GGCGACGACT TCTGTCATGG ACTGGCGGTC
CACCTGTCCG AACTGGACGA AACCGGGGCC GGGCGAAATC TGGCGGCCCT GGAGGCCGCA
CATCTCATTG AGCAGCACAT CCCGTCCCGG TACCGTATTC ACGACTTGAC TCGCGATTAC
ATGAGACAAC AGGGTCGCCG GACCTTCGAC GACGTTCGTC TTCACGACAT TCAAACCTGT
TTCATCACTT GGCATTACGC TGTCCGGCGC GAAATTTCCG TCACAGAGGC ATCGAATGTC
GTTTCCGCGT TCAACGCGTG GCGAGAACAT CCTGAGATCT GGAAACTCGC GACGATGTTC
TCAGTCTTCA GCGGGACCGA GTACAAGCCC ACTCAGCTAC TTCAGTTGGC CGAGTGGGCA
CTCGTCGCGA GAAAAGACGG CATGGATCCC ATGGGCAACG CGTATCTCCT CATGGAGATC
GCGATTCTGT ATCGCGCCAT GGGAAACCGG AATCTGGCGG TCCGCGAAGC AGAGAAGGCC
ATCGATGTCG TTCACCGAGC GGGTCTGGAG GATCCCGACG GGCGTTTCCG GGGGAATCTG
GCATTGATGT ACATGGAGGT GGGCCAGTAC ATCCAAGCCG AGACGCTCAT GCGAGACGCG
CTGCGATCGG CGCGAGAGTC AGGTGACGCC CAGAACATCA AGTCCTGTTC GTCGTCCCTG
GCCGCCATCT GCCGCAGGTT GGGTAAGTTC GCCGACGCCG AGACACTACT CAACGGTGTC
ATTGACAATC CGGAGCTGCC CACACAACCC ACTCTGGACA TTACCGCGAA GGCCCAACTA
GGCGCCCTCT ATCTGGACAT CGGCCGCCTT ACCGAGGCTC TCACGGTCCT TGACGAGGTG
CACTCGCTGC CGCCGGACGT GGGAGGAATG CGATCACGCA CGTTCTCGCG AATCCTGCGT
ACTGAGGCTC TCTGCGCGCT GGGCCGGTAC GAATCCGCGC GTCCAGAGTT GACCGACGTA
CTCGCTGTCG CCGTTCGCAT GGACCTCACC GGGGCGGTCA TGCTGGCTAC GATCCAGTTG
GCGCACCTCC ACAGCGATTC GGGAGATCAA CAAGCCGCGC TACGAGCCCT GGACACACTC
GGCCCCCATC ACCTCAACGA ATCCGACCAG AAATTCGCCG CTGAGATCGC GCGGCTGCGG
TGCATCACCA ACACCCGGCT TCGACGGTTC GCGAAAGCCG TCACGTTCGG CGGTTACGCG
TGTGACCGCT ACGCGAACAT GTCCTATCCG CTGATGCACG CCAGGTCGCT GGCGGCGCTG
GCCAACGCGT ACGAGGGTGC AAACAATCCC GCCCAAACTA CGGCCTGTCG CGCGCAGGCC
TTCGATATCT TCTCCCGGCT GGGCGTCCCC GAAGCGGACG AACTCCGCGA GCTCCTCGGC
CCCGCCCCGT AA
 
Protein sequence
MEFSILGPLR VQHGGRSVPI SGRHAPKLLA VLLVDAGRLV TVTRLIETLW AEDPPATAKR 
QVQNVMAALR RALPEPDAIE ATGNGYRLEL ESATVDLREF ERLRSLARAA EPREALRYLR
EALRLWRGEA LAGLTGSSIE AAAAKLNEAR ISAIEERVEL ELDLERRLDV VGELRELLVA
HPFRQRLTGQ LMRALWYAGS KSEALEVYRK LRKRLAEGLG LDPDAELARL HSAILRGEPE
TPAADRTTAA VRERTVPAQL PAAPSTFTGR RKQVVALDEL LEQGRNTAVV SAIAGMGGAG
KTALALYWGH RVRERFPDGQ LYINLRGYDE AKPVAAIDAL GRFLVALGQT STTVPSDVDE
AAALFRSLLS ERRMLVILDN AREAAQVRPL LPGGAGNLAI VTSRDRLASL TALEGAEPIR
LDTLSQTESL ELLANIVGAG RLDTDPEAAH RIAELCGRLP LALRIAGASL AAQPDLALGE
FTDVLGGPDR LRRLALDGDK LASVSNVLEL SVAALDDTSR ELLLKLAQIL GDDFCHGLAV
HLSELDETGA GRNLAALEAA HLIEQHIPSR YRIHDLTRDY MRQQGRRTFD DVRLHDIQTC
FITWHYAVRR EISVTEASNV VSAFNAWREH PEIWKLATMF SVFSGTEYKP TQLLQLAEWA
LVARKDGMDP MGNAYLLMEI AILYRAMGNR NLAVREAEKA IDVVHRAGLE DPDGRFRGNL
ALMYMEVGQY IQAETLMRDA LRSARESGDA QNIKSCSSSL AAICRRLGKF ADAETLLNGV
IDNPELPTQP TLDITAKAQL GALYLDIGRL TEALTVLDEV HSLPPDVGGM RSRTFSRILR
TEALCALGRY ESARPELTDV LAVAVRMDLT GAVMLATIQL AHLHSDSGDQ QAALRALDTL
GPHHLNESDQ KFAAEIARLR CITNTRLRRF AKAVTFGGYA CDRYANMSYP LMHARSLAAL
ANAYEGANNP AQTTACRAQA FDIFSRLGVP EADELRELLG PAP