Gene Snas_4966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4966 
Symbol 
ID8886173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5275500 
End bp5278592 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content68% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003513697 
Protein GI291302419 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.998552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGG AAATACGCCT GCTGGGAGCG ATCGAACTGT GGGCCGACGG TCGTCGCGTC 
GACATCGGCC CGGCCAAGCA GCGTGCCGTC TTCGCGATCC TGGCAGCTGA GGCGACCAGC
GTGGTGCCGA CCGACCGCCT CGAACACCAC ACCTGGGGCG ACGCCCCACC CAAGGACGCC
CGCCGGACCC TGCACGTATA CCTGACCCGG CTGCGCCGCG CGCTGTCAGG CATCGAGGGC
CTGTCGCTGG AGCGGCACGG CGGCGGCTAC GTCCTCGGCG TGGACACCGA ACAGGTGGAC
CTGCACCGCT ACCGGCGACT GTGCGGCGCC GCCCGCGACG CCGCCGACGC CTCCCACGCC
GCCTCACTGT GGCACGAGGC GTTCGCACTG TGGCGGGGCG AGCCGTTCAC CGACCACGAC
ATCCCCCGGC TGAACCGGCT GCGCCACGAG CTGCGCGCCG AACGCGAGAC CGCCGAACTG
GACCGTAACG ACGCCTACCT GCGCGCCGGA CGGCACACCG AACTGCTTGC CGACCTGACC
GAACAGGTCG AGCGACGTCC GTTGGACGAG CGACTGGCGG CCCAGTTCAT CGACGCGACA
CACCAGTCGG GTCGCACCGC CGAGGCCCTC ACCCACTACC GCGACCTCCG CGACCGGCTG
GTCGGCGAAC TGGGCCGCGA ACCCGGATCG AGCCTGCGCG ACCTGCACCG CCGCATCCTC
AACGACGACC AGGCCCCCAC CGTCACCGAG CAGTCGGTGC CGCGACAACT GCCGGTCACG
ACCGTGACGT TCACCGGCAG AGACGCGGAA TCGGCCCACG CGATCGCGCT GCTGGGTTCG
GGAACCCCGA TCGTGTCGGT CGACGGCATG GCCGGGGTCG GCAAGACCGC CTTCGCGGTC
CGGGTGGCCA CAGAAGTGTC CGACAAGTTC TGCGACGGCC AGCTCTTCGT GGACCTACGC
GGCTTCTCCG ACGACCTGGC ACCGCTACCC GCGAACGAGG CGATCGGCGG CATGCTGCGC
GACCTCGGCG TGCCGCAGAC CCAGATCCCC GCCGACCTGG CGGGACGCTC GGCGATGCTG
CGCAGCCGAC TGGCCGACCG ACGAGTCCTG CTGGTCCTCG ACAACACCAT CGGCACCGAA
CAGGTCCTAC CCCTGCTGCC CGGCCCCGGC GACAGCGCCG TACTGATCAC GAGCCGCCGC
AAACTGCCCG ACCTGCCCGA CGCCGAACCG ATCACACTGG ACGTCCTACC CCGCCACGAA
GCCCGCGAAC TGTTCACGAC GGTCGCGCAA CGAAACATCG ACGCCGAAAC CGACCCCGTG
AACGACATCG TCACGCTGGC CGGTCAACTC CCGCTGGCCC TGCGACTGGC GGCGGCCCGA
CTGCGCAGCC GTCCGGCATG GACGGTCACC GACCTTCGCG ACCGGATGGC CTCCGAACGA
CAAGGCGAAC GCCGCTCACC GGCCGGACGA AAACTCGGAG CCGCCTTCGA ACTGTCCCTA
CGCGCCCTCA CCGTCGAAAA ACGCGAGACA TTCCTGTCGG CGAGCCTGAT CCCGGTCCAC
GATCTCACCG CGGCATCGGT CGCCGCCGTA ACCCAACACC CCATCGACGA GGTCGAGGAA
ACCCTCGAAG AACTGTGCGA CCTCAACCTG CTCACCACCC CGACGGCGGG CCGCTACCAG
TACTTCGACC TGCTGCGCGA CTACGCCGCC CAGATAGCCG AAACGAACCA GCCCGCCCAC
ACCCGCCACG ACATCACCGG GCGAGCGTTG CGCTGGTACA TGGCGAATGC CCGCACCGCA
TGCGCGGCGG TACGTCTCCC GTTCCCCGAG AACCCGGGCC TGCCCACCGA CCCTGCGGAC
ATGAGGTTCG ACGACGAGCA GTCCGCGTTG GCCTGGCTGG ACTCCGAACG CGGGAACCTG
CTGGCGCTGT TGCGCCACTC AAGTGCCACA TCGATACCGA TGTGGACAAT GGTCGACGCG
ATCTCGAACT ACCTGCTGTA CCGCGCCGAA GGCGCCAGCC TGCTCGAGAT CTGCGACCTC
GCACTGAGCG AACCCGAGGC CCTGGCCGAC AACCTCGCCC AGGTCAAACT GTTGAGCCGC
AAAGCGAGCG CGGCACAGAG CCTGGGCGAC CGGACGGCCA TGCTGACCTA CACCCAAGCC
GCCCGCGAAC GCCTCACCCC CGACGCCGGA CCGAAACTGC GACTGGGCGC CCTGTCGCAA
CTGATCATGG TCCACCGAAC CCTCGGCAAC GTCGCGGAAG GAGCCACGGC CGCGGCGGAG
GCGCTCAAGA CCTACCGCGA ACTCGACGAA GACGGTGCCT CGTACATCCT GCAACAAGTC
GCGGCGGCGG TCTCCGACAC CGGCGACCTG CACGCGACCC GCGAACTGCT CGAAGAGGCC
AGTCGTGAGT TCCGGCGACG CGACAGCCTC AACCTGGGTT TCGCGCTCAC CGCGCTGGTG
GAGGTCTGCA CGGAACTGGG CGACTTCGAA GCGGCCGAAC ACTTCGCGGA CGAGACCCGC
AAGTGGATGG GCAAGGCCGG AACCGAAACG GCCCAGCCGC AACTGCACCA AGACATGGCG
CTGCTCCACC ACGCCCGGGG CGAAACCGAA CCAGCCCTAG AACACGCCCG CCGCGCGGTC
GACCTGGCCC GCCAAATGGG CATGCACGGC GTCGACAGCG CAGTCCTGTC CACACTGGCC
CGGGTCTCAC GAGACGTCTC ACCGGACGCA CACCAATACG CCGAGGAAGC AGTCAAAACC
AGCCGAGACC GCGAAGCGAT CGCCGAACAC ATCGTCGCCC TCTGGGTACT GGCCGACATC
CAACTCGCCG CGGGCGACAC CACCTCGGCC ACCCACACGG CGACCACAGC CCTGGACCTG
GCCCGCAAAC ACGGATACCG CCTCCTGAAG GCCAAGATCC TGACGGTACT GACCGAAGTC
CACCTGGCCA CAGGCGACCA CGACCAAGCC CGCCACACCG GAACCGAGGC CCTACGCGAC
CACCAGTGGT GCGGATCCCG CCCCCAACAT GCCAAAGTCC ACAAGCTACT AGCGCAGGCG
ACCACCGATG ACGGGTCAGG TCGGATCAGT TAG
 
Protein sequence
MTLEIRLLGA IELWADGRRV DIGPAKQRAV FAILAAEATS VVPTDRLEHH TWGDAPPKDA 
RRTLHVYLTR LRRALSGIEG LSLERHGGGY VLGVDTEQVD LHRYRRLCGA ARDAADASHA
ASLWHEAFAL WRGEPFTDHD IPRLNRLRHE LRAERETAEL DRNDAYLRAG RHTELLADLT
EQVERRPLDE RLAAQFIDAT HQSGRTAEAL THYRDLRDRL VGELGREPGS SLRDLHRRIL
NDDQAPTVTE QSVPRQLPVT TVTFTGRDAE SAHAIALLGS GTPIVSVDGM AGVGKTAFAV
RVATEVSDKF CDGQLFVDLR GFSDDLAPLP ANEAIGGMLR DLGVPQTQIP ADLAGRSAML
RSRLADRRVL LVLDNTIGTE QVLPLLPGPG DSAVLITSRR KLPDLPDAEP ITLDVLPRHE
ARELFTTVAQ RNIDAETDPV NDIVTLAGQL PLALRLAAAR LRSRPAWTVT DLRDRMASER
QGERRSPAGR KLGAAFELSL RALTVEKRET FLSASLIPVH DLTAASVAAV TQHPIDEVEE
TLEELCDLNL LTTPTAGRYQ YFDLLRDYAA QIAETNQPAH TRHDITGRAL RWYMANARTA
CAAVRLPFPE NPGLPTDPAD MRFDDEQSAL AWLDSERGNL LALLRHSSAT SIPMWTMVDA
ISNYLLYRAE GASLLEICDL ALSEPEALAD NLAQVKLLSR KASAAQSLGD RTAMLTYTQA
ARERLTPDAG PKLRLGALSQ LIMVHRTLGN VAEGATAAAE ALKTYRELDE DGASYILQQV
AAAVSDTGDL HATRELLEEA SREFRRRDSL NLGFALTALV EVCTELGDFE AAEHFADETR
KWMGKAGTET AQPQLHQDMA LLHHARGETE PALEHARRAV DLARQMGMHG VDSAVLSTLA
RVSRDVSPDA HQYAEEAVKT SRDREAIAEH IVALWVLADI QLAAGDTTSA THTATTALDL
ARKHGYRLLK AKILTVLTEV HLATGDHDQA RHTGTEALRD HQWCGSRPQH AKVHKLLAQA
TTDDGSGRIS