Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_3059 |
Symbol | |
ID | 8884258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 3225960 |
End bp | 3229004 |
Gene Length | 3045 bp |
Protein Length | 1014 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003511823 |
Protein GI | 291300545 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.163733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCGGCG TGAAGTTTCG GATCTTGGGA CCACTGGAGG TGTCCCGCGA CGGCGAGCCG GTGACCATAT CGGGCAGACA CCATCCGAAA CTGCTCGCCC TGCTGCTCCT CGAGACCGGC CGCGTCGTCA CCGTCTCCCG ACTCGTCGAC GCCCTCTGGG AAAACGACCC GCCCGCCACG GCCCGCCGCC AGATCCAGAA CACCATGGCC AGCCTGCGCC GCCAACTGGC GGGCGACGAC TCCCCCACCC TGGAAGCCAC CGGCGAAGGC TACCGACTCC TGGTACCCGC CAAAACCGTT GACGCCCAAT GCTTCACCGA CCTGGTCCGC CAAGCCCGCA CCGCCCGCGA CACCAACGAC CTCCCCACGG CGTCCCGCCT GTTCGCCGAA GCCCTCGCCC TGTGGCGAGG CGAAGCCCTC GCGGGCCTGT CCGGCCGCGT CGTCGAAGCG GCAGTGGTGC GCCTCAACGA ATCCCGCCTG TCCGCGATCG AAGACCGCTG CGACATCGAC ATCGCGCTGG GCCGCCACGG CCAGGTCGTC GGCGAACTGC GCGAACTCCT GGAACACCAC CCCTACCGCC AACGCGTCGC CGGTCTGCTC ATGACCGCCC TGCACCACAG CGGACGAACC CCCGAAGCCC TCGAGGTCTT CACCGAAGTC CGGGCCCGCC TGTCCGACGA ACTCGGCCTC GACCCCGACC CCGACCTGAA CCGCCTCCAC GGCGAGATCC TGCGCGGCGA CCTCGACACC CCCACCACCG AACCCACCCC ACCCTCGGCA CCCCGCCCGG CCCAACTCCC CGCCGACACC GCCACCTTCA CCGGCCGCGA AGAACAACTC GCAGCCCTCG ACAACCTCCT GGCCGACGGC CGCACCGCCA CGGTCGTGTC CGCCATCGCG GGCATGGGCG GCGCGGGCAA AACCGCCCTC GCCGTCCACT GGGCCCACCA CGTCCGCGAC CGATTCCCCG ACGGCCAGCT CTACATCAAC CTGCGCGGCT ACGACGAAGC CGCCCCGGTC TCCCCCGCAG ACGCCCTCAC CCGCTTCCTC AACGCCCTGG GCCAACCCGG CGCCGCGATC CCCACCGACC CCGACGAAGC CGGAGCCATG TACCGCTCCC TGCTCGCCGA CCAACGCATG CTCATCCTCC TCGACAACGC CCGCGACGCC GCCCAGGTCC GCCCCCTCCT CCCCGGCGGC GGCGGAAACT TCGCCCTCAT CACCAGCCGC GACCGCCTCA CCAGCCTGGT AGCCCTCGAC GACGTCGCCC CGCTGCGCAT CGACACGCTG AGCCACGAGG AGTCGGTGGA TCTGCTGTCC AACCTCGTCG ACCCCGTCCG TCTCCACTCC GAACCCGAAG CCACCCATCA GCTGGCGCGA CTGTGCGGCC ACCTCCCCCT GGCGTTGCGC ATCGCCGGAG CCAACCTTGC CGACCGGCCC GAAACCAACG TCACCCAGTT CGTGGCCGAA CTCGAAGGGC CGCAACGACT CCAGAAGCTG ACCGCCCCCG ACGACCCCGC CGTCGCCATC ACCCGGACCC TCCACCTGTC CGTCAGCGCC CTCACCCCGG CCGCGCGGCA GTTGTTCACC CTGCTGGGAA TCCTGCCGGG CGAGGACTTC TCACACGACC TGGCCGCCCA CCTGGCCGGA ACCGTCACCG ACGACGCTCC GCGAGCGATC AACGAACTGG AAGCTGCCCA CCTCGTCGAG AGCCACCACG ACAACCGGCT GCGCTTCCAC GACCTGGTTC GCGAATACGC CAACGCCCGA GCATCGCAGT GGGACGACGC CGACCGCGGC GAGGCCGTCA CCCGCGTCAT CGGCTGGTAC GACCACAACA AAGCCACCCT GCCCACCGAC GAACGCGACA ACGTCCTGCG AATGCTCTCC GCCTGGAACC ACCGCCCCGA CTCCTGGCGC CTGGCGGCGG TTCTCGGCAG GTTCGTCCAC TACGGACCCG ACCTGCCCCG GCAACTCGAA CTACTGAGAC ACGAACTGAG CAAAGCCGAA CACAACCAGG ACCACCCCGG CCGCTGCCAG ATGAACTCAA CCCTGGCCAT CGTCCACCGG GAAATGGGCC ACCGGACCAC CGCCATCGAG TATGCGCGGG AAGCCGTCCA GATCATGCGC GATCACGACG TGGACGACCC GGTGGGCAAG TACGTGGGCA ACCTCGGTCT CTACCTGGGC GACATGGGCC GCGTCGCCGA GGCCGTCCCC CTCGTCCTCG AGTCGTACAA CGCCGCGGTG GCCACCGGCG ACGACCTCTT CGCGACCATC CGCGCCTCCA CCCTTGGAAC CCTCTACGCC GAGTTGGGCG ATTACGGAGA AGGCGAGAAG TGGACAACGA TGGCGCTGAA GCTGACTGAG CAACCATCCC TGCGATTCTT CCGCCAAGGC ATCAGCTACT GCCTGTGCGA ACAATACGTC AGCTCCCGTC GCTTCAGTGA CGCCGAACCA CTGATCACCG ACATCCTGAA CCAACCGGAT TCCAGCGGCG CCAAGTACCA TGCCCTCACC CTGATCCTCC GCGCCGAGAT CAACCGGGCG CGCGGCCGCT ACGACGCCGC GCATGAGGAC CTGGCCGAGG CGCTGCGCCA CGCGTCCCAG ACCGACCGCT CCGGCTTGCG CGACATGGCT GAATGCGGGA TGGCCGAACT CGAAATCCAA ACCGGCCATC CCCGCCAGGC CATTGACCGG CTCATGTCCT CCCACCCGGC CAACACCGAT CAGATGGGAG CATTGCAGCG AGCTCAGGCT GACCGTCTAC TTTGCCTCGC GCACGCGCGC CTTGGCGACG GTGATACGGC GATCAACTAT GGCGACGCGG CGCTGGCGGC GTTCCGTTCG ATGCCACGCC CCCTGTTGGA GGCCAGAACC TTGGTGGCCC TCGCCGAAGC GCACGACACG GGAGGCGACA AACTTTCTGC CCAACGCGAC CGTGAGTCGG CGCTCGAAAT CTTCACTCGC CTCGGAATTC CGGTCGAAGA AAGTCGCGAG GTGCCGCTTG GCCATGGCCA GCGTCCACCC CATTGGGATA CGTAG
|
Protein sequence | MGGVKFRILG PLEVSRDGEP VTISGRHHPK LLALLLLETG RVVTVSRLVD ALWENDPPAT ARRQIQNTMA SLRRQLAGDD SPTLEATGEG YRLLVPAKTV DAQCFTDLVR QARTARDTND LPTASRLFAE ALALWRGEAL AGLSGRVVEA AVVRLNESRL SAIEDRCDID IALGRHGQVV GELRELLEHH PYRQRVAGLL MTALHHSGRT PEALEVFTEV RARLSDELGL DPDPDLNRLH GEILRGDLDT PTTEPTPPSA PRPAQLPADT ATFTGREEQL AALDNLLADG RTATVVSAIA GMGGAGKTAL AVHWAHHVRD RFPDGQLYIN LRGYDEAAPV SPADALTRFL NALGQPGAAI PTDPDEAGAM YRSLLADQRM LILLDNARDA AQVRPLLPGG GGNFALITSR DRLTSLVALD DVAPLRIDTL SHEESVDLLS NLVDPVRLHS EPEATHQLAR LCGHLPLALR IAGANLADRP ETNVTQFVAE LEGPQRLQKL TAPDDPAVAI TRTLHLSVSA LTPAARQLFT LLGILPGEDF SHDLAAHLAG TVTDDAPRAI NELEAAHLVE SHHDNRLRFH DLVREYANAR ASQWDDADRG EAVTRVIGWY DHNKATLPTD ERDNVLRMLS AWNHRPDSWR LAAVLGRFVH YGPDLPRQLE LLRHELSKAE HNQDHPGRCQ MNSTLAIVHR EMGHRTTAIE YAREAVQIMR DHDVDDPVGK YVGNLGLYLG DMGRVAEAVP LVLESYNAAV ATGDDLFATI RASTLGTLYA ELGDYGEGEK WTTMALKLTE QPSLRFFRQG ISYCLCEQYV SSRRFSDAEP LITDILNQPD SSGAKYHALT LILRAEINRA RGRYDAAHED LAEALRHASQ TDRSGLRDMA ECGMAELEIQ TGHPRQAIDR LMSSHPANTD QMGALQRAQA DRLLCLAHAR LGDGDTAINY GDAALAAFRS MPRPLLEART LVALAEAHDT GGDKLSAQRD RESALEIFTR LGIPVEESRE VPLGHGQRPP HWDT
|
| |