Gene Information |
Locus tag | Snas_0687 |
Symbol | |
ID | 8881871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 725043 |
End bp | 728054 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003509493 |
Protein GI | 291298215 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
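The coordinate and length fields in the Gene Information table are related by simple arithmetic. The sketch below checks them against each other. It assumes 1-based, inclusive coordinates and a gene length that includes one stop codon. Neither convention is stated in the record, although both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 725043
END_BP = 728054
GENE_LENGTH_BP = 3012
PROTEIN_LENGTH_AA = 1003


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 3012
print(protein_length(GENE_LENGTH_BP))  # 1003
```

Both results agree with the Gene Length and Protein Length rows, which suggests the record is internally consistent under these assumptions.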
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.622837 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTTC GGATTCTCGG ACCTCTAGAC GTCCGGCTCG ACGGCTCGAC CGTCCCCATT CTCGGCCAGC ATCAGCCGAA GCTGCTGGCG CTGTTGCTGC TGGAGAACGA CAGGACGGTC TCACTGGGGC GGATGGTGGA CGCGCTGTGG GATGACGACC CCCCGGCGAC AGCCAAGCGT CAGGTCCAAA ACGCCATGGC GGCGCTGCGG CGTTCACTGA CCGAAGCCGA GCTGGACCCG GTAGCGCGAG TCGGCGAAGG CTACCGATTG ACCACGTCGG AACTGGACCA CAGCGAATTC ACCACATTGG TCCGGCGTGG ACGCGGCGCC GCTGAGGCGG GCCGGTTCGA CACCGCCTTC ATGACCTTGA CCGACGCTTT GGGTGTGTGG CGAGGGCCCG CGTTGGCGGG TATCCCCGGT CGGATCTTCG AAGCAGCCGC CAATCGGTTG GAAGAGGAGC GGCTGTCGGT GATGGAGGAC AGATTCGCCG CGGCGCTGGC CTCGGACAGA AGCGCGGAGT TCGTCGGCGA GCTGCGAGAG CTGGTCGCCG AACACCCATA CCGGCAGCGA TTCACCCAGC ACCTCATGAC CGCGTTGCAC CGCGTTGGGC AGACCGAGGT GGCATTGGCG GCCTTTGATC ACCTCGGCTC ACGGCTTCGC GACGATCTGG GACTGGATCC CGACCCGGAC CTGCGACGAC TGCGTGACAG CATCCGCGAC GGCGAGGACT CCACGATCCA CTCTTCCGCC GTCGTGTCGA CGCAGGCACC GCCACCAGCT CAGCTTCCGG CGAGCATTCG CGGGTTCATT GGCCGCGGTG AACAGCTTTC CGAACTCGAC GCATTGCTGG CGGAGAATCC TCAGCATTCG GTCGCGGTGC TTAGCGGAGT CGGCGGTTCG GGGAAGACCG CGTTGGCTAT ACATTGGGCC GCCGAAAATC GAGAACAGTT CCCCGATGGA CAGCTCTATG TGAACCTTCG CGGATTCGAC ACGACGGAAC CGGTCAAACC CGTCGACGCG CTGCATGCCT TCATCCGGGC ATTGGGGCAC AACGGCGATT CACCAGCCAG TATCGATGAC GCGGTGACGC TGTATCGCTC CCTCCTTGCC CGGCGGCGCG TCCTTGTCGT CTTGGACAAC GCGCTGAATG CCGACCAGGT ACGGCCGCTG CTGCCAATCG GCGTCAACGT AACGCTGGTG ACCAGCCGGG AACGCCTCAC CCCGCTGACC ACGACAGAGT CTGCCCAATC GGTTTCACTC GACGCGTTGA GCCGGTCGGA GGCGTTCGAC TTGCTGACCG TGATGATCGA TACACGGCGA CTGCACGAGG ATGAGCTCGC GGTCTATCGA CTTACCGACC TGTGTGGACA TCTTCCACTG GCTTTGCGCA TGGCTGGCGC GAACCTTGCC AATCGTCCAC ACACCTCCGT CGCGACGTTC GTCGACGAAC TGGACAGTTC CTCAGACCGG CTCGAACTGC TGGCCGTGGA GGGTGATCCA AAAGCGGCAC TGACATCGGT GTTCGATCGG TCGATCGCCG CGATCAGCGA CGAAGCTCGA CTCTTGCTGC TTCGGCTGGG CTTCATCCCG ACTGCCGACT ACGCCGAAGA GCTCGTCATC GAACTCAGCA CGGAGGAAGC CGACAAGACA CGGGAGTTGT TGACGCGGCT CGCCGACGCG CATCTGCTTG AACGCCATCG GCCCAACCGA TACCGTTTTC ACGACTTGAT CAAGGCGTAT GTCAGCACAC GGCTGACCGT CGAACTGGAT GAAACCACCC GTGATGAGCT GGTGCGACGT 
TTCACCGACT GGCAAGTATC AACGCCTCGC GGTGAAGAGT TCGACAACAT CGTGGAAGCT TGCCGAGCCT GGCGTCGACG GCCACGTATC TGGAAACTGG CCGCCGGGCT CCGCCGACTG CTGCATTGGA CGGTGAACGT CCACACCGTC ACGCAACTCG CGACCCAGCT GTTGGCGGAA ACGATCGAGG CCGGCGACCC TTACGGCGAA ATCGAGATGC GTCATCTGCT AGGAATGACG GCTTGGGTGG CCGGTCATGC AGACGAGGCC GAGCGTTACT GCCGAGAGGC CATCGCCTTG GCCGCCGTGC AAGGCGATGG GGACGCGAAC GGAACGATCC GAACCAACCT GTCTACAGTG CTTTCCACTC GGGGCGGCTT CGACGAAGCG GCGCAACTGC TGGCCGAGAC CCTCGAACTC GCCGAACAGA CGGGCGCGAC GGTGGACCGC GAGGTGCGAG CCCTCAATCT GGGAGCTCTG TATCGCCAAC TCGGACGATA CGACGACGCT TGGCGCATCG TCGTCAGTTC AGAAGTCGAG ACTCGTACGA CCGTGCCGCA AAGTACCGCC TTGCCCGCGG GCATGCTGTA CTTCGACATG GGCCGCTACG CGGACCTCAA AGCGTGGCTG AGCTGGGTCC TGGCCGAAGA GTATTCCCAG CTCGTGCCGC CCCAGGTCCG CACCATCGCG CTGGGGCTGC GAGGTGAGGT GCACCGTATC GAGGGCGAAT ACGAAGCAGC CCGGACCGAT CTCGATCAGG CCATCGCCAT CGCGGAACGC TCCGGCAGGA CCGCGATCGC GTACGAGGCG CGCTGCACAC TGGCCCAGCT GCACTGTGAC ACGGGTGAGG CCGAACGCGG GCTGGCTCTC GTCGAGCACG TTCTCGCCAC ACCCGACGCC GCCAGAAAAC CTGATCTACT CGCGCTGGCC CGGTTGACAG CTGCCCGAAT AAGTCGGCGA ATCGGCGAGT TCCCGACAGC GAACCACCAT GTGTCGGCCG CGTTGGAGAT ATCCGAGAGC ATCTCCCAGC CGCTGCGGAT CGGCCGATGC CTGCTGGAGA AGGCCCTGCT GTGTGCCGAC CTCGGCGACG GCTCCCAAGC CGAAACCCTC GGTCGCCAGG CACGAGACAT CTTCGACGAT CTCGGCGTCC CGGAGGCGGA GCACGCGCGC TCGCTGCTCG ACACGCTGCG CGCCGCTCCC CTGCAGCGGT GA
|
Protein sequence | MEFRILGPLD VRLDGSTVPI LGQHQPKLLA LLLLENDRTV SLGRMVDALW DDDPPATAKR QVQNAMAALR RSLTEAELDP VARVGEGYRL TTSELDHSEF TTLVRRGRGA AEAGRFDTAF MTLTDALGVW RGPALAGIPG RIFEAAANRL EEERLSVMED RFAAALASDR SAEFVGELRE LVAEHPYRQR FTQHLMTALH RVGQTEVALA AFDHLGSRLR DDLGLDPDPD LRRLRDSIRD GEDSTIHSSA VVSTQAPPPA QLPASIRGFI GRGEQLSELD ALLAENPQHS VAVLSGVGGS GKTALAIHWA AENREQFPDG QLYVNLRGFD TTEPVKPVDA LHAFIRALGH NGDSPASIDD AVTLYRSLLA RRRVLVVLDN ALNADQVRPL LPIGVNVTLV TSRERLTPLT TTESAQSVSL DALSRSEAFD LLTVMIDTRR LHEDELAVYR LTDLCGHLPL ALRMAGANLA NRPHTSVATF VDELDSSSDR LELLAVEGDP KAALTSVFDR SIAAISDEAR LLLLRLGFIP TADYAEELVI ELSTEEADKT RELLTRLADA HLLERHRPNR YRFHDLIKAY VSTRLTVELD ETTRDELVRR FTDWQVSTPR GEEFDNIVEA CRAWRRRPRI WKLAAGLRRL LHWTVNVHTV TQLATQLLAE TIEAGDPYGE IEMRHLLGMT AWVAGHADEA ERYCREAIAL AAVQGDGDAN GTIRTNLSTV LSTRGGFDEA AQLLAETLEL AEQTGATVDR EVRALNLGAL YRQLGRYDDA WRIVVSSEVE TRTTVPQSTA LPAGMLYFDM GRYADLKAWL SWVLAEEYSQ LVPPQVRTIA LGLRGEVHRI EGEYEAARTD LDQAIAIAER SGRTAIAYEA RCTLAQLHCD TGEAERGLAL VEHVLATPDA ARKPDLLALA RLTAARISRR IGEFPTANHH VSAALEISES ISQPLRIGRC LLEKALLCAD LGDGSQAETL GRQARDIFDD LGVPEAEHAR SLLDTLRAAP LQR
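Both sequences are shown in space-separated blocks of ten characters. The gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a minimal sketch, the code below joins the blocks and translates the first ten codons. The codon table is deliberately partial and covers only the codons in this excerpt. These assignments are the same in the standard code and in translation table 11. The helper names are hypothetical.

```python
# Partial codon table: only the codons that occur in the excerpt below.
# These assignments are shared by the standard code and translation table 11.
CODONS = {
    "ATG": "M", "GAG": "E", "TTT": "F", "CGG": "R", "ATT": "I",
    "CTC": "L", "GGA": "G", "CCT": "P", "CTA": "L", "GAC": "D",
}


def join_blocks(text: str) -> str:
    """Remove the spaces that separate the ten-character display blocks."""
    return "".join(text.split())


def translate(dna: str) -> str:
    """Translate complete codons from position 0; raises KeyError on unknown codons."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of each sequence, copied from the record.
gene_start = join_blocks("ATGGAGTTTC GGATTCTCGG ACCTCTAGAC")
protein_start = join_blocks("MEFRILGPLD VRLDGSTVPI LGQHQPKLLA")

print(translate(gene_start))  # MEFRILGPLD
print(len(protein_start))     # 30
```

The translated excerpt matches the first block of the protein sequence. Translating the full gene would need a complete codon table.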