Gene Information |
Locus tag | Snas_4139 |
Symbol | |
ID | 8885340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 4426906 |
End bp | 4429857 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003512883 |
Protein GI | 291301605 |
COG category | |
COG ID | |
TIGRFAM ID | |
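The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The record itself does not state either convention, and all variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 4426906
end_bp = 4429857
gene_length_bp = 2952
protein_length_aa = 983

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1


def record_is_consistent() -> bool:
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and span % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, codons, expected_protein_length, record_is_consistent())
```

Under these assumptions the span is 2952 bp, which is 984 codons, or 983 residues plus one stop codon, matching the listed values.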
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.180514 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGGACTTTC GGGTTCTTGG CCCACTGGAA GTGACGAGAG CTGAGTCTCG TGTCGACGTG CGAGGCCGCA TCGGGAAGCG GCTACTGGCG GTGCTTCTGA TTGATGCCGG ACACGGTGTC TCCATGGCCC GCCTCATCGA GGGCATATGG GACGGTGATC CACCGGAAAC GGCGCTTCGC CAAGTCAGAC ACGCCGCTTC TCGGCTGCGA AAAGTCTTGG GGGCTGAGCG CTTGGAATCG GCCGGGGACG GTTATCGCCT CAATATGGAT GGCGCCACCT GCGACGCGAT TCTCTTCGAA CAGAAGGTAC GTGAAGCCCG TGAACTGTTC CTGCGGCGGG ACAACGCGCA GCGGCTAGCG GTCTTGCGCA CGGCACTCGG ATTGTGGCAG GGCAGCGCGT ACAGCGGCCT GGAAGGCAGG CTCATCAACA CCGAGGTAGC CAGGCTCGGA GGAATATGGC TGGCCGCGCT GGAGGAACGC CTCGACGCCG AACTCGCACA CGGCGACCAT TCGCAGGTGA TCGGTGAGCT TCGGCTGCTG GTCCTGGAAC ACCCGTTCCG GCAACGCTTG ACCGACTTGC TGATGCTGGC GCTGTATCGA GACGGCAGGA CACCTGAGGC ACTCGAAGCG TACGAGCGTT TGCGGACAGA CCTCGCCGAG AACCTGGGCC TCGATCCGGA CGTCTCGTTG CGGGAAAGAC ACGCTGCGAT ACTGCGACGC GATCCGGTAC TGGACCTGTC CGACAACGCC TCGACTCGTC CGGACCGGCC CGTGCGGCCG TCGTATGTTC AGCCCGGAAA GTCCGTGCCC GCTCAACTGC CGATCAAACC CGCGGGATTT GTCGGACGCA ACATCGCGCT TACGGCATTG ACCAGGCAGT TTCACGACAG CGACAAGTCT CGTTCCTGTG TGGTTGTTGG AGCCGCCGGG GTCGGCAAGA CCGCTCTGGC CATTCACTGG GCACATGAGA ACCGCGACCA GTTCCCCGAC GGGCAGTTCT TCATCAATCT GCGCGGATTC GATTCCGGTG CCCCCGTCTC GGCACATGAA GCACTCGGCC GTTTCATCCG AGCGCTGCGG GATCCGAGCG TTTCGATACC ATCCGATGTC GACGAAGCGG CAGCCTTGTA TCGGTCATTG ACCGACGGAA AACGCATACT GGTCGTACTC GACAATGCCA AGAACGCCGA CCAGATTCGA CCTCTCATAC CGTCGTCGCC GAACGCCTTC ACGGTTGCCA CCAGCCGAAA TCGGCTCACA GGACTCACCG CCGTTGATGA CACGGTGCCG CTTTTCCTTT CGCCCTTGAA TCACCTTGAG TCCGTCGATC TTCTGGCCAA ATCCGCGAGT ACGGCTGGCT TGAGCATCGA CCGTGGCTCC ACGCGTCGTA TTGCCGAACT TTGTGGACAT CTGCCGTTGG CGTTGCGGAT CTCGGCCGCG CTACTCGTGG ATGGCTCGGG ACGCACCGCC AAGCAGTTGG CCGATGAACT CGGCAACCCG GACCGCCTTG ACCTGCTTTC CGTCGACGGC GACTCGACGA TTGCCACTGC CCTGGACCAG TCTTTCCAGA GTTTGACAGC AGAAGCCCAA CAGCTGCTGT GCCAGCTCGC TCTCATCCCC GGTGATGACT TTCCGCAAGC GTTGGCATTG GAACTGGGCA CTGGCAAGGA CTTCGACCGC TTGACCACCG CGAGCCTCAT CGATGAGCAT CGCCCAGGTC GATACCGGTT CCATGACCTC ACCCGCGAGT ACGCCAAGAA AAAGGCTCAA GCGACCGGCA GCGAGATTCA CGTCGCTAAT 
CAGGTAATCG GTTGGTACCA CGACAACTGC CGCGTGCTCA ACAGTCACGA CTACAACAAT GTGATCGCCG CGGTTTCGGC TTGGCGTGCT CACTCAAGCT TTTGGCGTCT GGTTCACACG GTGGTGGCGT TTGTGCGGTT GGGTGAGAAC ATATCGGTGG CCGCCCCGCT GGTCGATGAC CAGCTCGCGG CCGCCCTGGA ACGTGGGGAC ACCGAAGCAG CACAAGCCAT GTTTGATGTG AAGGCGCAGC TGTGCGCCGA AGCGGGTGAC AACCCAGGGG ATGTGAAGTT CGGACAGAAG GCGTTGGACA TACACCGCGA CGAAACGGGT TTCTACCATT TCACTCACGG AGATGCCCGC TTCAGCAATG GTGAGATGAC CCTGGCCGAA GACCACTTTC GAACGGCATT GCGGCTAGCC AACGAGGCGC AACACCAGTA CACGATTCTC GCGAGCACTG CCGCCTTGGC CAATCTTTAC CGCGTAACAG GTCGCTATGC CGAGGCCGAG ACGCTGTTCG ACGGCGCGCG CAAGTACTTC GACCACAATC CCACCGGTGC AATGGAAATA GACGCAAGGA TGGCCTTTGT GACCTTCTTG TGCGAGACGG GGCGTGTTGA TGAAGCCGAA GGATTGCTGC GACCCATCCT TGATTCTGAA GTGGAGCTGG ACCCCGGTTC GTGGACGCAG GTGATCATGT TCCGCGCTGA TATACGCCAC GCGCGTGGTC AATACCGTGA GGCGATACAG GATTTTGAAC ATGCCCTGGA ACTGACAGCT GACAACAGCC GGATACAGTC GCGCCTCATC AGGACCAGCA TCGCCGACAT CTATTGCGAT CTCGAGGATT ACGACAACGC CCTGAAGCAG TTGGAACTCA TTGACCTGGA CGCCGCGAGT ACGAAACTGC GCGCCATCGG AATGCTTCAA CTCGCACGTG CCCACAACGG CAAGAAAGAC CATGCGCGGG CGAAGGCCGC TGCCACCTAC TCGGCGGAGG TCTTCGCCGA GACCAGACTC CGCTCGCACG CGCTCGCCCT CACAGTCCTC GCGGACGCAC ACGAAGGACT GGGCGAAGTC AAAACGGCCC GCACAACGCG AGAACGAGCT CTGCGGATAC TGAACGAGCT GGGGCTACCC GAGCCCGGCT AG
Protein sequence | MDFRVLGPLE VTRAESRVDV RGRIGKRLLA VLLIDAGHGV SMARLIEGIW DGDPPETALR QVRHAASRLR KVLGAERLES AGDGYRLNMD GATCDAILFE QKVREARELF LRRDNAQRLA VLRTALGLWQ GSAYSGLEGR LINTEVARLG GIWLAALEER LDAELAHGDH SQVIGELRLL VLEHPFRQRL TDLLMLALYR DGRTPEALEA YERLRTDLAE NLGLDPDVSL RERHAAILRR DPVLDLSDNA STRPDRPVRP SYVQPGKSVP AQLPIKPAGF VGRNIALTAL TRQFHDSDKS RSCVVVGAAG VGKTALAIHW AHENRDQFPD GQFFINLRGF DSGAPVSAHE ALGRFIRALR DPSVSIPSDV DEAAALYRSL TDGKRILVVL DNAKNADQIR PLIPSSPNAF TVATSRNRLT GLTAVDDTVP LFLSPLNHLE SVDLLAKSAS TAGLSIDRGS TRRIAELCGH LPLALRISAA LLVDGSGRTA KQLADELGNP DRLDLLSVDG DSTIATALDQ SFQSLTAEAQ QLLCQLALIP GDDFPQALAL ELGTGKDFDR LTTASLIDEH RPGRYRFHDL TREYAKKKAQ ATGSEIHVAN QVIGWYHDNC RVLNSHDYNN VIAAVSAWRA HSSFWRLVHT VVAFVRLGEN ISVAAPLVDD QLAAALERGD TEAAQAMFDV KAQLCAEAGD NPGDVKFGQK ALDIHRDETG FYHFTHGDAR FSNGEMTLAE DHFRTALRLA NEAQHQYTIL ASTAALANLY RVTGRYAEAE TLFDGARKYF DHNPTGAMEI DARMAFVTFL CETGRVDEAE GLLRPILDSE VELDPGSWTQ VIMFRADIRH ARGQYREAIQ DFEHALELTA DNSRIQSRLI RTSIADIYCD LEDYDNALKQ LELIDLDAAS TKLRAIGMLQ LARAHNGKKD HARAKAAATY SAEVFAETRL RSHALALTVL ADAHEGLGEV KTARTTRERA LRILNELGLP EPG
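The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted alternative start codons and an initiator codon is read as methionine. A minimal sketch of that check follows; the helper names and the short `head` and `tail` fragments (copied from the two ends of the gene sequence above) are illustrative, not part of the record.

```python
# Start and stop codons for translation table 11 (bacterial, archaeal, plant plastid).
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the spaces that break the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def check_cds_ends(seq: str) -> tuple[bool, bool]:
    """Return (has_valid_start, has_valid_stop) for a coding sequence."""
    s = clean(seq)
    return s[:3] in TABLE_11_STARTS, s[-3:] in STOP_CODONS


# First and last blocks of the gene sequence in the record.
head = "TTGGACTTTC GGGTTCTTGG"
tail = "GAGCCCGGCT AG"

print(check_cds_ends(head + tail))
```

Passing the full gene sequence instead of the two fragments runs the same check on the complete record.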