Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_5531 |
Symbol | |
ID | 8886745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 5876920 |
End bp | 5879811 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003514255 |
Protein GI | 291302977 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.394906 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATTT CCGTGACCGC CCCGTTGCGT TTTCAACTGC TGGGACCGGT CCGCGCCTGG CGGGACGGTG AGGAACTTCG ACTCGGTTCG CCGCAGCAGC GGCTGGTGCT CGTCCGGCTG CTGATCGGCG AGGGCCGTCC GGTGTCCATC GACCAGCTTT CCGCCACTCT GTGGCGCGAC GATCCGCCGC CCGCGGCCCG CAGCACGGTG CGCACCTACC TGTCCCGGCT GCGTTCGGTA CTGGGCGCGG GGATCATCGA CTCCCGCGAG CACTGGTATT CGCTGCGGGT CGGCGAGGTG GACGCGTGGC GGTTCGCCGC GCTGCTGCGC GAGGCCGAGG CCACCCCGGA CCGCGACCGG GCCCGGCGGC TGTTGAGCCA GGCTCTCGAC CTGTGCCAGG ACATCCCGCT GGCCGAGCTG CCGGGCGAAT GGGCACGCGC GCAACGCATT CGGCTCGAGG AGGAACGCGA CCGGGCCGTG GAACGACGGG CCCGCCTCGA CCTGGACTTC GGACGGCCCG CCGACGCCGC CGCCGCGCTG GCACGGTTGT GCGCCGAGTA TCCGCTGCGG GAGCGGCCGC ATCAGCTGCT GATGCTCGCG CTGTACCGGG ACGGCCGCAA GGCCGAGGCG CTGACCGTCC ACAACGACCT GCGCCGCCGG CTCGCCGAGC AGCTGGGCAT CGATCCCGGC GACGAGACCG CCGAGCTGCA CCGGCGGATA CTGCGGGCCG ACCCGGAACT GGCGGCACCC CCACCGATCG CGGCCCCCGA CGCCGAACCG CCGCGTCGCG GCCCGGCCCA GCTGCTGGCC GACATTCCCG ACTTCACCGG CCGCGACGAC GTCGTCAGGG AACTGACCGC GCTGCTCACC GAGGAACGCG ACGCGGCCCC GGTCGTGGTC GTCACGGGCA TCGGCGGCGC CGGTAAGACC ACATTGGCCA CTCACGTCGG CCATCGGGTG GCCGCCGACT TCCCCGACGG TCAGCTGTAC GTCGACCTGC GCGGCGCCGA CGAGGTTCCG CATGAGCCGC TGGCGGCCCA GCGGGGCATG CTGCGGTCGC TGGGCGTCAG CACCGAGGAC ATCCCGGCCG CCGAGGACGA ATGCGCCGCG CTCTTCAGGT CCACTATGGC CAGCAAACGG TTGTTGTTGT TGCTGGACAA CGCCGCCGAC ACCGCGCAGG TCCGGGCGCT GCTGCCCGGT GCGGCCGGGT GCGCGGCGAT CGTCACCGCG CGGTCCACGC TCACCGGTCT CACCGGCGCG CGTTACGTCC GCCTGTCCGC CCTGGAACCC GGCGAGGCGG TGACGTTGCT GCGGCGGGTG GTGGGCACCG AACGCGTCGA CGCCGAGTCC GCCGAGGCGC TGAACGTGGT GACCGCCTGC GGTTCGCTGC CGCTGGCCGT GCGGATCGCC GCCGCGCGCC TGGTCGCCAG ACCGCAGTGG ACGGTCGGGC AGTTCGCCGA GCGGGTGCGC GACGAACAGC GGCGGCTGGC CGAACTGCGG GCCGGTGACC TGGCGGTCGA AGCGGCCTTC GCGCTCAGCT ACCGGCAACT GGACGAGCAG CACGCGCACG CGTTCCGGCT GTTGACCGTC CCGGACGCCC CCGAACTGGC GCTGGCCACG GCAGCGGCGG TGCTGGGACG CGGCGAACTC GACACCGAGG GCCTCGCCGA GGACCTCGTC GACCTGAACC TGCTGGAATC CCCCGCCTAC CGGCGCTACC GGATGCACGA CCTGACCCGG CTGTTCGGAC GCGGCAAGAC CGATGCGGCC GAACGCGACG CGGCGCTGCG CCGACTCGCC GACTACTACG TGGCCACCGC CTACAACGGA CGCCGGGCGG TCGAGTCCGA CGGCCGTGTC ATCAACGGGG TGCTGCCCAC CGCATCGGAA GGCACCGGCT TCGCGTCGGC GGCCGAGGCG GAACGCTGGC TCGACACCGA GACCGACGCG CTGCTGCTCA CCCTGCGGCA ACTCGCCCGA CAGCTGCCTG CCGCCGTCCG GCTCGCGGGC GATCTGCTGC GCGTCATCGC CAACGTGCAC ATCATCTCCA GCCCAAGACG CGGCGACCTG TTCGCCGCCG CCGGGGCGAT AGCGGCGGCG GCCCGGACGC ACGGGGACGC GTTCACCGAG GGGCGGGCCC TGCTGCAACA GGCGGACTGC CACCTGGCGG AGGCGCGCTT CACCGAGGCG AAAGCGCTGG CGACCACCGC CCTGGAGCTG GCCGAGCGAA GCGGCGACGG CTACGGCCAC GCCTCGTCCC AGGCCTGCCT CGGCGTGACC GCCTTCTACG CCGACGACGA TCCGGAGGCG GCGGTCAAGC TGTTCTCGGC CGCCTATGAG GGTTTCAGCG CCCTGTCGGT CGGGGCCGAG GCCGGAAAAC TGCTGGCGAC CCGGGCCCGG ATCCTGTTGC GGCTGGGGCG CCGGACCGAG GCGGTCGCCG ACGCCGTCGA GGCCGTCGCG CGGCTGCGCG AGCACGGTGT CGGCACCGCC CTGGCGGACG GGCTCTACCA GCTGGGCACG ATCCGGCAGT CGATCGGCGA CCTGGACGCG GCCGTCGCGT CGCTCACCGA GTCACTGGAA CTGCACCGCG CGCTTCGCCG CAACGTCCAT GAGGGACTGT CCCTGTACCG GCTGGCCGAG GCCGAGCTGG GACGCGGGGA TGCCCGGACG GCCCACCGCC ACGCCGAGGC GGCGATCGCG AAGTTCACCG ACATCGGCGA CCAATGGGGA CGGCACTCCG CCGAGGTCGT GCTGGGCCGG ATCCTGTTGG CCACCAACGA ACCCGCGCGG GCCCGCGAAC TGATCGGCAA CGCCGCGACC GGCCTCGAAG CCCTCGGACG CGACACCGAG GCCGCCGAAG CCCGCGCGAT ACTCGACGAT CAGTCGCCCT GA
|
Protein sequence | MGISVTAPLR FQLLGPVRAW RDGEELRLGS PQQRLVLVRL LIGEGRPVSI DQLSATLWRD DPPPAARSTV RTYLSRLRSV LGAGIIDSRE HWYSLRVGEV DAWRFAALLR EAEATPDRDR ARRLLSQALD LCQDIPLAEL PGEWARAQRI RLEEERDRAV ERRARLDLDF GRPADAAAAL ARLCAEYPLR ERPHQLLMLA LYRDGRKAEA LTVHNDLRRR LAEQLGIDPG DETAELHRRI LRADPELAAP PPIAAPDAEP PRRGPAQLLA DIPDFTGRDD VVRELTALLT EERDAAPVVV VTGIGGAGKT TLATHVGHRV AADFPDGQLY VDLRGADEVP HEPLAAQRGM LRSLGVSTED IPAAEDECAA LFRSTMASKR LLLLLDNAAD TAQVRALLPG AAGCAAIVTA RSTLTGLTGA RYVRLSALEP GEAVTLLRRV VGTERVDAES AEALNVVTAC GSLPLAVRIA AARLVARPQW TVGQFAERVR DEQRRLAELR AGDLAVEAAF ALSYRQLDEQ HAHAFRLLTV PDAPELALAT AAAVLGRGEL DTEGLAEDLV DLNLLESPAY RRYRMHDLTR LFGRGKTDAA ERDAALRRLA DYYVATAYNG RRAVESDGRV INGVLPTASE GTGFASAAEA ERWLDTETDA LLLTLRQLAR QLPAAVRLAG DLLRVIANVH IISSPRRGDL FAAAGAIAAA ARTHGDAFTE GRALLQQADC HLAEARFTEA KALATTALEL AERSGDGYGH ASSQACLGVT AFYADDDPEA AVKLFSAAYE GFSALSVGAE AGKLLATRAR ILLRLGRRTE AVADAVEAVA RLREHGVGTA LADGLYQLGT IRQSIGDLDA AVASLTESLE LHRALRRNVH EGLSLYRLAE AELGRGDART AHRHAEAAIA KFTDIGDQWG RHSAEVVLGR ILLATNEPAR ARELIGNAAT GLEALGRDTE AAEARAILDD QSP
|
| |