Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_0796 |
Symbol | |
ID | 8881980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 839868 |
End bp | 842969 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003509601 |
Protein GI | 291298323 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.64254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAGTTC GGTTGCTCGG TCCCCTGGAG GTACTGCGGG ACGGCACACC GGTGCCGATC CGGGGCCGGA TCCACCCCCG GCTGCTGGCC GTCCTGGCGC TGCACGCCGG CAACGTCGTC GCGCGACCGG CACTGATCAC CGGCGTGTGG GACTGTGAAC CACCGGCTTC GGCCAAGCGA CAGATCCAGA ACGCCGTCTC GGCGTTGCGC CAGGTACTGG ACGGCACCCT CATCGAAACG GTCGGCGACG GCTACCGGCT GCGCCTTGAC GACGTCACCG TCGACGCCCG CGACTTCGAG ACGACCGTCG CCGAGGCGGC TCGGCAGCGT GCGGGCGGCG ACTCCGACGC GGCACTGACC TCGTTGCGGG ACGCGTTGGC GCTGTGGCGC GGCGAGGCCC TCGCCGGGTT GCCGGGACGC GAACTGCGCT CCCGGGCCCA GCGGCTGGAG GAAGCCTGGC TGGCCGCCCG GGAGGAACTG ATCGATGTGG AGCTGGAACT CGGCGAACCG GTGCCGGTCG GCGAACTGGG CGAGCTGGCC CGGCTGCATC CGTACCGGCA GCGCCTGACG GGCCTGTACA TGCGGGTGCT GCACCGGCAG GGGCGCACGC CCGACGCGCT GCGGGTCTTC GACGACATCC GGCGACGGCT CCTGGACGAA CTGGGTATCA CCGTCGGACC GGCACTGCGC GAGCTGCACA CCGCGATCCT GCGGGAGGAC CCGCGGCTGG ACGCGGCGGC ACCGGTGGCC GTGCCCGCGT CCCCGGCCGC CGCCCCGAGG CTGGTGCCCG CCCAGCTGCC CGCCGCCATC GGCGAGTTCG TCGGCCGACA GGAACAACTG GCGCGGCTGG ACGCGCTGAT CGCCAAGGGA GACAACACCG CCCTGCTGTC CACGGTGTCG GGAACCGGCG GCGCCGGGAA GACGGCGCTG GCGATCCACT GGGCGCACCA CAACCGGGAC CGGTTCCCCG ACGGACAGCT GTACGTCAAC CTGCGCGCGT TCGACCGCGC CGAACCGCTG ACCCCGTACG ACGCCCTGAC CCGGTTCCTC GCGGCACTGG GGGTGACCGG CGGCGCGGTC CCGTCCGATG TGGAAGCCGC CGCCTCGCTG TACCGATCGC TGCTGGACGG GCGCCGGATG CTGGTCCTGC TCGACAACGC CGTCGACCTG GAACAGGTGC GCCCGCTGCT TCCCGGCAGC GGCGGCAACG TCACGCTGGT GACCAGCCGC AACCGGATCA CGGGTCTGAC GGCGCTGCAC GGCGCCGAGC TGATCGGTGT GGACACCATG TCGCGGACGG AATCCCTGGA GGTGCTGGGC AACCTGGTGG GTGCGCGGCG GTTGCACGCC GACGCTGCGG CGGCACATCG GCTCGCGGAA CTGTGCGCGG ACCTGCCGTT GGCGTTGCGG ATCGCGGGGG CGAACCTCGC CGTGAACTCG CATGTGGAAC TCAGCGAGTA CGTCCGGGAA CTGGCGGGTC CCAACCGGCT GGAGTTGCTG TCGATCGAGG GGGACCCGGA CTCCGCGGTG GCGTCGGTGT TCGCGCAGTC CTTTCGGGCG CTGTCACCCG AGGCTCAGCG GCTGTTCGCG CGGCTGGGCT GGATACCCGG TGACGATTTC GGCGAGGAGC TCGCGATCGC CGTCGCGGAC CTGCCGGAGG CGGACTGTCG TCGGTTGACG CGCACCCTGG AGACCGGCAA TCTCGTCGAG CGGTATCAGG CGCGGCGGTT CCGGTTCCAC GACCTGGTCC GGGAGTACGC GCGGCAGCAG GCGGAGAACA CACTGGACGA CGCCGAGCGC GACGCGGTCG CGGATCGGGT CGTCCAGTGG TACTACGACA ACCGCCGGAC GCCCCGGGCC GAGGACTACC CCAATCTCGT CGCGACGTTC CAGACCTGGC GGCACCACCC CCGGTGCCTG ATGCTGCCGA GTGTGTTCGC GCAGCACGTC AACGCCGGTC GCGACCTCGC CGGAATGCGA GCCCACCTCG ACGCCGCGTA CGCGCTGGCC AAGAGCCTCG GCGAACCACT GTCCGTGCAT CGGGCCGCCG ACGCCCTGGC CGTGCTGGCC TGGGCGAAGG GCGACACCGG CACCGCCGTC GAGTTCGGGC TCGAATCCCT GGCCAACGCC ATGAGCCACG ACGGGGACGC CCTGGGCACC GCCCGGGCGA ATCTGGGACT GCACTACGCC GCCCACGGCG ACTACCGCAA AGCCGAACCG CTGCAACGGG AAGCGCTGGA GGTCGCCGTC AGCCAGGCGG CGGCGGGCGC GCCGAACCGG GGCCTGAACC TGGTCAATCT CTACTGTGGC CTGGGGCGGT ACGCGGACGC CACCACGCTC GTCGAGCGGA TCCGGCACAT GCCCGGTATC GACGCGAACG ATCTCGTCCT CGGGGCCGTC CACCGGATCA AGGCCCAGAT CCATATGAAT CTGGGCCGGT ACGGCGAAGC GCTCGCCGAG ATCGAGGCGG GCCTGCGACT CGCCGGTCAA CGCTCGCAGC CGCGCAGCGA GAGCATCGCG TTGCGACTGC GGGCCGAGAT CCGGCGCCGG GCCGGTGATC TCACCAACGC CCTGGCCGAC GCGACCCGCG CGCTGGAGCT GGCCCGGCGA CACCAGCTGT CCAAACAGGA GAAGGACGCC GTTTTCGAGC TCGCGGCACT GCACTGCGCG TCGGGCACGG CACAGAAGGC GGAGGAACTG GTGCCGTCGC TGGCGGAGAC GGCTCGCGAC TCCAGCGGGC CACAGCGCGC GGAGGCGTTG GCGCGACTGG CGGAGCTCCA CTTCCGGCGG GGTCGTCACG CCGAGGCGAT CGAGTGCGGC ATGGCGGCGA AGGAGCTGTT CGCGAGCATT CCCCGGCCGC TGGGGCTGGC TCGCGTGTTG CGCGATCTGG CCGAGATCCA CGACGACCTC GCCGAGACCG AGACGGCGCG CGGTTTCCGG GCCGAGGCCC TGGACCTCTT CACCCGCCTG AGAGTCCCCG AGGCCGAAGA ACTCCGCCGC CAGCTCGACA GCCCGCCGTC CGACTTACCT CAGACCCCGC CGTCGTCGAC ACCCGTGCGG CGGTTGCGGT CCTCCTCCAC CAGCTTGTAC AAGGTGGACT GA
|
Protein sequence | MEVRLLGPLE VLRDGTPVPI RGRIHPRLLA VLALHAGNVV ARPALITGVW DCEPPASAKR QIQNAVSALR QVLDGTLIET VGDGYRLRLD DVTVDARDFE TTVAEAARQR AGGDSDAALT SLRDALALWR GEALAGLPGR ELRSRAQRLE EAWLAAREEL IDVELELGEP VPVGELGELA RLHPYRQRLT GLYMRVLHRQ GRTPDALRVF DDIRRRLLDE LGITVGPALR ELHTAILRED PRLDAAAPVA VPASPAAAPR LVPAQLPAAI GEFVGRQEQL ARLDALIAKG DNTALLSTVS GTGGAGKTAL AIHWAHHNRD RFPDGQLYVN LRAFDRAEPL TPYDALTRFL AALGVTGGAV PSDVEAAASL YRSLLDGRRM LVLLDNAVDL EQVRPLLPGS GGNVTLVTSR NRITGLTALH GAELIGVDTM SRTESLEVLG NLVGARRLHA DAAAAHRLAE LCADLPLALR IAGANLAVNS HVELSEYVRE LAGPNRLELL SIEGDPDSAV ASVFAQSFRA LSPEAQRLFA RLGWIPGDDF GEELAIAVAD LPEADCRRLT RTLETGNLVE RYQARRFRFH DLVREYARQQ AENTLDDAER DAVADRVVQW YYDNRRTPRA EDYPNLVATF QTWRHHPRCL MLPSVFAQHV NAGRDLAGMR AHLDAAYALA KSLGEPLSVH RAADALAVLA WAKGDTGTAV EFGLESLANA MSHDGDALGT ARANLGLHYA AHGDYRKAEP LQREALEVAV SQAAAGAPNR GLNLVNLYCG LGRYADATTL VERIRHMPGI DANDLVLGAV HRIKAQIHMN LGRYGEALAE IEAGLRLAGQ RSQPRSESIA LRLRAEIRRR AGDLTNALAD ATRALELARR HQLSKQEKDA VFELAALHCA SGTAQKAEEL VPSLAETARD SSGPQRAEAL ARLAELHFRR GRHAEAIECG MAAKELFASI PRPLGLARVL RDLAEIHDDL AETETARGFR AEALDLFTRL RVPEAEELRR QLDSPPSDLP QTPPSSTPVR RLRSSSTSLY KVD
|
| |