Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_2958 |
Symbol | |
ID | 8884157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 3120224 |
End bp | 3123118 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003511726 |
Protein GI | 291300448 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.500488 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00113225 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTTGATCC GGCTGTTGGG CTCCGTGGAG GTCGCGGGCG AGCACGGTTG GGTGCGGGCG GGTCCGTCGA AACAGTCCTG CGTGCTGGCG GCCCTGGCGA TGACACCGGG TCAACCGGTC GCCACGGCCG CGCTGCTGCA ACGGGTGTGG GGTGACGACC CGCCGGACAA GGTGTTCAGC ACCCTGTACT CGTATCTGGC CCGGCTGCGG CGGCTGCTCC AGCATGACGG CGTCGCGCTG ACGCGGGCCG GAAACGACGG CTATCAGCTG GTCATCGAAC CCGAGGACAT CGACCTGATC GCGATGCGGC GCCTGGTGAC CCGGGCGCGC GAGGCGGCGC GCGCCGGTGA CCACGAGACC GCCGTCCGGG ATTACCGCGA AGCCTGTTCG CTGTGGACCG GTGAGGCGCT GGCCACGGTG GACGGACACT GGGCGTCGCA GACGCGAGAG GCGTTGCGGC GCGAGCAGCT GGCGGTGCTG TCGGCGTTGT TCGACTCGGA GTTGGCGCTG GGTTACCACG AGTCCGTCAT CCCCGAGCTT GAGGAACTGG TCGGCAGACA CCCGCTGGTG GAGTCACTGG TCGCGCAGCT GATGTTGGCG TTGTACCGCG CCGACCGTCC CTCCGACGCC CTGGCGCGCT ACGCCGAGAC CCGGAAACTG TTGCGGGAGC GGCAAGGCGC CGAACCCGTG GAGCGCCTGC GGCGGTTGCA CAAACGCATC CTGAGTCACG ACCCCGAACT GCGGCACGTC ACCGACAGTC CCGCCGTGGT GTCCGGTCGG GAGACACCGG CGCAGCTGCC CGCCGACACC ACCGCCTTCA CCGGCCGGGA ATCGCCACTG CGGACGCTTG TGGACGCGGC CGACGATTCG CGGGTCATCG TCGTCGACGG GATGGCCGGG ATCGGCAAGA CCACGCTGGC GGTGCACGCG GCGCGGCGGC TGGCCGAGCG CTACCCCGAC GGTCAGCTCT ACCTGAATCT GCACAGTTTC ACCGACTCGG TGCCGCCGAT GGCTCCCGCC GAGGCGTTGT CGGCACTGCT GGATTCGCTG GGAGTGCCAC GAAACGCCAT CCCGGAGAGT GTCGACGCGC GGGCCGCCAA GTTCCGGTCC ATGCTGGCCG GACGGCGAGT ACTCCTGTTG CTCGACAACG CCCGCGACGA GGCCCAGCTG TCGCCGTTGC TGCCGGGTGA TTCGGGGTGC CTGACCATCA TCACCAGCCG CAGACGGCTG TCCGGTCTGG ACGACATCCG GCCGATCTCG TTGGAGCCGC TGGACTTGGA ACCCTCGGCG CGGCTGTTCG CCGCCGCCGC GGGGATTGAC GATCTCAACG ACGACGACCG GGCCGCGATC GACCGCGTCG TGGAACTGTG CGGCGGTCTG CCGCTGGCGA TCAGGATCGC GGCGGCCCGG CTGCGCAGCC GTCCCACGTG GTCGGCGGCC GATCTGCTGG AGCGGTTGTC CAAGGACTAC CGCCTGCTGG ACGAACTGGC GGCCGGTTCC CGCAGTGTCG CCTCCACCCT CGGACTGTCC TATCGCGAGT TGACTGACGG GCAGCGTCGC CTGTTCCGGC TGCTGGCACT GTGTCCCGGC AGCGATTTCG ACGCGGCCAC CGCCGCCGCG CTGGCGGGTG CGCCGGTCGA CGCATTGCTC ACCGACCTGG ACGCCCTGGT GGACGTCAGC CTGGTCGACG CCGAACCCGG TGGCCGCTAC CGGATGCACG ACCTCATCCG CCGGTTCGCC GCCGAGGCGC TGGCCCGCGA CGAACGCGAC GTCCTCGCTC CGGTCGGCCG GTTGCGCGAC CACTACCTGC ATCACGCTCA CGCCGCCATC AAGGTGTTGG ATCCCGACAT CGCGCGGTTG CCGGACCTGC CGCCACCGCC GCCGGGCATC ACGCCACCGC TCCTGGACGG GCACGCGGCG GCGCTGTCCT GGTTCGCCGC CGAGGAAACG GCGCTGCTGA GTCTTCTGTC CACATCGGTG GCCGACGCGC CCGGTCTCGT CCTCGATCTG GCGGCCTGTG TGCTGCCCTA CCTGCGCGAC CATTCCCCCT CGACCGAACA GCCCGCGGTC GCGTCGCTGG CGGTGCGGGC CTCCCGCTCC AAAGGCGACA CCCAGCGGCA GGCGGTATGG CTCAATCTGC TGGGCAACGC GCATCTGACG GCGGCCCGGT TCGCACCGGC GGTGGACTGC TATACCGAGG CCCTGGAGAT TCACGACAGT ATCGGCAACG TGTCCGGCGC GGCCTCGGTT CACGGCAATC TCGGTGTCGT CCACAAGGAA CTCGGCAACT ACCAGCGGTC ACTGTCGCAC CTGGAACGGG CGGCGGAGCT GGCGGCGACG GCCGGCGACA CCGTGTCGCT GGCCATCGCC GAGTCCAACG CCTGCGAAGT CCATGTGCGG TTGGGAAACC CGTCCCGGGG AAGGGAACTG GCGGAATCGG CGATGGAACG GTTCCGGGAG CTCGACCGGC CGCTACTGCT GGCCCAGACC CTCGACAATC TGGCGATGGC GTATCTCTCC GAGGGACGAC TGGCCGACGC CCGCCGCGTC GAGGAGGAGG CCGTCGACTA CGGACGACGC CACGACGCCG TCGAGGTACT GGTCCAGGCC CTGAACCGAC TGGGTGCCAT CCTGCGGGAG CAGAACGAAC TGCCGGACGC GCTTGAGCGG CACCGGCAGG CGCTGGCACT GCTGTCGCCG GACGCGCGGC CCGTCCTGGA GACCGGCATC CGCTGCGAGT ACGGCCGGAC CCTGCTGGCC TGCGGTGACC CCGAGGCGGC GCTGGCGCAG TTTCGGCAGG CGGCGGAGCT GGCCAGCCAG GCCGGGCAGC GCTACGAACT GGCCCTGGCG CGGCACGGTA TCGCCGACGC CCTCCGCGCC AGACCGACTG CCTGA
|
Protein sequence | MLIRLLGSVE VAGEHGWVRA GPSKQSCVLA ALAMTPGQPV ATAALLQRVW GDDPPDKVFS TLYSYLARLR RLLQHDGVAL TRAGNDGYQL VIEPEDIDLI AMRRLVTRAR EAARAGDHET AVRDYREACS LWTGEALATV DGHWASQTRE ALRREQLAVL SALFDSELAL GYHESVIPEL EELVGRHPLV ESLVAQLMLA LYRADRPSDA LARYAETRKL LRERQGAEPV ERLRRLHKRI LSHDPELRHV TDSPAVVSGR ETPAQLPADT TAFTGRESPL RTLVDAADDS RVIVVDGMAG IGKTTLAVHA ARRLAERYPD GQLYLNLHSF TDSVPPMAPA EALSALLDSL GVPRNAIPES VDARAAKFRS MLAGRRVLLL LDNARDEAQL SPLLPGDSGC LTIITSRRRL SGLDDIRPIS LEPLDLEPSA RLFAAAAGID DLNDDDRAAI DRVVELCGGL PLAIRIAAAR LRSRPTWSAA DLLERLSKDY RLLDELAAGS RSVASTLGLS YRELTDGQRR LFRLLALCPG SDFDAATAAA LAGAPVDALL TDLDALVDVS LVDAEPGGRY RMHDLIRRFA AEALARDERD VLAPVGRLRD HYLHHAHAAI KVLDPDIARL PDLPPPPPGI TPPLLDGHAA ALSWFAAEET ALLSLLSTSV ADAPGLVLDL AACVLPYLRD HSPSTEQPAV ASLAVRASRS KGDTQRQAVW LNLLGNAHLT AARFAPAVDC YTEALEIHDS IGNVSGAASV HGNLGVVHKE LGNYQRSLSH LERAAELAAT AGDTVSLAIA ESNACEVHVR LGNPSRGREL AESAMERFRE LDRPLLLAQT LDNLAMAYLS EGRLADARRV EEEAVDYGRR HDAVEVLVQA LNRLGAILRE QNELPDALER HRQALALLSP DARPVLETGI RCEYGRTLLA CGDPEAALAQ FRQAAELASQ AGQRYELALA RHGIADALRA RPTA
|
| |