Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_3332 |
Symbol | |
ID | 8884531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 3525878 |
End bp | 3528955 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003512091 |
Protein GI | 291300813 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00786892 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.331119 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACA CCGCCAGCCG CAGCCGGAAC GAGCCCGACG CGACGCCCAC CCCGGCGTCC GGCGGCGACA CCGGCTTCCG GCTGCTGGGC CCACTCGAGG TGACCCACCA CGGCGAGGTC CTCGAGGTTC CGCCCGGACG GCAGCAGGCG CTGCTCGCGG CACTGCTGCT GCGGCCGGGC GAGATCGTGT CGGCACCGAC GCTCATCGAG CAGGTCTGGG GCGGCGACGC CGATCGCACC GTCCTCAACG TGTGCGTGAT GCGACTGCGA CGGTCCCTCA AGGACCCCAA CCGCGAACTG GTTCGCGCCG AGGGCAACGG TTACCGCATC GCGGCCGCAC CCGACACCGT CGATCTGCAC CGGTTCCGGA AACTGTGCGA GCGGGCCGGA ACGGCCCGCG ACCGAGGCGA CCCCGCGGCC GAACGCGACG CCCTCCACCA GGCACTCGCG TTGTGGCGCG GCGACGCCCT GTCCGGGATC GCCTCGTCGG TGCTGCAAGC CGAGGTCGTG CCGATCCTCG AAGAGGAGCG GCTGGCCGCG ACCGAACGAC GCGTGGACGC CGACCTCGAC GGCGAGGCCC ACGCCGAGGT GATCGGCGAA CTGCGCCAGC TGGTCGCCGA TCACCCGTTC CGGGAACGGT TCTGGTGCCA GCTGCTGCTG GCCCTGTACC GGTCGGGCCG CCAGTCCGAG GCGCTGGACG CCTACCGCAC GGCACGGGAA CGACTCGCCG ACGAGCTCGG CATCGAGCCG AGCCAGGAGT TGCGCGAGCT CCACCACCGC ATCCTCACTT CCGATCCGAC GCTGCTGACG CCCCGGCCCG CCGAGCCGTC GGTCCCACGT GTCGTCCCGG CCCAACTGCC CCCCGCCACA GCCGGATTCA GCGGCCGCAG AGACCAACTG CGGCAGCTGG ATCGCCTACT TGACGACACC GAACCCGCAC CAACCACATT GATCACCGGG CCGCCGGGAG CCGGTAAGAC GACCCTGGCG GTGCACTGGG CCGGTCGCCA CCGTGACCGG TGGCCCGACG GTCAGCTCTA CATCGACCTG CGCGGATACG GGCCCGAGCC GTTGGTCCAG CCGATCGAGG CCCTGGCGTA CTTCCTGCGC GCGATCGGAC TGCCTACCGA CCAGGTGCCA CCTCAGCAGG CGGAGGCGTC GGCTTTGTTC CGGTCACGGA TCGCCGGCCG CAGGCTGCTG ATCGTGCTCG ACAATGCCGC CACCGTCGAT CAAGTGCGTC CACTGCTGCC CGGCGGTGGC GACTGTCTGG CCATAGTGAC CAGCCGAGAC CGGTTGACCG GCCTGGTCGC CAAGGAGGGC GCCCGTGTCC TGAGTGTCGA CGTGATGGCC GACGCCGAAG CCCAGACACT GCTGTCCGAT GCCGTCGGTG ACGCCCGGCT CGCCGCCGAA CCCGATGCGG CCGCCGCACT GGCACGCCTG TGTGGCAACC TGCCGCTGGC GCTGCGGATC ACGGCCGCAG ACCTGATCAA CCATCCACAG CGGAGTCTCA CCCGGCACGT GGACCGGCTG CGCACCGGCA ACCGCCTGGA CGCGCTTCAG GTCGCCGACG ACGACGGTAC CGCGGTGCGA GCGGCCTTCC GATCCTCCTA CGCTCGACTG CCCGAACCAG TGAGGCGACT GTTCCGGTTG CTGGGACTGT TCCCCGGTTC GGAGATCGGC CTGGAGTCCG CCGCCTCGCT GGCGGGCGTC GAGGCCGCCG CGACCGAACC GCTGCTGGAT CGTCTGATCT CCGCCCATCT GGTGACGCCG CGGGGAGCCG AACGGTTCGC GTTGCACGAT CTGCTGCGCT TGTTCGCCCA GGAGCTGGGG CAGGAGGAGG ACGCCGACGC CGAACGCGAG GCCGCCACGC GGCGGCTCTA CGACCACTAT TCACGCGTCG CCGTCGCCGC GGCGAATCGG ATGTACTCGT TCGTGGTGAC GCTGCCGTTG TCGGAGGAGG CGGCGAGGAT CCCGGTGAGC TTCGACGACG ACACCACGGC GTCGGCATGG TTCGACACCG AACACCCCAA CCTCGTCGCA CTGGTTCGGT ACGCCGCCGA TCGCGGCGAC CACCTCGACG CCTGCCGGCT GGCCGCCACG ATGCAAAGCT ATCTGCACAG CAAGATGCAC GCCGTCGACT GGCTCAACGT CGCTCGCGCC TATCTGTTCG CCGCCCGCCA GATTGTGGAT TTCAGGGCCC AGATCCTGGC TCATCTGAGC TTGGCCAACC TGCACCGCTC CCGCGCTCAG CGTAACGCCG CCATCAAACA CTTCGAGCAG GCGCTGGCGT TGAGCCGTCA CAGCGGCTGG GTCGACGGAC AGTCCATAAC CCACAACAAT CTCAGTGGTG TCTACGACTA CGAGGGAAAA CTGCGACTGA CGGTTCATCA CCTGCATCAG GCGCTGGAAC TGCGCTCGCC AACGCGGGAC ATCACGTCGA AGGCGGTCGT TCTCAGCAAC CTCGGCAGGG CGTTGCTGCG GTTGGGGCAC GTGGAGGACG CGGTCGACCA CCTGGAACGT GCGACGAACA TTCACCAACG GCGCGGTGCC AGGGTGAGCG AGTCCCGCAG CCGCGCCAGT CTCGGCGAGG CACTGTGCGA GCTGGGACAG CACCACCGTG CGCTGGCATT GCTGAACCAC GCGGTAGCGG TGCAACGGGA GCTGGACGAC AGAATGTTCC TACCCAACAC CCTGTGCCGA CTGGCGAACG CCCACCTCGA CGTCGGCCAG ACCGATCTCG CCGCCGAACT GGTGCCGACC GCGATGGCAC TGGTACGCGA AACCGGACAT CGCCGTTCAG AGGCATGGGT CACCTACGTG ACCGCCCGGG TGCATGAGCA CACCGGCTGC CCTGATGCCG CGCTCGATGG CTACGCCCAG GCGCTACGGC TGAGCCGGGA ACGCGGTAAC GGATATCTTC AGGTGCAGGC GATGATCGGC ATGAGCTCCG CCCATCACCA ACTCGGTCAG CGGGAGTCGG CCCGCCGCTG TATCGACAAG GCCCTCGCGA GAGCTCGCCG TTTCGAGTAC GCGCTTCTGG AACGACAGGC CGAGGCCGTC CGGGTCGCCC TGAGCTGA
|
Protein sequence | MTDTASRSRN EPDATPTPAS GGDTGFRLLG PLEVTHHGEV LEVPPGRQQA LLAALLLRPG EIVSAPTLIE QVWGGDADRT VLNVCVMRLR RSLKDPNREL VRAEGNGYRI AAAPDTVDLH RFRKLCERAG TARDRGDPAA ERDALHQALA LWRGDALSGI ASSVLQAEVV PILEEERLAA TERRVDADLD GEAHAEVIGE LRQLVADHPF RERFWCQLLL ALYRSGRQSE ALDAYRTARE RLADELGIEP SQELRELHHR ILTSDPTLLT PRPAEPSVPR VVPAQLPPAT AGFSGRRDQL RQLDRLLDDT EPAPTTLITG PPGAGKTTLA VHWAGRHRDR WPDGQLYIDL RGYGPEPLVQ PIEALAYFLR AIGLPTDQVP PQQAEASALF RSRIAGRRLL IVLDNAATVD QVRPLLPGGG DCLAIVTSRD RLTGLVAKEG ARVLSVDVMA DAEAQTLLSD AVGDARLAAE PDAAAALARL CGNLPLALRI TAADLINHPQ RSLTRHVDRL RTGNRLDALQ VADDDGTAVR AAFRSSYARL PEPVRRLFRL LGLFPGSEIG LESAASLAGV EAAATEPLLD RLISAHLVTP RGAERFALHD LLRLFAQELG QEEDADAERE AATRRLYDHY SRVAVAAANR MYSFVVTLPL SEEAARIPVS FDDDTTASAW FDTEHPNLVA LVRYAADRGD HLDACRLAAT MQSYLHSKMH AVDWLNVARA YLFAARQIVD FRAQILAHLS LANLHRSRAQ RNAAIKHFEQ ALALSRHSGW VDGQSITHNN LSGVYDYEGK LRLTVHHLHQ ALELRSPTRD ITSKAVVLSN LGRALLRLGH VEDAVDHLER ATNIHQRRGA RVSESRSRAS LGEALCELGQ HHRALALLNH AVAVQRELDD RMFLPNTLCR LANAHLDVGQ TDLAAELVPT AMALVRETGH RRSEAWVTYV TARVHEHTGC PDAALDGYAQ ALRLSRERGN GYLQVQAMIG MSSAHHQLGQ RESARRCIDK ALARARRFEY ALLERQAEAV RVALS
|
| |