Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_2293 |
Symbol | |
ID | 8883487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 2435084 |
End bp | 2436361 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003511075 |
Protein GI | 291299797 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0656119 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACCA GCACCAGAGC CCGAGCGGGC GCCGCGCTCG CCGCGCTGTC CCTCGCCCTG ACCGGATGTT CCGCGTTCGG CGTCGGCGAC GAGGACGGCG ACACCCTCAC CTTCCAGTCG CTGGCGTTCC AGGACACCAC GATCAAGGCG ACGAAGGACA TCGTGGACGC CTGGAACAAG AAGAACCCCG ACACCCCGGT GAAGATCGTG AAGGGCAGCT GGGACAACGT CCACGACCAG CTCGTCACCC AGTTCAAGGG CGGCACCGCC CCTGACATCA TCCACGACGA GTCCGCCGAC ATCATGGGCT TCGCCGAGCA GGGCTACCTG GCCGACCTAG GGCCGCATCT GAGCGACAAG GTGAAGTCCG CGGTCTCCGA CGAGGTCTGG AAGTCGGTGA CCACCGAGGA CGACAAGGTG GTCGCGGCCC CGACGCTGCT GCAGTCGTAC GTCGTGTTCG CCAACACCGA CGCGTTCGCC GACGCCGACG TCAAGGTCCC CACCGGTGAG GCCCTGGACT GGGACGACCT GCAAAAGCTC GCGAAGAAGC TCACCGCCGA CGGTGACTAC GGCGTCGGCT GGGGCATGAA GGACCCCACC GCCACGGTCA TGAACCTGGC CCTCAACTTC GACGGCACCT TCTTCTCCGG GAAGGGCGAC AAGGCGTCCA TCGACGTGGG GGACAACGAA CTCGAAGTGC CCGAACGCAT CCACTCCATG GCCTTCAAGG ACAAGTCACT CGACCCGAAG TCGCTCACCC AGAGCGGATC CGACGTCCTG CCGGGATTCT TCGACGGCAA GTACTCCATG TACGTCGCGG GCAACTACGT CGCCCAGCAG ATCGTCGAGT CCGCGCCGAA GGACTTCAAG TGGGAGGTGC TGCCGCCACT GGCCGGAACC GCCGGTGCCA GCCAGGCCGC CAACCCGCAG ACGATGTCCG TGTCCGCCGA GAGCGACCAC GTCGACGAGT CCGCCAAGTT CATCGACTTC TTCATGCGGG CCGACCACCA GGCCGCCCTG GCCGAAGGTG ACTGGCTGAT CCCGTCCTCG AAGGACGCGC GCAAGACCGT CGCCACCAAC ACCAAGGGCG CCAACGGCTG GGAGAGCGTC CTCGCCTCGG GGGACACCCT GACCGCCGCC CCGTTCCAGT CGGCGAGCAA ATACCCACAG TGGAAAGACC AGTTCGCCAC ACCCGCGCTC CAGGACTACC TGGCGGACAA GATCACCGCC AAGGAACTGA AGCGGAAGCT CGTCGACGGC TGGTCCGAAC TGGACTGA
|
Protein sequence | MRTSTRARAG AALAALSLAL TGCSAFGVGD EDGDTLTFQS LAFQDTTIKA TKDIVDAWNK KNPDTPVKIV KGSWDNVHDQ LVTQFKGGTA PDIIHDESAD IMGFAEQGYL ADLGPHLSDK VKSAVSDEVW KSVTTEDDKV VAAPTLLQSY VVFANTDAFA DADVKVPTGE ALDWDDLQKL AKKLTADGDY GVGWGMKDPT ATVMNLALNF DGTFFSGKGD KASIDVGDNE LEVPERIHSM AFKDKSLDPK SLTQSGSDVL PGFFDGKYSM YVAGNYVAQQ IVESAPKDFK WEVLPPLAGT AGASQAANPQ TMSVSAESDH VDESAKFIDF FMRADHQAAL AEGDWLIPSS KDARKTVATN TKGANGWESV LASGDTLTAA PFQSASKYPQ WKDQFATPAL QDYLADKITA KELKRKLVDG WSELD
|
| |