Gene Information |
Locus tag | Snas_0758 |
Symbol | |
ID | 8881942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 800882 |
End bp | 802213 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003509563 |
Protein GI | 291298285 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCGTC GAACATTGAC CCGAACCACT GCCCTGGTCG TGACCGCCGT TCTCGCGGGC GGTCTGGTGG CCTGCTCCGG TTCCGCGGCC GAGGTCCCGC AGGACGAGAA CGCGCGACTG GACATCTGGA CCCGCCGACC ACCCGGCGAC CCCTCCGAAC AGGTCTCCAA GGACCTCGCC AAGGCCTTCA CGAAGAAGAC CGGCATCCCC ACCAAGGTGA CCGCGATCTT CGACGACTTC GAGACCAAAC TCCAGCAGGC CGCCTCCCAG AAGGACCTGC CGGACATCGT CATCAACGAC ACCGCCCAGC TGGGCACCCT CGTCGACCAG GGCATCGTCC GCGAGGTCGA CCCCGGCGAG GTGAGCCGAA CCAAGGACAT CTCCGACAAG GCCTGGGACG CCACCAAAGC CTTCGACGGC AAGCACTACG CGGTGCCGTT CTCGGCACAG GCGTTCGCGC TGTTCATCCG CTCCGACTGG CGAAAGGCCG TCGGCGCCGA GGTGCCGAAG ACCTGGGACG ACCTGGACGC GCTGGCCAAG AAGTTCACCA AGGACGACCC GGACGGCAAC GGTGAGGACG ACACCTACGG CTGGGTCGTG CCCGGCTCCA CCAAACGCGG CTACGCCTCC TGGTACCTGT CCAGCTTCCT GTGGTCGTCC GGCGCCGACT ACTTCTCCGG CTCCGGGACG AAACTGGCCC CCGCCATCGA CAGCCCCGAG GCCGCCGCGA CCCTGTCCTG GTTCCAGAAG TCGTTCTGCG ACAAGACAGT GGTGCCCGGA TCCGAGACCG CCGAGACCGC CGCCGCCCAC CCCTACTTCG ACTCCGGGAC CGGCGGCATG TACCTGACCG GCCCCTACAA CATGGCCCGC TTCGACGAGG CGGTCGGCGA GGGCAAGGTC GAGGTGGTGC CGCTGCCCGC CGGCCCCGGC GGCGAGGCCA CGGCACTGGC CGAAGGCGAG AACACCTACC TGATGGCCGG TTCGGCCAAT GAGGCCGGAC AGCTGAAGTT CGCCGAGTTC GCCGCCTCCC CCGCCGGGCA GAAGATCGGC ATGAACGGCG ACGAGAACGG CAGCATCGTG CGACTGCCGA TCAACACGAC CGTCGACATG GCCGCCGAGC GCAAGGACGA ACGCTGGCAG ACCTTCAACG ACATCTACAC CGACGCCAGC CGTTACGTGC CGACGGTTCC CGACTGGACC CCGTTCCTGC AGTCCTCCGC GAAGACCTTC AACGCGGTGG CCTCCGACTG CGAGGCCGAC CCCGAGTCCG AACTGGAGGC ACTGGCGAAG GACTTCGCCA CCGAGCTCGA ATCCCAGCGG GCGGGCGGCT GA
|
Protein sequence | MFRRTLTRTT ALVVTAVLAG GLVACSGSAA EVPQDENARL DIWTRRPPGD PSEQVSKDLA KAFTKKTGIP TKVTAIFDDF ETKLQQAASQ KDLPDIVIND TAQLGTLVDQ GIVREVDPGE VSRTKDISDK AWDATKAFDG KHYAVPFSAQ AFALFIRSDW RKAVGAEVPK TWDDLDALAK KFTKDDPDGN GEDDTYGWVV PGSTKRGYAS WYLSSFLWSS GADYFSGSGT KLAPAIDSPE AAATLSWFQK SFCDKTVVPG SETAETAAAH PYFDSGTGGM YLTGPYNMAR FDEAVGEGKV EVVPLPAGPG GEATALAEGE NTYLMAGSAN EAGQLKFAEF AASPAGQKIG MNGDENGSIV RLPINTTVDM AAERKDERWQ TFNDIYTDAS RYVPTVPDWT PFLQSSAKTF NAVASDCEAD PESELEALAK DFATELESQR AGG
|
| |
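
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the figures listed above) and that the CDS ends in a single untranslated stop codon. The variable names and the `gc_percent` helper are illustrative and not part of the source database.

```python
# Values copied from the record above.
start_bp = 800882
end_bp = 802213

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1332
print(protein_length)  # 443


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace.

    Hypothetical helper; the record's sequences are printed in
    space-separated blocks of ten, so whitespace is stripped first.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)
```

Passing the full string from the "Gene sequence" row to `gc_percent` gives a value to compare with the "GC content" field.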