Gene Information |
Locus tag | Snas_4693 |
Symbol | |
ID | 8885899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 5002431 |
End bp | 5003804 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003513429 |
Protein GI | 291302151 |
COG category | |
COG ID | |
TIGRFAM ID | |
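
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5002431
end_bp = 5003804

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1374 0 457
```

Both results agree with the table: 1374 bp and 457 aa.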
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.363745 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTCAG CAAAGCCACT TGGACCCGTG TTGGTCCTGT TGACCGCGAC GGTCCTCGTC TCCGGTTGTT CCGGCAGCGA CGACTACGGC GAGGACGGAC GGCTTCAGGT CGAGGTCGCC ATGGACGCGG GCCTGGAGAA GAGCGCCAAG AAGGTGCTCG ACGAACGGGT CAAGCACTTC GAGAAGGCCA ATGAGGACAT CGACATCATC CCCCAGGAGT ACACCTGGGA AGCGACCACG TTCACCGCGC AGCTCGCCGG TGACACGCTT CCCGATGTCT TCACCGCGCC GTTCACCGAC GGCCGCGGCC TGATCGAACG CAAGCAGATC GCCGACATCA GCGCGCTGGT CGCCGACCTG CCCTACGCCG ACAAGTTCAA CCCGGGGATC GCCAAGGCCG GTTCGGATGC CAAGGACCGG ATCTGGGCGG TACCGGTCTC GGCCTACGGC CAGGCGCTGC ACTACAACCG CGCCCTGTTC GACGAGGCCG GGCTCGATCC GGACAAGCCG CCCACGACCT GGAAGGAGGT CCGCGAGGCA GCCAAGAAGA TCGCCGACGA GACCGGCGAG GCCGGGTACG TCCAGATGAC CAAGGACAAC ACCGGCGGCT GGATCCTGAC CACACTGGAC AACGCCCTCG GCGGCCGGGT CGAGGAGCTC GACGGCGACA AGGCCACGTC CACCATCAAC ACACCACAGA TGGTGGAGGC GCTGGAGTTG TTGCGGGACA TGCGCTGGAA GGACGACAGC ATGGGCGACA ACTTCCTGCA CGACTGGGCC GGGTCCAACC AGGACTTCGC GGCCGGGCGG ATCGGCATGT ACATCACCGG CGGCGGCAAC TACGGGCAGC TGATGGCGCA GAACGACATC AAGCCGGACG ACTACGGCGT GACGGTGGTG CCGCTGTCGG ACTCCCCCGA CGCCGGGGTA CTGGGCGGCG GGACGCTGGC GGCGGTGAAC GCCTCCGCCA GCGAGGAGGT CAAGGCGGCG GCGGTGAAGT GGATCGACTT CTACTACATG GAGAAGCTCA CCGACGCCAA GGCCGCCAAG CTGGACGCCA AGACCACCGC CGAGTCCGGG CAGGCCGTGG GGGCGCCGCT GCTGCCGGTC TTCGACAAGA AGACCTACGA CAAGCAGCAG GAGTGGATCG CCGACTACAT CAACGTGCCG GTGGACCAGA TGAAGCCCTA CACCGACAAC ATGTTCGACC AGCCGTTGGC GACCGAACCG ACGAAGTCCA CCCAGGAGGT CTACGGCGTC ATGGACACGG TGGTGCAGTC GGTGCTGACC GAAGAGGACG CCGACATCGA CAAGCTGCTG GACACCGCCG AGAAAGAGGC GCAGGCACTG CTCGACAAGG CCGCGAAGAA GTGA
|
Protein sequence | MLSAKPLGPV LVLLTATVLV SGCSGSDDYG EDGRLQVEVA MDAGLEKSAK KVLDERVKHF EKANEDIDII PQEYTWEATT FTAQLAGDTL PDVFTAPFTD GRGLIERKQI ADISALVADL PYADKFNPGI AKAGSDAKDR IWAVPVSAYG QALHYNRALF DEAGLDPDKP PTTWKEVREA AKKIADETGE AGYVQMTKDN TGGWILTTLD NALGGRVEEL DGDKATSTIN TPQMVEALEL LRDMRWKDDS MGDNFLHDWA GSNQDFAAGR IGMYITGGGN YGQLMAQNDI KPDDYGVTVV PLSDSPDAGV LGGGTLAAVN ASASEEVKAA AVKWIDFYYM EKLTDAKAAK LDAKTTAESG QAVGAPLLPV FDKKTYDKQQ EWIADYINVP VDQMKPYTDN MFDQPLATEP TKSTQEVYGV MDTVVQSVLT EEDADIDKLL DTAEKEAQAL LDKAAKK
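
The relationship between the two sequence fields can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It assumes the spaces in the record are only 10-character display grouping, and it builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for elongation codons (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence from this record.
first_groups = "ATGCTTTCAG CAAAGCCACT TGGACCCGTG"
print(translate(first_groups))  # MLSAKPLGPV
```

The output matches the first block of the protein sequence field. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.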