Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_0882 |
Symbol | |
ID | 8882066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 931461 |
End bp | 932720 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003509686 |
Protein GI | 291298408 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.632389 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACGAC GCGGTTTCGT AGCGCTGGCG GCGGCGCCCG CGCTGGCCCC GCTGCTGGCC TCGTGTGGTG GCTCGGAATC CGACGCGAAG AAGGTCGAGA TCTTCAGTTG GTGGTCCGGG CCCGGCGAGA AAGAGGGCCT CGAAAAACTC ATCAAGATGT ACGAGGACGA CAACTCCGGC GTGAAGGTCG TCAACAACGG CATCGCGGGG GGCGGCGGCA GTAAGGCCAA GGACCAGCTC GCCACCCGGT TGAAGAACCA GGACCCGCCG GACTCGTTCC AGGGCCACGC CGGAGCCGAA CTCTACGACT ACATCGACAA CAAGGTCCTC GAGGACATCA CGAAGTTCAT CAAGTCCGAG AAACTCGACG GCGTGCTGCA CCCCGAGATC TTTAAGGGCG TGACGGTCGA CGGCAAGACC TACTCGGTCC CGGTGAACGT GCACCGCGCG AACCTGATGT GGTACAACCC GAAGGTGCTC GAAGAGGCCA AACTGGAGCC TCCGAAGTCC TGGACCGAGC TGATCGAACA GAACAAGAAG CTCAAGAAGG ACGACAAGAT CACCCTCGCC GTTGGTCCGC TGTGGACGCA GATGCAGCTC ATGGAGACCG TGCTGCTCGG CGAGTTGAAG GCCGAGGCGT ATACCGGTCT GTGGGACGGC AACACCGACT GGGCCTCGTC CGAAGTCATC GACGCGCTCG ACCTGTTCAC CAAGGTCCTG GAGGTCACCG ACCTCAAGAG CGCCTCGGAC GACTGGCAGC CGCAGCTGGA CAAGATGATG AAGGGCACCG CGGCCTACGC GGTCATGGGC GACTGGGTGT ACTCCTACCT CACCTCGTCC AAGAAGAAGG AGTACGACAA GGACTACAAG GTCGTCGTAA CACCCGGTTC CGAGGGAGTC TTCGACTACC TGGCCGACTC CTTCACCCTG CCGGTCGGCG CCCCGCACAA GGCGAACGCC GAAGCCTGGC TGAAGATCTG CGGCTCCAAG GAGGGCCAGA CGGTCTTCAA CCAGACCAAG GGCTCGCTGC CGGCTCGCAC CGACATCGAG GAATCCGAGT TCACCGGTTA CCTGGCCTGG AACTACAAGC AGTGGAAGGA CGAGAAGACC ACCGTGGTCG GCTCGCTCGC CCACGGCGCC GTGGTGCGTC CGACCTGGAT GACCGAGATC GAGACCCTCC TGGGTTCCTT TGTCGACGAC GGCGACTCCA AGAAGTTCGC CGACAAGATG ATGTCGACCT ACGAGAAGAC CAAGAAATAG
|
Protein sequence | MKRRGFVALA AAPALAPLLA SCGGSESDAK KVEIFSWWSG PGEKEGLEKL IKMYEDDNSG VKVVNNGIAG GGGSKAKDQL ATRLKNQDPP DSFQGHAGAE LYDYIDNKVL EDITKFIKSE KLDGVLHPEI FKGVTVDGKT YSVPVNVHRA NLMWYNPKVL EEAKLEPPKS WTELIEQNKK LKKDDKITLA VGPLWTQMQL METVLLGELK AEAYTGLWDG NTDWASSEVI DALDLFTKVL EVTDLKSASD DWQPQLDKMM KGTAAYAVMG DWVYSYLTSS KKKEYDKDYK VVVTPGSEGV FDYLADSFTL PVGAPHKANA EAWLKICGSK EGQTVFNQTK GSLPARTDIE ESEFTGYLAW NYKQWKDEKT TVVGSLAHGA VVRPTWMTEI ETLLGSFVDD GDSKKFADKM MSTYEKTKK
|
| |