Gene Information |
Locus tag | Snas_1828 |
Symbol | |
ID | 8883019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 1918274 |
End bp | 1919641 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003510617 |
Protein GI | 291299339 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
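The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that includes its stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(1918274, 1919641)
print(length_bp, protein_length(length_bp))  # prints: 1368 455
```

Both results match the Gene Length (1368 bp) and Protein Length (455 aa) rows.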
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.218215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGTG CACGTGCCCT CCTCGCCCCC GCGCTGATCG CCGCCCTTCT CGGTGGAATG ACAGCCTGCG CCGGTGAAGA GAAGGACCCC GGTGAGATCG ACGTATGGAT CGGATTCGTC GACCACCGTC TGGACTGGAT GAAGGACCGC GCCAAGGAGT TCGAGGACGA GCACCCCGGA TACAAGGTCA ACATAACCCC GTACAAGGAC TACCCGACGC TGTGGGACAA ACTGACCGCC GCGGCCGAGC AGGGTGAGCC GCCGACGATC GCGCAGAACT TCGAGGCCGC GACCCAGGAG TCGCGCGACG CCGTGAACAG CGAGGGGGAG CCGCTGTTCG CCTCGGTGGA GAAGGAGATC GACGGGCGCA AGGAGATCCT CGGTGAGAAG GTGGTGCTCG ACGACGTCAT CGACGCCACC CGCAACTACT ACACTTTGGA CGGCGAGTTC GCGTCGATGC CGTGGAACAC CTCCACCCCG GTGTTCTACT CCAACACCGA CATCCTGAAG AAGGCCAAGA TCAAGGAGGC CCCCAAGACC TGGGAGGACC TCCAGGCCGC CTGCGACAAG ATCGACAAGA TGAAGGACGG CCCCAAGAAC TGCATCACCT GGCCCAACCA GGCGTGGTTC CTGGAGCAGC CGCTGGCCGA GCAGGGCGGG CTGTTGGTCA ACAAGGACAA CGGCCGTTCC GGCCGGGCCA CCAAGATCGA CCTGACCAGT GACAAGTTCC TGGCCTGGGC GAAGGTGTGG GCCGACATGT CGAAGAAGAA GCAGTACTCG TACTCCGGCA AGCAGGAGGA CTGGATCACC CCGACCAAGA ACTTCACCGG TCAGGAGGTC GCGTTCATGA TGACCTCCTC GGCGGAGGCC TCGGTGGTCG CCAAGCAGGC CAAGGAATCC GACTTCGGCT TCGAGGTCAC CAAGATGCCG CTGAAGAAGG GAGCCCCTTA CTCGGGCAAC TTCATCGGCG GCGCGACACT GTGGATGACC GCGGGCCTGG AGAAGAAGAC CTCCGACGGC GCGCTGGCCT TCATGCAGTA CATCAACAAC CCCGAGAACG CCGCCGACTG GCACAAGATC ACCGGCTACG TCCCCGTGAC CAAGAGCGCC GAGGAACTCC TGGAGAAGGA GAAGTGGTTC GACGACAACC CGCACCACAA GGTCGCCATC GAGCAGCTGG CCGCCACCGA CGGCTCCCCG GCCGCCACCG GACCGATCGT CGGCAACTTC GTGGCCATCC GCAAGGAGAT GCAACAGGCC ATGGAGGACA TCATGAACAA CGGTGATGAT CCGGCCGAAC GGTTCAAGGA AGCCGAGAAG GCCTGCCAGA AGCTCCTGGA CGACTACAAC GAGCTCAGCG CGGGCTGA
|
Protein sequence | MRRARALLAP ALIAALLGGM TACAGEEKDP GEIDVWIGFV DHRLDWMKDR AKEFEDEHPG YKVNITPYKD YPTLWDKLTA AAEQGEPPTI AQNFEAATQE SRDAVNSEGE PLFASVEKEI DGRKEILGEK VVLDDVIDAT RNYYTLDGEF ASMPWNTSTP VFYSNTDILK KAKIKEAPKT WEDLQAACDK IDKMKDGPKN CITWPNQAWF LEQPLAEQGG LLVNKDNGRS GRATKIDLTS DKFLAWAKVW ADMSKKKQYS YSGKQEDWIT PTKNFTGQEV AFMMTSSAEA SVVAKQAKES DFGFEVTKMP LKKGAPYSGN FIGGATLWMT AGLEKKTSDG ALAFMQYINN PENAADWHKI TGYVPVTKSA EELLEKEKWF DDNPHHKVAI EQLAATDGSP AATGPIVGNF VAIRKEMQQA MEDIMNNGDD PAERFKEAEK ACQKLLDDYN ELSAG
|
| |
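The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced for illustration only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a + strand CDS codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the Gene sequence row.
prefix = "ATGAGACGTG CACGTGCCCT CCTCGCCCCC"
print(translate(prefix))  # prints: MRRARALLAP
```

The output matches the first block of the Protein sequence row. Passing the full Gene sequence row to `translate` and `gc_percent` should allow the same comparison against the complete protein sequence and the reported GC content.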