Gene Information |
Locus tag | Snas_0203 |
Symbol | |
ID | 8881381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 217238 |
End bp | 218563 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003509015 |
Protein GI | 291297737 |
COG category | |
COG ID | |
TIGRFAM ID | |
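The coordinate and length fields in this table can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon. The helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the Gene Information table for Snas_0203.
START_BP = 217238
END_BP = 218563
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming a complete CDS with one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the listed Gene Length and Protein Length values.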
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.346278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.123927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTTC CCGACTCCGG CCCCTTTTCC CGCCGCGCCC TGCTCGGCCT CGCCGCCGGA TCCACCGCCG CCATCGCCCT GTCGGCCTGC GGCGGCGGCT CCGACACCGC CGAGGACGGC AGCCAGGGCG GCACCAAGTA CGACGGCCCC AAAGTGGACC TCGACTTCTG GAACGGCTTC ACCGGTGGCG ACGGCCCCAT CATGAAGCAG CTGGTCAAGG ACTTCAACGC CGAGCACGAC AACATCAAGG TCAAGATGAC CACCTACGAG TGGGAGTCGT ACTACGAGAA GGTGCCCGCC GCCGTGCGCA GCGGCAAGGC GCCCGACATC GGCATCATGC ACGTCGACAG CCTGGCCACC AACGCCGCCC GCGGCGTGAT CCTGCCGCTC GACGACGTCG CCGACGCCCT GAAACTGTCC AAAGGTGACT TCGTCGAGCC GGTGTGGAAC GCCGGTGTCT ACGACAAGAA GCGCTACGGC ATCCCGCTGG ACGTCCACCC CGAGGGCAAC TTCTACAACA AGAAGCTGCT CGACGAGGCC GGACTCGACC CGGACAATCC GCCCGCCACC GGCGACGACT ACGCCGACGC CCTCGACAAG CTCAAGAAGG CCAAGATCAA GGGCATGTGG ATGACGCCGT TCCCGTTCAC CGGCTCCCAC ACCTTCCAGT CGCTGCTGTG GCAGTTCGGC GGCGACCTGT TCAGCTCCGA CGCCAAGGAC CCCGCCTTCG CCGAGGACGC GGGCGTCAAG GCGCTGACCT GGATGGTCGA CCTGGTCAAG GACGGCCACA GCCCCAAGGA CGTCGGCCAG GACGCCGACG CGGTGGCGTT CCAGAACGGC AAGACCGCTT TCAACTGGAA CGGCATCTGG AGCATCAACA CCTTCAACGA CGTCGACGGC CTCGAATGGG GCGTGGCGCC GCTGCCGCAG ATCGGTGAGC AAAAGGCCGC CTGGGCCGGT TCCCACAACT TCGTGCTGCT CAAGCAGCGC ACCGTCGACA CCAACAAGCA GGCCGCGTCC AAGGTGTTCG TCAACTGGAT CAGCGGCAAG TCGGTGGAAT GGGCCAAGGG CGGGCAGGTC CCGGCCCGCA ACAGCGTCCG CGATTCCAAG GAGTTCGGCA AGCTCACCGA GCAGTCGGTG TTCGCCGAGC AGGTCGACTA CCTGCACTTC CCGCCCGCCG TGCCGGGCAT CGGCGACGCG ATGCCGCAGG TCGACAAGGC CGTCAACCAG GCGGTGCTGC TGAAGAAGAA GCCCGCCGAC GCGCTGGCCG ACGCGGCCGA CAAGGCGGCC AAGATCCTGG CCGAGAACCG GAAGAAGTAC GGCTGA
|
Protein sequence | MPLPDSGPFS RRALLGLAAG STAAIALSAC GGGSDTAEDG SQGGTKYDGP KVDLDFWNGF TGGDGPIMKQ LVKDFNAEHD NIKVKMTTYE WESYYEKVPA AVRSGKAPDI GIMHVDSLAT NAARGVILPL DDVADALKLS KGDFVEPVWN AGVYDKKRYG IPLDVHPEGN FYNKKLLDEA GLDPDNPPAT GDDYADALDK LKKAKIKGMW MTPFPFTGSH TFQSLLWQFG GDLFSSDAKD PAFAEDAGVK ALTWMVDLVK DGHSPKDVGQ DADAVAFQNG KTAFNWNGIW SINTFNDVDG LEWGVAPLPQ IGEQKAAWAG SHNFVLLKQR TVDTNKQAAS KVFVNWISGK SVEWAKGGQV PARNSVRDSK EFGKLTEQSV FAEQVDYLHF PPAVPGIGDA MPQVDKAVNQ AVLLKKKPAD ALADAADKAA KILAENRKKY G