Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_5507 |
Symbol | |
ID | 8886721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 5847233 |
End bp | 5848981 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003514231 |
Protein GI | 291302953 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.553842 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.243799 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAGT CACTGGCGAC AGTCGGTGTA CTCGCCTTGA TGGTGAGCAC CGTCGCCGCC TGTAGCTCGA ATGAAGGCGG GAAGGACGAA GGCACCAAGG ACTTCGAGGT CAAGCCCGCT GTCATCAGCA AGGATGCCAA GGATTCCGAG GGTCCGGCCA GCGAGGTCAA GGGTGCCCAG ACAGGCGGGG AGGCCACCTA TCTGGCCCCC ACGACCTTCG ACCACCTTGA CCCTCGTCAG ACCTACTACG TCAACACCCT TGAGATCGGC CGCCTGTTCT CCCGTCAGCT GACCAGCTAC CGGGTGATGG GCGAGGAGAC CAAGGTCGTC GGCGACCTCG CCACCGGTCC CGGTAAGGAC CTCGGCGACT GCAAGGCCTG GGAATACGAG CTCAAGGACG GCCTGAAGTA CGAGGACGGT TCGCCGATCA AGGCCGACGA CATCGCCTAC GCGATCTCCT CGACCTTCGA CAGCCGACTG CAGGACGGTC CCTCGTCCTA CTTCCGTGGC TGGCTGAAGG GTGCCGAGAA GTACAAGGGC CCGTTCAAGG ACAAGGGTTC CCGCGCTCCG GGCATCAAGG TCGACGGTGA CAACAAGATC ACCTTCGAGC TGAGCTCCCC GCACTGCGAC CTGCCGTACA TGGCGGCCAT GAGCGTCACT TCTCCGCTGC CGGAGAAGAA GGAGGCCAAG AACCCGGCCG ACTACGACTT CAAGCCGTTC TCCTCGGGCC CGTACAAGTT CGAGGGCAAG TGGAGCGAGA ACAAGGGCGT CACCCTCGTC AAGAACGAGA ACTGGGACCC CAAGACCGAC CCGATCCGTC ACCAGTACGT CGACACCTTC AAGGTGAACT TCGGTGACAA CCACAAGGCC ACCACTGACG CGCTGCGTGC CGACAAGGGC GCGGACGCCA CGTCGATGAC CGACACGGTC GACATCAACC AGGTCCCTGA GATCGTCAAG GACAAGGAAC TGATGAAGCG GGTCGAGAAC GTCCCGGGCA TCTTCGTGTA CTGGATGGGC ATCAACAACA TGAAGATCAA GGACCCCGAC GTCCGCAAGG CGCTGGCGTA CGCGGTCGAC AAGGAGGCCA TCGTCAAGGC CACGGGTGGA TCGAGCCAGG CCACGCCCGC CTCGACGACC CTGAGCCCGA CCGTCGCCGG TTACGAAGAC CAGATGGACA TGTACAAGGG TCCGAAGGGC GACAAGAAGA AGGCCAAGGA GCTGCTCAAG GGCAAGGACG TCAAGTCGCT GACGTACGCC TACCGTGCCA GCCCCGCCAA CAAGAAGATC GCCTCCTCGC TGCAGGACCA GCTCAAAGAG GTCGGCATCG AGCTGAAGAT CAAGGAGCTG AGCGAGACCG AGGCTCCGTC GATCCTGAGC GACCCGCAGG AGAACAAGTA CGACCTGTAC ATGAAGAACT GGGGTGCTGA CTGGCCCACC GGTTACAGCG TGCTGCAGCC GATCTACGAC GGCCGCACCA TCACTGACGA CCCGGGCAAC GTCAACAACA TCTGGTTCGA CGAAAAAGAG GTCAACGACC AGATCGACAA GGTCATGAAC ATGACCGACC CCGAGGAGCA GAACAAGGCC TACATGGATC TCGACAAGAA GATCCTGGAG GAGTACATGC CGATGGTCCC GCTGTACTAC AGCAAGACCT TCGCGATGCA CGGTTCCAAG GTGGGCGGTC TCTACTCGAC CAACACCACT GGTACCACCT CGTTCACCGA CGTCTTCGTC AAGTCGTAA
|
Protein sequence | MRKSLATVGV LALMVSTVAA CSSNEGGKDE GTKDFEVKPA VISKDAKDSE GPASEVKGAQ TGGEATYLAP TTFDHLDPRQ TYYVNTLEIG RLFSRQLTSY RVMGEETKVV GDLATGPGKD LGDCKAWEYE LKDGLKYEDG SPIKADDIAY AISSTFDSRL QDGPSSYFRG WLKGAEKYKG PFKDKGSRAP GIKVDGDNKI TFELSSPHCD LPYMAAMSVT SPLPEKKEAK NPADYDFKPF SSGPYKFEGK WSENKGVTLV KNENWDPKTD PIRHQYVDTF KVNFGDNHKA TTDALRADKG ADATSMTDTV DINQVPEIVK DKELMKRVEN VPGIFVYWMG INNMKIKDPD VRKALAYAVD KEAIVKATGG SSQATPASTT LSPTVAGYED QMDMYKGPKG DKKKAKELLK GKDVKSLTYA YRASPANKKI ASSLQDQLKE VGIELKIKEL SETEAPSILS DPQENKYDLY MKNWGADWPT GYSVLQPIYD GRTITDDPGN VNNIWFDEKE VNDQIDKVMN MTDPEEQNKA YMDLDKKILE EYMPMVPLYY SKTFAMHGSK VGGLYSTNTT GTTSFTDVFV KS
|
| |