Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_4648 |
Symbol | |
ID | 8885853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 4954745 |
End bp | 4956091 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003513384 |
Protein GI | 291302106 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000260895 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATTGCCC CCCGAAAACA ACGACGTTTG ACCGCGATCG CGGCTTCGAC ACTGACGCTT GCCCTGGCCG TGTCGGCTTG TGGCGCGCCC GAAGAGGACA GCAACCTTTC CGACAAGGAA AAGGACTGTG CCAGCTACGA GAAATACGGC GACTTCAAGG ACAAGGACGC CGAGGTCTCG ATCTACACGC CGATCACCGA CGCCGAGGGC GACGCCTACG AGAAGTCGTG GGCGTACTTC GCCGAGTGCA CCGGCATCGA CGTCAAGTAC ACCGGCAGCA ACGACTTCGA GGCCCAGATC GAGGTCAAGG TGAAGGGCGG CAAGGCCCCC GACATCGCGT TCTTCCCGCA GCCGGGTCTG ATGGCCCGCT ACAAGGACAA GATGGTCCCG GCCTCCAAGA AGCTCGTGAA GGAAGCCGAG AAGGGCTGGA GCGAGGACTG GCTGCAGTAC GGCACCTTCG ACGACGAGCT GTACGCCCAG CCGATGAGCG CCAGCCTCAA GTCGCTGGTG TGGTACTCCC CCAAGTACTT CAAGGACAAC GACCTCGAGG TCCCCGAGAC CTGGGACGAC CTGCTGGCGG TGTCCGACAA GATCGCCAAG ACCGACGTCA AGCCGTGGTG TGCCGGTATC GAGTCCGGTG AGGCCACCGG CTGGCCGGTC ACCGACTGGG TCGAGGACGC GCTGCTGCGC GAGTCCGGCG GCGAGCTGTA CGACAAGTGG GTCAACCACG AGATCGCCTT CAACGACAAG CGGATCGTCA AGGCCCTGGA CAAGGTCGGC GGCATCCTGA AGAACAAGGA CTACGTCAAC GGCGGCTACG GTGACGTCGA CTCGATGGCC AAGGTGTCCT TCGAGGAGGG CGGCCAGCCC ATCGTCAAGG GCGAGTGCGC CATGCACCGT CAGGCGTCGT TCTACGCGGC CCAGTGGCCC GAGGGCACCA AGGTCGGTCC CGACGGCGAC GTGTTCGCGT TCTTCCTGCC GGGCAACAAG GCCGAGGAGA AGCCGCTGCT GGGTGCCGGT GAGTTCGTCG CCGCCTTCCG CGACGACCCC GAGGTGCACG CGGTGCGCGA GTTCCTGGCC TCCGAGCTGT ACGTCAACGC TCGCCTGAAG ACCGGCCCGC TGGCCTTCTC GCACAAGGGT GCCGACCCGA AGAACGCCGA CAACGAGCTC AACAAGATGG TCATCGAGCT CTACCAGGAC GAGGGAGCGG AGTTCCGTTT CGACGGTTCC GACCAGATGC CCGGTCACGT CGGCTCGAAG TCCTTCTTCG TCGAGATGAC GAAGTGGATC AAGGGCAAGT CGTCGAAGGA CGCGCTCGAC GCGATCGAGA AGTCCTGGGA CGAATGA
|
Protein sequence | MIAPRKQRRL TAIAASTLTL ALAVSACGAP EEDSNLSDKE KDCASYEKYG DFKDKDAEVS IYTPITDAEG DAYEKSWAYF AECTGIDVKY TGSNDFEAQI EVKVKGGKAP DIAFFPQPGL MARYKDKMVP ASKKLVKEAE KGWSEDWLQY GTFDDELYAQ PMSASLKSLV WYSPKYFKDN DLEVPETWDD LLAVSDKIAK TDVKPWCAGI ESGEATGWPV TDWVEDALLR ESGGELYDKW VNHEIAFNDK RIVKALDKVG GILKNKDYVN GGYGDVDSMA KVSFEEGGQP IVKGECAMHR QASFYAAQWP EGTKVGPDGD VFAFFLPGNK AEEKPLLGAG EFVAAFRDDP EVHAVREFLA SELYVNARLK TGPLAFSHKG ADPKNADNEL NKMVIELYQD EGAEFRFDGS DQMPGHVGSK SFFVEMTKWI KGKSSKDALD AIEKSWDE
|
| |