Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_1041 |
Symbol | |
ID | 8882226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 1101290 |
End bp | 1102540 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003509844 |
Protein GI | 291298566 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.172945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.776305 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGAATC CGACGCGTCG ATTGTGGTGT GCCGCGCTGG CCTGCGTCAC CGCGGCCGGG CTGCTGGGGG GCTGCGGCAG CGAGGCCAGC GCGCGGGTCG TCAACCTCTA CAATGCCCCG CAACAGAACC TGTCCAAGAT CGTCGAACGC TGCAACCGGC TCGCCGACGG TGACTACAAG ATCGTCCTCA ACACCCTGCC GCGCGACGCC GACGGGCAGC GCGAGCAGAT GGTGCGGCGG CTGGCCGCCG AGGACACCGG CATGGACGTG CTGGGCATCG ACATCACCTG GACCGCCGAG ATGGCCAGCG CCAAGTGGAT CCTGCCGTGG AAGGGCGAGC ACGCGGCCCA GGCCAAGGCC GGGGTCGCCA AGGCGCCGTT GAAGACCGCC ATGTTCGAGG ACCGGATGTA CGCCGCGCCG TCCAACACGA ACGTGCAGCT GCTGTGGTAC CGCTCCGACC TGATGCCCGA ACCGGCGCAG ACCTGGGAGC AGCTGATCGG CGTCGCCAAG AAGCTGAAGC AGGAGGACAA GGCGCACTAC CTGGAGGTCA CCGGCGCCCA GTACGAGGGC CTGGTGGTGT GGTTCAACTC GATGGTGGCC GCCGCGGGCG GCAGCATCCT CAACGCCGAC GGCGACAAGG TGGAACTGGG CGAACCGGCC GTGAAGGCGC TCAAGACGAT GCGGACCTTC GCCCGGTCGG ACGCCGCCGA CCCGTCGCTG TCCAACACCC AGGAGGACAC CGCCCGGCTG GCGGTGGAGA GCGGCTCGGG ATTCGCGGAA CTGAACTGGC CGTTCGTGTA CGCGGCCATG CAGGCCAGCG GCAAGGACTT CGCGAAGGAC TTCAGGTGGG CGCCGTATCC GGGCATCGAC GGGCCCGGCA ACGCGCCGCT GGGCGGCTCC AACTTCGCCA TCAGCAAGTA CTCCACCAAC CGCTCCGAGG CCTTCGACGC GGCGCTGTGC CTGCGCGACA AGGAATCCCA GCGCATGTCG GCGGTACTGG ACGGACTGCC GCCCACGATC GAGTCGGTGT ACGACGCCAA GGGCATGGCC AAGGCTTACC CGATGCGGGG CGAGATCCTC AAGGCGCTGG AAACCGCGGT GCCGCGTCCG GTCACCCCGG TGTACCAGAA CGTGTCCACG GTGACGTCGA AGTACCTGTC GCCGCCGTCC TCGATCCAAC CGGTGGAGAC CGAACGCAAA CTGCGCGAAC AGCTCGTCAA GGCCCTCAAC TCCGAAGGAG TGCTGCCGTG A
|
Protein sequence | MLNPTRRLWC AALACVTAAG LLGGCGSEAS ARVVNLYNAP QQNLSKIVER CNRLADGDYK IVLNTLPRDA DGQREQMVRR LAAEDTGMDV LGIDITWTAE MASAKWILPW KGEHAAQAKA GVAKAPLKTA MFEDRMYAAP SNTNVQLLWY RSDLMPEPAQ TWEQLIGVAK KLKQEDKAHY LEVTGAQYEG LVVWFNSMVA AAGGSILNAD GDKVELGEPA VKALKTMRTF ARSDAADPSL SNTQEDTARL AVESGSGFAE LNWPFVYAAM QASGKDFAKD FRWAPYPGID GPGNAPLGGS NFAISKYSTN RSEAFDAALC LRDKESQRMS AVLDGLPPTI ESVYDAKGMA KAYPMRGEIL KALETAVPRP VTPVYQNVST VTSKYLSPPS SIQPVETERK LREQLVKALN SEGVLP
|
| |