Gene Information |
Locus tag | Snas_4078 |
Symbol | |
ID | 8885279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 4356898 |
End bp | 4358322 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003512823 |
Protein GI | 291301545 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
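The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in the table but are not stated by the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4356898
end_bp = 4358322

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which encodes no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1425
print(protein_length_aa)  # 474
```

Both results match the Gene Length (1425 bp) and Protein Length (474 aa) fields.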
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.305211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACA ACCAACTCAC CGGTCAGGGC CGCAACAGGC GTGATGTGCT GCGGCTCGCG GGCCTCGGGG CGGCCCTCAC CGTCAGCGCG CCGATGCTGC AAGCCTGCGG TTTCGGCGGC GGCACCCAAA GCGGTGACTC GGCGGCCAAC GACATCACCG GCAGCTTCGA CTGGAAGAAG TACGACGGCA AGACCGTCAA GCTGCTGATG AACAAACACC CCTACACCGA CGCGCTCAAG AAACACAAGG GCGAGTTCGA AAAGAAGACC GGCATCACGC TGGAGATCGA CGAGTTCCCC GAGTCGAACT ACTTCGACAA GGTCACCCTG GAGCTCCAGT CCAAACAGGG CAACTACGAC GCCTTCATGC TCGGCGCCTA CATGGTGTGG CAGTACGGCC CGCCCGGATA CCTGGAAGAC CTCGGACCGT GGATGAAGAA CGAGTCGGCC ACCCACGAGG AGTTCGAGAA GGAGGACTTC TTCCCGAACC TGCTCAAGGC CGGACAGTGG AACTTCACCA ACGGCGACCC GCTCGGCGGC AAGCAGCAGT GGATGCTGCC GTGGGGCTTC GAGACCAACG TGGTGTGTTA CCGCGAGGAC GTCTTCAAGA AACTGAAGCT CAAACCGGCC GAGGACTTCG ACGAGTTCAT CGACCTGGCC AAGACGCTGG ACAAGAAGGC GGGCGACGGC ATGTACGGCG TCGCGGTGCG CGGTTCCAAG GAATGGGCCA CCATCCACCC CGGCTTCATG ACCATGTACT CCCGGCTCGG TCTCAAGGAC TTCGACGTCA AGGACGGCAA ACTCGTCCCC ACGATGAACT CCGCCGACGC GGTGGACTTC ACCGACAAGT GGGCCAAGAT GGTGCGCGAC TCCGGCCCCA AGGGCTGGAC CTCCTACACC TGGTACCAGG CCTCCAGCGA CCTGGGCGCG GGCAAGGCCG CGATGCTGTT CGACGCCGAC TCCGCCTCGT ACTTCCAGAA CGAGGGCACC AAGGCCGCCG GAAAGCTCGC CTGGCACCCC GGACCCAAGG GGCCCAACGG TTCCCTGGAC ACCAACATGT GGATCTGGTC GCTGGCCATG AACGCCAACT CCAAGAACAA GGAAGCCGCG TGGTGGTTCC TGCAGTGGGC CACGTCCAAG GAGCACCTGA AGTACGCCGC CACCAAGGGC CAGCACATCG ACACGGTTCG CCAGTCGATC GCCGAATCGT CGGAGTACAA GGACAAGTTC GCCGACTACA CCGGCTTCCT CGACACCTTC GAAAAGGTCA TCGACGACAC CAAGATCCAG TTCACGCCGC AGAAGAACTT CTTCGACGCC ACCACCTCGT GGGCCGAGGC GCTGCAGACG ATCTACGGCG GCGAGGACGC CAAGTCCACA CTCGACGACC TCGCGTCCAA CCTTGAGTCG AGGGTCAACG ACTGA
|
Protein sequence | MNDNQLTGQG RNRRDVLRLA GLGAALTVSA PMLQACGFGG GTQSGDSAAN DITGSFDWKK YDGKTVKLLM NKHPYTDALK KHKGEFEKKT GITLEIDEFP ESNYFDKVTL ELQSKQGNYD AFMLGAYMVW QYGPPGYLED LGPWMKNESA THEEFEKEDF FPNLLKAGQW NFTNGDPLGG KQQWMLPWGF ETNVVCYRED VFKKLKLKPA EDFDEFIDLA KTLDKKAGDG MYGVAVRGSK EWATIHPGFM TMYSRLGLKD FDVKDGKLVP TMNSADAVDF TDKWAKMVRD SGPKGWTSYT WYQASSDLGA GKAAMLFDAD SASYFQNEGT KAAGKLAWHP GPKGPNGSLD TNMWIWSLAM NANSKNKEAA WWFLQWATSK EHLKYAATKG QHIDTVRQSI AESSEYKDKF ADYTGFLDTF EKVIDDTKIQ FTPQKNFFDA TTSWAEALQT IYGGEDAKST LDDLASNLES RVND
|
| |
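The relationship between the gene sequence and the protein sequence can be sketched in plain Python. Although the gene lies on the minus strand, the sequence as listed already reads in coding orientation (it starts with ATG and ends with TGA), so no reverse complement is applied here. Translation table 11 assigns the same amino acids to codons as the standard code, so a standard codon table is used. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any database tool; only the first 30 bases of the record are used as a demonstration input.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are those of the standard
# code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
prefix = "ATGAACGACA ACCAACTCAC CGGTCAGGGC"
print(translate(prefix))  # MNDNQLTGQG
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.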