Gene Information |
Locus tag | Snas_4053 |
Symbol | |
ID | 8885254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 4320902 |
End bp | 4322032 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | glycoside hydrolase family 18 |
Protein accession | YP_003512798 |
Protein GI | 291301520 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
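The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; neither is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4320902, 4322032)
    print(length, protein_length(length))  # prints: 1131 376
```

Both results agree with the Gene Length and Protein Length fields in the table.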
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0226392 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAA CCCGCGTCCT CAGCCTCGCC GCGATCCCGG TACTCGCGGC CGGGCTGGCC ATCACCACCA CCTCGGCCGG ACACGCCGAC CCCAACCCGC CCGACCGGGC CGAACTGCCG AAGCACGCCC TCATCGGCTA CCTGCACTCC AGCTTCGCCA ACGGATCCGG CTACCTGCCG ATGTCCGAGG TCCCCGACGA ATGGGACATC ATCAACCTCG CCTTCGGCGA ACCCACCTCC GTCACCTCCG GCGACATCCA GTTCGACCTG TGTCCCAAGG AGGAATGTCC GAATGTGGAA ACCGAGGACG AGTTCAAGGC CGCGATCAAG GACAAGCAGG CCAAGGGAAA GAAGGTGCTG CTGTCCATCG GCGGCCAGAA CGGCCAGGTC CAGCTGACCA CCGCCGCCGC CCGGGACAAG TTCGTCGAAT CCGTGGGCGG CATCATCGAC GAGTACGGCC TGGACGGTCT CGACGTCGAC TTCGAGGGAC ACTCGCTGTA CCTCGACTCC GGCGACACCG ACTTCGAGAA CCCCAAGACC CCGGTCATCG TCAACCTGAT CGACGCCCTG GACGCGCTCA AGGCCCGCTA CGGCGACGCC TTCACCCTCA CCATGGCACC CGAGACCTTC TTCGTCCAGG TGGGACACCA GTTCTACGGC GGAGCGGGCG GCGGCGACAA CCGCACCGGC GCTTACCTTC CGGTGATCCA CGCGGTGCGC GACTACCTGA CCGTCCTGCA TGTACAGGAC TACAATTCCG GTCCCGTGAT GGGACTGGAC GGCCAGTACC ACAACATGGG CAACGCCGAC TTCCACATCG CGATGACCGA CATGGTCAAG GCGGGCTTCC CGGTGGCCAG CACCGGAAAG ACCTTCCCGG GCCTGCGCGA GGACCAGATC GGCTTCGGCG TCCCGGCCGC CACCAGCGCC GGAAACGGCC ACACCTCACC CGAGGCGGTG CAGCAGGCCC TCGGCTGTCT GGCCACAGGG GAGGACTGCG GCGGCTACGA ACTGCGCGGC GGCCCGTCAC CCGCGATCCG CGGCCTGATG ACCTGGTCGA TCAACTGGGA CAACTACTAC AAGTGGGAGT TCATGAACGC GCATGAGCCG TACCTGAACG GACTGCCGTA G
|
Protein sequence | MKRTRVLSLA AIPVLAAGLA ITTTSAGHAD PNPPDRAELP KHALIGYLHS SFANGSGYLP MSEVPDEWDI INLAFGEPTS VTSGDIQFDL CPKEECPNVE TEDEFKAAIK DKQAKGKKVL LSIGGQNGQV QLTTAAARDK FVESVGGIID EYGLDGLDVD FEGHSLYLDS GDTDFENPKT PVIVNLIDAL DALKARYGDA FTLTMAPETF FVQVGHQFYG GAGGGDNRTG AYLPVIHAVR DYLTVLHVQD YNSGPVMGLD GQYHNMGNAD FHIAMTDMVK AGFPVASTGK TFPGLREDQI GFGVPAATSA GNGHTSPEAV QQALGCLATG EDCGGYELRG GPSPAIRGLM TWSINWDNYY KWEFMNAHEP YLNGLP
|
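The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The following sketch shows one way to do that, compute a GC fraction, and translate the gene sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAG even though the strand is listed as minus), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names are hypothetical and not part of any database API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three display blocks of the gene sequence from the record.
    print(translate("ATGAAACGAA CCCGCGTCCT CAGCCTCGCC"))  # prints: MKRTRVLSLA
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.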