Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_3073 |
Symbol | |
ID | 8884272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 3242398 |
End bp | 3244173 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Glycoside hydrolase family 20, catalytic core |
Protein accession | YP_003511837 |
Protein GI | 291300559 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.221724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.79825 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCGC CGACCGAGCT CGTCCTGCTG CCCCGCCCCC GCCACCTCGA CGTCACCGGC GACGGTCCGC CCACCACCAC CGAAGCCGTC GAGCACCAGG CCACCGACCT CGAACCGCAG GGCTTCGAAC TCCACATCAG CGACACGACG ATCAACCTGC GCTACCGCGA CGACGCCGGA CTGCGCTACG GCCGCGCCAC CCTCGCCCAG CTGCGCGCCC AGTTCCCCGA CCGGCTGCCC GCCCTGTCCA TCAAGGACTG GCCCGACTTC GCCACCCGCG GCTACATGCT CGACGTCAGC CGAGGCCGGG TCCCCACCCG CGACACCCTC GAACGCATCG TCGGCCTGCT GGAACTGCTG CGCGTCAACC ACTTCCAGCT CTACACCGAG CACACCTTCG CCTACACGGC CCACGAAACC GTGTGGCGCG ACGCCAGCCC CATGACCCCC GACGACATCC GCTGGCTCGA CGAACGCTGC ACCGAAGCCG GTATCGAACT GGCCGCCAAC CAGAACTGCT TCGGCCACTT CGAACACTGG CTCAAACACG ACGCCTACCG CGACCGCGCC GAACTGGCCG AGGGCTTCGA ACTGATGGGC CGCCACCGCC CCGCCACCAC CCTCGCCCCC ACACAGGACA ACGCCGACTT CGCGCTGGAC CTGGTCCGCG AACTGGTCCC CAACTTCTCC AGCCGCCGCG TCAACATCGG CTGCGACGAA ACCTGGGAAC TGGGCCGGGG CGTATCGAAG GCGGCGGCCG ACGCCAAAGG CAAGGGCCGG GTCTACCTGG AACACCTCCA CCGCCTGGTC AACCCGCTGA TCGCCGACGG CCTCGACGTC CAGTTCTGGG GCGACATCAT CGCCAACCAC CCCGAACTGG CCACCGAACT GCCACAAGGC GCCACCGCCG TGGCCTGGTG GTACGAGGCC CCCTGGGACC CCGAAGAGCA GGAACGACTG CTGTGGGAAG TCGGCGACCG TTTCCTGGAC GCCGGGGTCG ACCTCAGCAA GAAACTGGGC GGCTTCGCGG GCGAGGCCGC ACCCTTCCTC GAATCCAGCT ACCCACTGTG GGTCGCACCC GGCACCGGCA CCTGGCGTTC CCTGCTGGGC CGACTCCGCA CCGCTTACGG CAACATGCTC GACACCGTCA ACGTCGGCCT GTCCGGCGGC GTCGAAGGCG TCCTCATCAC CGACTGGGGC GACGGCGGCC ACCCGCAACC CCCCTCAGTC ACCTTCCCGC CCCTGGCCTA CGGCGCCGCA CTCTCCTGGT GCCGCGACGC CAACCACGAC CTCAACGTCC CCGCCGTCCT CGACCACTTC GTCTTCCCCG GCACCAACAT CGGCGCGGCA CTCGAAGCCC TGGGCCACCT CAACGACCGC ACCGGCCAGA TCGCCTTCAA CTCCAGCCCC CTCCAACTGG CCCTGGAACC CAACGCCCAC CACGTCGGAG TAGGCGACCC GGACCCGAAG GCCCTGGCGG GAGTCGTCGA CGACATCGAC GCCCTCATCG CCACCGTCAA CTCCGGCAGC GGCGCCTTCG GCGACCACGA GATCGTCCAA AACGAACTCA CCGCCGCCGC CCGCCAAGCC CGCCATGGAG CCTGGCGCCT ACTGCGCCAA GCCGGAGCCC CGGCCCCCGA CACCGCCGCC ATGCGCGCCG ACCTGGCCGA AACCACCGAA CTCCACCGTC GAGCCTGGCT CTCCCGCGCC CGACCGGGTG GCATGGAAAC CTGGATGACA ACCCTGACAG CGCTGGCCAA GACCTACGAC TCCTGA
|
Protein sequence | MIAPTELVLL PRPRHLDVTG DGPPTTTEAV EHQATDLEPQ GFELHISDTT INLRYRDDAG LRYGRATLAQ LRAQFPDRLP ALSIKDWPDF ATRGYMLDVS RGRVPTRDTL ERIVGLLELL RVNHFQLYTE HTFAYTAHET VWRDASPMTP DDIRWLDERC TEAGIELAAN QNCFGHFEHW LKHDAYRDRA ELAEGFELMG RHRPATTLAP TQDNADFALD LVRELVPNFS SRRVNIGCDE TWELGRGVSK AAADAKGKGR VYLEHLHRLV NPLIADGLDV QFWGDIIANH PELATELPQG ATAVAWWYEA PWDPEEQERL LWEVGDRFLD AGVDLSKKLG GFAGEAAPFL ESSYPLWVAP GTGTWRSLLG RLRTAYGNML DTVNVGLSGG VEGVLITDWG DGGHPQPPSV TFPPLAYGAA LSWCRDANHD LNVPAVLDHF VFPGTNIGAA LEALGHLNDR TGQIAFNSSP LQLALEPNAH HVGVGDPDPK ALAGVVDDID ALIATVNSGS GAFGDHEIVQ NELTAAARQA RHGAWRLLRQ AGAPAPDTAA MRADLAETTE LHRRAWLSRA RPGGMETWMT TLTALAKTYD S
|
| |