Gene Information |
Locus tag | Snas_2861 |
Symbol | |
ID | 8884060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 3012016 |
End bp | 3015033 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | glycoside hydrolase family 38 |
Protein accession | YP_003511629 |
Protein GI | 291300351 |
COG category | |
COG ID | |
TIGRFAM ID | |
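
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a consistency check on the values in this record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 3012016
end_bp = 3015033
gene_length_bp = 3018
protein_length_aa = 1005

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the last codon of the CDS is a stop codon and is
# not counted as an amino acid in the protein length.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three checks agree with the record: 3018 bp is 1006 codons, that is 1005 amino acids plus one stop codon.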
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.985945 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACGACG ACCGCAAGCT CATTGAAGAC CGCCTCGAAC GCGTCCTGCG CGAACGGATC CGCCCCGCCG TGTACGGCGA GTCCACCCCG TTCACCGTGG CCGTGTGGCA CGCTCCCGGC GAACCGGTGC CGCCCGCCGA GGGCATCGCC GCGTCCTACG AGCCCTGTCA GCTCGGCGAC GAGTGGGGGC CGCCGTGGGG GACCAGCTGG TTTCGCATGT CCGGCCAGGT CCCGGCCGCG TGGGCCGGGC AGGTCGTGGA GGCCCGGATC GACCTGGGGT TCAGCAACTA CGGCTGGGGC CCGGGTTTTT CCGCCGAGGG TCTGATCTAC CGCCCCGACG GCACCGCGAT CAAGGGTCTG CACCCCGACA ACCAGTGGAT GCGCGTCAGT GACAAGGCCG AGGGCGGGGA AGAGGTGCTG TTCCACGTCG AGGCCGCCGC CAACCCCCGT ATTCAGGGAC CGGTGACGGA CCTGGGTGAC AAGGCCACCG CGCCCGACAA GTGCCTGTAC CAGCTGGACC GCGCCGAGCT GGCCGTGTTC AACCCCGAGG TGTACGAACT GGTCCACGAC TTCGAGGTGC TGGACGAGCT GATGCGGCAG CTGCCGCTGG ACCAGCCGCG CCGCTGGGAG ATCCTGCGGG TCGTCGGCAA GGCCCTGGAC GCGGTCGATC TGCGCGACGT GCCCGGCACC GCCGCCGCGG CCCGGACCGT GCTGGCCCCG GCGCTGGCAG CGCCCGCGAC AGCCTCGGCG CACCGGCTGT CGGCCATCGG CCACGCCCAC ATCGACTCGG CCTGGCTGTG GCCGCTGCGC GAGACCGTCC GCAAGGTCGC CCGCACCGCG TCCAACACCG TCGACCTCAT GGACGACCAC CCCGAGTACG TCTTCGCGAT GTCGCAGGCT CAGCAGTTGG CCTGGATCAA GGAGTATCGG CCCGAGGTGT ACGAGCGGGT CAAGGAGAAG GTCGCGTCGG GACAGTTCGT CCCGGTGGGC GGCATGTGGG TCGAGTCCGA CACCAACATG CCCGGCGGCG AGGCCATGGC CCGCCAGTTC GTCCACGGTA AACGGTTCTA CCTGGACGAG TTCGGGCTGG AGACCGAGGA GGTGTGGCTG CCGGACTCCT TCGGCTACAC CGCCGCGCTG CCGCAGCTGG TGAAGCTGTC GGGCTCGAAG TGGTTCCTGA CCCAGAAGAT CTCGTGGTCG CAGTCCAACA AGTTCCCGCA CCACACCTTC TGGTGGGAGG GGCTGGACGG TTCCCGCGTG TTCACGCACT TCCCGCCCAT CGACACCTAC AACGTCACCT TCTCCGGCGA GCAGATGGCC CACGCGGTGA CCAACTTCCG CGACAAGGGG CCCGCCAGCC GGTCGCTGGC GCCGTTCGGA CACGGCGACG GCGGTGGCGG CGCCACCCGC GAGATGCTGG CCCGCGCCAA ACGCCTGTCC AACCTGGAGG GTTCGGCGCG CGTCGACATC GAGAAGCCGT CGGTGTTCTT CGAGAAGGCC CACGCCGACT ACCCCGACGC ACCCGTGTGG GTGGGGGAGC TGTACCTGGA GCTGCACCGC GGCACCTACA CCTCGCAGGC CAAGACCAAA CAGGGCAACC GGCGCAGCGA GCACCTGCTG CGCGAGGCCG AACTGTGGGC CGCCACCGCG GCGGTGCGCG GGGACTTCGC CTACCCGCAC GCGGAGCTGG ACCGGTTGTG GAAGATCGTG CTGCTGCACC AGTTCCACGA CATCCTGCCC GGCTCGTCCA TCGCCTGGGT CCACCGGCAG GCCGCCGAGA CCTACGCGAG CGTCGCCGCC 
GAACTGGAGT CCATCATCGA CGCCGCGCAG CGGTCGCTGG CCGGTGCCGG CGACCACCGG GTCGTGTTCA ACGCCGCCCC GCACGCGCGC GGCGGGGTCG CGGCCGGTGG CGCGGGCACG GTGTCGTCCG GAGAACCGGT CGTGGTCACC GGCTCGGCCG ACGACGGCTG GAGTGTCGAC AATGGACTGA TCCGGGTCGG CGTCGACGGC CGGGGTCTGG TCACCTCGGT GGTCGACCTC GCCGAACAGC GCGAGGCGCT GCCGCCCGGC GAGGTGGCGA ACCTGCTCCA GATCCACCCC GACCTGCCCA ACCACTGGGA CGCCTGGGAC GTGGACTCGT TCTACCGCAA CACGGTCACG AACCTCACCG AGGCCGACTC GGTCGAGATC GCCTACGCGG GCCTGACCAA GGTGACCTTC GAGGTGCGTC GCTCCTTCGG CGCCTCGAGT GTCACGCAGT CGATCACGGT CAACGCCGGT TCGGCGACCG TCGACTTCGA CACCGAAGTG GACTGGCACG AGTCGGAGAC CTTCCTCAAG GCGGCGTTCC CGATCGACGT GCGTGCCGAG ACCTCGGCCG CCGAGACCCA GTTCGGTCAC GTCCGGCGAG CCACCCACAC CAACACCAGC TGGGAGAACG CCAAGTTCGA GATCTGCGCC CACCGGTTCC TGCACGTGGC CGAGCCGGGC TGGGGTGCCG CGGTCGTCAA CGACTCGACC TACGGTCACG ACGTGACCCG GGCGGTCCGT GCCGACGGCG GCACCACCAC GACGGTGCGG CTGTCGCTGC TGCGGGCGCC CCGGTTCCCC GACCCGGACA CCGACCAGGG CGTGCACCGG CTGCGTTACG GCTTCGCGCC CGGCGCCGAC ATCGCCGACG CGGTCCGCGA GGGCTACCGC ATCAACCTGC CCGAACGCGC CGTCACCGGC GGCGGGCCGG TCGAGCCGCT GGTGAGCGTC GACAACGACA AGGTCGTCGT CGAGGCGGTC AAACTCGCCG ACGACGACTC CGGTGACATC GTGGTGCGGC TGTACGAGGC GTGCGGCGGC CGCGCGAACG CGCGGCTGAC GGCCTCGGTG CCGCTGGCCT CGGCCGTGGA GACGGACCTG CTGGAGCGGG CACTGCCCGA GGGCGAGCGG ACCGTCGAGG CCGACGGCGT CGCGGTGTCG TTGCGTCCGT TCCAGATCGT GACGCTGCGG CTGGCCCGAC AGTCCTGA
|
Protein sequence | MHDDRKLIED RLERVLRERI RPAVYGESTP FTVAVWHAPG EPVPPAEGIA ASYEPCQLGD EWGPPWGTSW FRMSGQVPAA WAGQVVEARI DLGFSNYGWG PGFSAEGLIY RPDGTAIKGL HPDNQWMRVS DKAEGGEEVL FHVEAAANPR IQGPVTDLGD KATAPDKCLY QLDRAELAVF NPEVYELVHD FEVLDELMRQ LPLDQPRRWE ILRVVGKALD AVDLRDVPGT AAAARTVLAP ALAAPATASA HRLSAIGHAH IDSAWLWPLR ETVRKVARTA SNTVDLMDDH PEYVFAMSQA QQLAWIKEYR PEVYERVKEK VASGQFVPVG GMWVESDTNM PGGEAMARQF VHGKRFYLDE FGLETEEVWL PDSFGYTAAL PQLVKLSGSK WFLTQKISWS QSNKFPHHTF WWEGLDGSRV FTHFPPIDTY NVTFSGEQMA HAVTNFRDKG PASRSLAPFG HGDGGGGATR EMLARAKRLS NLEGSARVDI EKPSVFFEKA HADYPDAPVW VGELYLELHR GTYTSQAKTK QGNRRSEHLL REAELWAATA AVRGDFAYPH AELDRLWKIV LLHQFHDILP GSSIAWVHRQ AAETYASVAA ELESIIDAAQ RSLAGAGDHR VVFNAAPHAR GGVAAGGAGT VSSGEPVVVT GSADDGWSVD NGLIRVGVDG RGLVTSVVDL AEQREALPPG EVANLLQIHP DLPNHWDAWD VDSFYRNTVT NLTEADSVEI AYAGLTKVTF EVRRSFGASS VTQSITVNAG SATVDFDTEV DWHESETFLK AAFPIDVRAE TSAAETQFGH VRRATHTNTS WENAKFEICA HRFLHVAEPG WGAAVVNDST YGHDVTRAVR ADGGTTTTVR LSLLRAPRFP DPDTDQGVHR LRYGFAPGAD IADAVREGYR INLPERAVTG GGPVEPLVSV DNDKVVVEAV KLADDDSGDI VVRLYEACGG RANARLTASV PLASAVETDL LERALPEGER TVEADGVAVS LRPFQIVTLR LARQS
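
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The sketch below illustrates that behaviour on the first 30 bases of the gene sequence. It is a minimal illustration, not the pipeline used to produce this record: `translate_cds`, `gc_fraction` and `START_CODONS` are hypothetical names, and `START_CODONS` holds only a subset of the initiation codons that NCBI lists for table 11.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; first base
# varies slowest). Table 11 shares these assignments with the standard
# code and differs in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiation codons, enough for
# this illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna, start_codons=START_CODONS):
    """Translate a CDS, reading an initiation codon at position 0 as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in start_codons:
            aa = "M"  # alternative start codons still initiate with Met
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGCACGACG ACCGCAAGCT CATTGAAGAC"
print(translate_cds(prefix))  # MHDDRKLIED
```

The output matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field, although the rounding used for the reported 71% is not stated in the record.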