Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sked_14870 |
Symbol | |
ID | 8633123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sanguibacter keddieii DSM 10542 |
Kingdom | Bacteria |
Replicon accession | NC_013521 |
Strand | - |
Start bp | 1670469 |
End bp | 1673219 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_003314257 |
Protein GI | 269794802 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.44593 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0967933 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCCAC CACGACGACG CCTCCCCCTG AGGCTCGCGG CGGCAGCGAC GGCAGGCGCC GTGCTCCTCA CGCCCCTCGT CACCACCTCC GCCGTCGCAG CCGACGACCC GACGGGACCG GTCGAGGCCG GGATCGTCGT CGACAAGGTG GAGGGCCTCC CCGAGGACTT CATCAACGGC GTCGACATCT CGTCGATCCT CGCCCTCGAG AAGAGCGGCG TGACCTTCCA GGACTGGGAC GGCCAGCAGG CCGACATCTT CGAGGTGCTC GCCGACGCGG ACGTCAACTA CGTCCGCGTC CGCGTGTGGA ACGACCCCTA CGACGCCCAG GGCAACGGCT ACGGCGGCGG CGACAACGAC ATCGCGGCAG CCGTCGCGAT TGGCGAGCGC GCCACCGCGC ACGGCATGCG CCTCCTGGTC GACTTCCACT ACTCCGACTT CTGGGCCGAC CCCGCCAAGC AGCAGGCCCC CAAAGCCTGG GCCGAGATGA CCGTCGACCA GAAGGCCGAC GCCACCGAGG CGTACACCGC CGACGCCCTC ACCCAGCTGC GCGACGCGAA CGTCGACGTC GGCATGGTGC AGGTGGGCAA CGAGACCAAC AACTCCGTCG CCGGCGTCAC CGGCTGGGCC GGGATGTCGC AGATCTTCAG CGCCGGCAGC AGCGCCGTGC GCACCGTCTA CCCCGACGCC CTCGTCGCGG TGCACTTCAC CAACCCCGAG CGGGCCGGCT CCTACGCGAA CATCGCCCGC CAGCTCGACA CCCACGGCGT CGACTACGAC GTCTTCGCGA GCTCGTACTA CCCGTTCTGG CACGGGTCGC TGAGCAACCT GACGACCGTG CTGGACCAGG TCGCGACGAC CTACGACAAG CAGGTCATGG TCGCCGAGAC GTCGTGGAAC TACACCCTCG AGGACGGCGA CGGCCACGAG AACACCATCA AGCCGACCTC CGGGTTCACG CAGTACCCCT CGTCCGTCCA GGGCCAGGCG ACCGCGGTCC GCGACGTCAT GCAGGCCGTC ACCGACGTGG GCCCCGAGGG GATCGGCGTC TTCTACTGGG AGCCCGCGTG GCTGCCGGTC GGCCCACCCG AGGAGCTCGC GCAGAACAAG CTCCTCTGGG AGGAGCACGG CTCCGGCTGG GCGTCCAGCT ACGCCGCCGA GTACGACCCG GAGGACGCCG GCGAGTGGTA CGGCGGGTCG TCGTGGGACA ACCAGGCGCT CTTCGACTTC CAGGGTGCAC CGCTCGAGTC GCTGCGCGTG TTCCAGTACG CCCGCACCGG TGCGACCGCG CCGCGCGAGG TCGTCGACGT CGAGAAGGTG ACGCTCACCG TCCCTGACGG ATCACCCGTC ACCCTTCCGG CCACGGTGCG CGTCACGTAC AACGACGGCA GCACCGAGGA CCGACCCGTC ACGTGGTCGA CCTCCGTAGA CTGGATCCGC GGGCCGGGGG CGTACACGGT CTCCGGGACC ACCGCCGACG GCGCGCGGGT CACCGCGACC GTCACGGTCA GCGCGGTGAA CCACGTCGCC AACCCCAGCT TCGAGGACGC GACCGCCGCC CCGTGGACCA TCACGGGGAC GGGGGCATCC GTCGCGGCCG ACGCCGACGC CTCCGACGGG GCTCGCTCCC TGAAGTTCTG GTCCGCCACC CCCTACACCT TCGCGGTCGA GCAGACGATC ACCGGCGTAC CCGCCGGCAC CTACCGGCTG TCCGCCACGA GCCAGGGCGC ACTGGCAGGC CCGACCGACA CCCGCACCCT GAGTGCGACC ACGGCCGAGG GCGAGCAGTC GGCGCCCATC ACCCTCGACG GCTGGCGGGC CTTCTCGACC GCGACCGTCG ACGACGTGGT CGTCGGCGAG GACGGCCTCG TCACCGTCGC CGCGCGCTAC TCGCTCAGCG GTGCCGCCTG GGGCAACCTC GACGACGTCC GCCTCACCCG CGTGGAGAGC ACGACGACCG TCGACACCGC CGCCCTCGAG CAGTCGCTCC AGACGGCCCG CACCGTCGAC CGCACGCTCT GGACGCCGGC GTCGCTCGCG ACCCTCGACG TCGCGGTCGA GAGCGCCGAG GTGCTCCTGT CCGGGTCACG GGCCACGCAG GAGGACGTCG ACGCGGTGAC CGCGCTCGTC GACGCGGCCG TCGCCGGGCT GGAGGCGATC GACACGACGC CCCCGGTGGA CCCTGAGCCT CCCGTGACTC CGCAGCCCCC GGTCACGCCC GAGCCCCCGG TCACGCCGGA GCCGCCCGTC ACGCCCGAGC CCGTCACGCC GGTCATGACG CTCAGCTCCG GGTCCGTCAC CGCGGGTGAC GACGTCACCG TCACGGTGAC GGGGCTCGAC CTCCCCGAGG TCGAGATCGG CATCGAGAGC GAGTACCAGC GCCTCGCCTC GGCCACCGTC GTCGACGGCG CCGCGACGGT CACCGTGACC GTCCCGGCCA CCCTCGAGGC CGGCACGCAC AGCCTCCAGG CCCGCGACGC CGACGGCGCC GTGCTCGCCC AGGCAGCCGT CGAGGTGCTC GCCGCGCCCG TCGCCACGCC CACGCCGGGA GAGCCGGGGA CGGATGCACC GACGACGCCT GCACCAGGCG TCCCGTCATC AGGGGCTCCG GCAGCAGGCG GCACCGGAAC GCTCAGCGAG ACCGGCGCCG ACGTCGCCCT CCTCGCCGGG CTGACGACGA TGCTCGTCGC GGCCGGCAGC GCGGTGCTCG TGCTGCGTCG CCGTAGGGCC GGGACCTCGA CCCTACGGTG A
|
Protein sequence | MHPPRRRLPL RLAAAATAGA VLLTPLVTTS AVAADDPTGP VEAGIVVDKV EGLPEDFING VDISSILALE KSGVTFQDWD GQQADIFEVL ADADVNYVRV RVWNDPYDAQ GNGYGGGDND IAAAVAIGER ATAHGMRLLV DFHYSDFWAD PAKQQAPKAW AEMTVDQKAD ATEAYTADAL TQLRDANVDV GMVQVGNETN NSVAGVTGWA GMSQIFSAGS SAVRTVYPDA LVAVHFTNPE RAGSYANIAR QLDTHGVDYD VFASSYYPFW HGSLSNLTTV LDQVATTYDK QVMVAETSWN YTLEDGDGHE NTIKPTSGFT QYPSSVQGQA TAVRDVMQAV TDVGPEGIGV FYWEPAWLPV GPPEELAQNK LLWEEHGSGW ASSYAAEYDP EDAGEWYGGS SWDNQALFDF QGAPLESLRV FQYARTGATA PREVVDVEKV TLTVPDGSPV TLPATVRVTY NDGSTEDRPV TWSTSVDWIR GPGAYTVSGT TADGARVTAT VTVSAVNHVA NPSFEDATAA PWTITGTGAS VAADADASDG ARSLKFWSAT PYTFAVEQTI TGVPAGTYRL SATSQGALAG PTDTRTLSAT TAEGEQSAPI TLDGWRAFST ATVDDVVVGE DGLVTVAARY SLSGAAWGNL DDVRLTRVES TTTVDTAALE QSLQTARTVD RTLWTPASLA TLDVAVESAE VLLSGSRATQ EDVDAVTALV DAAVAGLEAI DTTPPVDPEP PVTPQPPVTP EPPVTPEPPV TPEPVTPVMT LSSGSVTAGD DVTVTVTGLD LPEVEIGIES EYQRLASATV VDGAATVTVT VPATLEAGTH SLQARDADGA VLAQAAVEVL AAPVATPTPG EPGTDAPTTP APGVPSSGAP AAGGTGTLSE TGADVALLAG LTTMLVAAGS AVLVLRRRRA GTSTLR
|
| |