Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_3710 |
Symbol | |
ID | 3966726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 4696567 |
End bp | 4697997 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637922807 |
Product | arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_529177 |
Protein GI | 90023350 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000857688 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.462081 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTAT CTATAAAGAA AGCGCTATGC GCTATGGCTG CGTCCACCTT TATTTTGCTG CAGGCTTCAA ACGCCCTTGC TATGGCAAAA GGTGCAGATG TGAGTTGGAT TTCTGAAATG GAAGCAGAGG GCTACACGTT TTATAACGAT GCGGGCCAGC AGCAAGATGT GCTGCAAATT CTAAAAGATC ACGGTATGGA TTCCATCCGT CTGCGCGTGT GGGTAAACCC CGCCGGTGGA TGGTATAGCA GCATTAATGA CGTAATAGAA AAAGCTCAGC GCGCCAAAGC TGCGGGCATG CGTATTATGA TCGATTTTCA CTACAGCGAC TCTTGGGCTG ACCCAGGCAA GCAATACAAG CCCGCCGCGT GGACCAACTA TACCTTAGAC GGTTTAATGT CTGCGGTGTG GTGGCACACC TACGATTCCC TCGTGGCCCT AAAGAATGCG GGTATTACCC CTGAATGGGT GCAAGTGGGC AACGAAACAA ACAACGGTAT GTTATGGGAA GAGGGGCGCG CATCCGCCAA TATGCAAAAC TATGCGTGGT TGGTGAATAG TGGCTACGAT GCCGTTAAAG AAGTGTTCCC TAATACCAAG GCAGTGGTGC ACTTGGCAAA CTGCCACGAC AACGCAAACT TCCGCTGGAT ATTTGACGGC TTACAAGCCA ATGGTGGTAA GTGGGATGTA ATAGGTGCCT CTATTTACCC TACCAACGCA AGCGGTTATA GCTGGAGCCA AGCCAACAGT TTGTGCGAGG CAAACTTAAA CGATATGCAA TCGCGCTATG GGTCCGAGGT GCTAATTGCC GAGGTTGGTG CGCCGTGGGA TCACCCAGAA GCGAAAGCAA TCGTGAGCGA TGTAATTGCT AAGGCGCAAA ACGCCGGTGC AACAGGGGTA TTTTATTGGG AGCCGCAGGC ATCAAACTGG CAGGGCTACA CGCTAGGTGC ATGGAACCCA AACACCATGC GCCCCACCGA AGCATTAGAC GCGTTTATTG ACGGCAGCTC GAATGTGACA ACCGCGCGTT TGCAATCGCG CAATAGCAAC CGCTGTATAG ATGTTAATGG CCGCAGTACA GCAGATGGTG CCGATATCAT TCAGTGGAGT TGCCACAGCA ACGCCAACCA GCAATGGACT TTTGAAGATA TGGGCAATAA CTACGTGCGA TTGCGCGTGG GCCACAGTAA TAAGTGCTTA GATGTACTGG GTGCAGGCAC TGCCGATGGC GATAACGTAG TGCAGTGGGC ATGCCACAAT AACGCCAATC AGCAATGGCT AAAAGAAGAC ATGGGCGATG GCTACTTCCG CTTAAAATCT CGCGCCAGCG GTAAATGCGT AGATGTAAAC GCAGGCGGTG CTAACAACGG TGATTCTATT ATTCAATGGA GTTGCCACAC TGGTTGGAAC CAGCAATGGA TGGTTTATTA G
|
Protein sequence | MTLSIKKALC AMAASTFILL QASNALAMAK GADVSWISEM EAEGYTFYND AGQQQDVLQI LKDHGMDSIR LRVWVNPAGG WYSSINDVIE KAQRAKAAGM RIMIDFHYSD SWADPGKQYK PAAWTNYTLD GLMSAVWWHT YDSLVALKNA GITPEWVQVG NETNNGMLWE EGRASANMQN YAWLVNSGYD AVKEVFPNTK AVVHLANCHD NANFRWIFDG LQANGGKWDV IGASIYPTNA SGYSWSQANS LCEANLNDMQ SRYGSEVLIA EVGAPWDHPE AKAIVSDVIA KAQNAGATGV FYWEPQASNW QGYTLGAWNP NTMRPTEALD AFIDGSSNVT TARLQSRNSN RCIDVNGRST ADGADIIQWS CHSNANQQWT FEDMGNNYVR LRVGHSNKCL DVLGAGTADG DNVVQWACHN NANQQWLKED MGDGYFRLKS RASGKCVDVN AGGANNGDSI IQWSCHTGWN QQWMVY
|
| |