Gene Information |
Locus tag | Sde_3237 |
Symbol | |
ID | 3965710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 4125970 |
End bp | 4127862 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637922334 |
Product | cellulase |
Protein accession | YP_528706 |
Protein GI | 90022879 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
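The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the record does not state either convention, but the numbers agree with both. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 4125970
END_BP = 4127862
GENE_LENGTH_BP = 1893
PROTEIN_LENGTH_AA = 630


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 4127862 - 4125970 + 1 = 1893 bp, and 1893 / 3 - 1 = 630 codons, matching the reported gene and protein lengths.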
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000087335 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.349209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAATCAG CAACCACAAA TCAATCGAGG GCACGCAGTA GCGCCTTTAA AAATATGTTG GCGGCATCGC TCGCAGGTTT AGGGCTACTA TCAGCTTCTG CATTTGCCGA TGTAGCCCCG CTAACCGTAG ACGGCAATAA AATTCTTAGC GGTGGCCAGC AAGCCAGTTT TGCCGGTAAT AGCTTATTTT GGTCTAACAA TGGCTGGGGC GGTGAGAAGT ATTACACGGC CGGTACCGTT GAATGGCTAA AGCAAGACTG GGGCAGTAAT TTAGTTCGCG CCGCAATGGG TGTCGATGAA AACGGCGGCT ACTTAGAAGA CCCAGCAGGA AACAAAGCGA AAGTAACAAC CGTTGTAGAT GCAGCCATCG CTAACGATAT GTATGTAATT ATCGATTGGC ACAGCCACCA CGCCGAAGAC TACCAAAACC AAGCCATTAG CTTTTTCCAA GATATGGCTC GCACCTACGG TAACAACAAC AACGTTATAT ACGAAATTTA TAACGAGCCA TTACAGGTTT CTTGGAGCGG CACCATCAAG CCTTACGCAG AAGCGGTAAT TGGCGCAATT CGCGCAATCG ACCCAGATAA CCTTATTATT GTGGGCACGC CTACTTGGTC GCAGGATGTA GACGTAGCCT CGCGCGACCC CATCACGCAG TACAGCAACA TTGCCTACAC TATTCACTTT TATGCGGGCA CCCACAAACA ATCCCTACGC GATAAAGCAC AAACCGCATT AAATAATGGT ATTGCTTTGT TTGCTACCGA ATGGGGTACA GTAAATGCCA ACGGTGACGG CGGTGTAGAC GCAGCCGAAA CTGATCGTTG GATGCAGTTT TTTAAAGCGA ATCATATAAG CCATGCCAAC TGGGCCTTAA ACGATAAAGC CGAAGGCTCT TCTGCATTAA AGCCTGGCTC TAACGCAAAC GGCGGCTGGA GCAATTCCGA CTTAACCGCC TCTGGTACCT ATGTTAAAAA CTTAATTAAA ACATGGAACG ACGGCTCACC GAGCAGCAGC TCATCTAGCA GCACCAGTTC TTCTTCAAGC AGCTCCTCGT CTAGTAGCTC ATCATCTAGC AGCTCTTCAT CTAGTAGTTC TGGCGGTACC AATTTACCCG CGCGCATTGA AGCAGAAAAC TACGATAGCG CACCGGTAGA AACCACTGCA GGTAATAGCG GCTCACCCAC CAATTGTTCG TATAAAGGTA TGGGCGTAGA TGTAGAAAAC TCTACTGAAG GTGCTTGTAA TATTGGCTGG ACTGCGGCAG GCGAAAAAGT AACTTACAAC ATTGGCAATG CCGATGGCAC TTACGATATT GCATTGCGCG TAGCCTCTAT GGATGCGGGC AAACGTATCT CTGTGCATGT AAACAACAGC CTAGCAGATA CCGTAACCAC ACAAGGTGGC GGCTGGCAGG CATGGACTAC CGAAACCATT TCTAACGTGT ATATCCCATC AAACTCGGTA ATTACCGTTG AGTTTTACGA TAGTGGCTCT AACCTAAACT TTTTAAACAT TACCGAAAGC TCGGGTACCG AACCACCTGT AGAACCACCC GTTGAGCCGC CAGTAGAACC ACCCGTAGAC AACGGTAACT TCCCATGTAA CGACGGTAAC TCTACGCTTG CCAACAACGG CGCCTCCATT AACCTTAACC AAGGAGCGTG TGTTAAATAC AATCACGGCT GGGGCGATAT TCGTTTAGGC ACCTGGAGCG GCAACGGTAC CATTCGATAC GACGTACTAG ACTGCAATAA CAACGTAATG AGTGATATTG CACAAAAACT TAATGACTTT 
ACTGCTGTAG ACACCGCAAC AATGAACTGC GCACACTACA TTTATGTAAA ACAAGCCCCT AGCAGCTACA CCCTGCAATT TGGTAGCTGG TAG
Protein sequence | MKSATTNQSR ARSSAFKNML AASLAGLGLL SASAFADVAP LTVDGNKILS GGQQASFAGN SLFWSNNGWG GEKYYTAGTV EWLKQDWGSN LVRAAMGVDE NGGYLEDPAG NKAKVTTVVD AAIANDMYVI IDWHSHHAED YQNQAISFFQ DMARTYGNNN NVIYEIYNEP LQVSWSGTIK PYAEAVIGAI RAIDPDNLII VGTPTWSQDV DVASRDPITQ YSNIAYTIHF YAGTHKQSLR DKAQTALNNG IALFATEWGT VNANGDGGVD AAETDRWMQF FKANHISHAN WALNDKAEGS SALKPGSNAN GGWSNSDLTA SGTYVKNLIK TWNDGSPSSS SSSSTSSSSS SSSSSSSSSS SSSSSSSGGT NLPARIEAEN YDSAPVETTA GNSGSPTNCS YKGMGVDVEN STEGACNIGW TAAGEKVTYN IGNADGTYDI ALRVASMDAG KRISVHVNNS LADTVTTQGG GWQAWTTETI SNVYIPSNSV ITVEFYDSGS NLNFLNITES SGTEPPVEPP VEPPVEPPVD NGNFPCNDGN STLANNGASI NLNQGACVKY NHGWGDIRLG TWSGNGTIRY DVLDCNNNVM SDIAQKLNDF TAVDTATMNC AHYIYVKQAP SSYTLQFGSW
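The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. This gene starts with ATG, so no special start-codon handling is included. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    print(translate("ATGAAATCAG CAACCACAAA TCAATCGAGG"))  # MKSATTNQSR
```

Passing the full gene sequence to `translate` should yield the listed protein sequence, and `gc_fraction` can be compared against the reported GC content.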