Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_3239 |
Symbol | |
ID | 3965729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 4130713 |
End bp | 4132629 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637922336 |
Product | cellulase |
Protein accession | YP_528708 |
Protein GI | 90022881 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00277111 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.266356 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTCA CAAGAATGAA ATCATCACAC CAAGGCGCGT GTCGACCAAG GTCTTCCACC CTACAGCGAC TAATCGCCTC ATCACTTACC ACCGCATGTT TGCTAGCAGC GTCTACTTTT GCCGACGTAG CGCCGTTAAC CGTAGATGGC AACCGCATTC TCAGCGGTGG CCAAGAGGCT AGCTTTGCCG GTAACAGTTT GTTTTGGAGC AACAATTATT GGGGCGGTGA GAAATACTAC ACAGCCGAAA CTGTTAACTG GTTAAAACAA GACTGGGGCG CAACACTAGT GCGCGCGGCC ATGGGTGTAG AAGATAACGG CGGCTACCTA GATGACAAAG AAGGCAACAA ACAAAAGGTA AAAACCGTTG TAGATGCTGC TATTGCCAAC GACATGTATG TAATTATCGA TTGGCACAGC CACCACGCCG AAGACCACAA AAGTGAAGCC ATTGCTTTTT TTGAGGATAT GGCGCGCACC TACGGCAATA AAAAACACGT TATTTACGAA ATTTATAACG AGCCTTTACA AATTTCGTGG AGCAACACAA TTAAACCCTA CGCCGAAGAT GTAATTAGAG CTATTCGCGC GATAGACCCC GACAACTTAA TTATTGTTGG TACGCCAACG TGGTCGCAAG ATGTAGACGT AGCATCGCAA GACCCCATTA CCGGCTACGC CAATATTGCC TACACATTGC ACTTTTACGC AGGCACCCAC AAACAATCTT TACGAGACAA AGCGCAAACC GCACTTAACA ACGGCATAGC GCTTTTCGCA ACAGAGTGGG GAACAGTAAA TGCAAACGGT GATGGCGCTG TAAACACCAC CGAAACAGAC AAGTGGATGA CGTTCTTTAA AACCAACCAC ATAAGCCACG CAAACTGGGC GCTAAACGAC AAATCAGAAG GCGCTTCTGC ATTAAACCCC GGAGCCAGCC CCAATGGCAA CTGGAGCAAC GCCGACTTAA CCACATCGGG TAAGTACGTA AAAAACATTA TCAAAAACTG GAACGACGGC ACGCCGGGAG GCAGCTCTTC AAGCTCGTCC GGCGGCTCAA CCAGTTCCTC CTCAAGCTCA TCTAGCTCTA ATTCCAGCTC TGGTGCTGGC AAAGTAAATT TACCCGCACG CATTGAAGCC GAAAACTATA ACAGTGCACC GGTAGAAACA ACTGCAGGCA ATAGTGGCGG CAGCGTTTCA CAATGTACAT ACAGAGGGCT AAATGTAGAC GTACAAGACG CAAGCGAAGG CACTTGTAAT ATTGGCTGGA CAGCAGCAGG CGAAAAAGTT ACCTACAACA TAGGCACAGC AAATAATACT TACAATATTG CACTTCGCAC CGCATCGCTT GATGCAGGCA AGCGCGTATC GGTATATGTA GGCAACACCC TCGCCGACAC AATAAGCACC CAAGGTGGCG GCTGGCAAAA TTGGAAGACG CAAACCATCC CCAATGTATA TATTCCATCA AACTCAGTTA TTACCGTGGA ATTCTACGAT GGCCGCACCA ACCTTAACTA CTTAAACATT AGTGCAGCTT CGGGGTCTTC CTCTTCAAGC TCCTCATCTA GCTCGTCAAC GTCTAGCTCT TCTTCGAGCT CATCTTCTAG CTCTTCAGGT GGTGGCAGTT GTAGCAGCTA TATAGATATA CCTTGGAATA CTCGCACCGA AGTTACCCTA ACAAGTGGCG CCTGCGTTCG CTTTAACCAA AACCTTTCGG GCAAAACCCT ACAAGTGTGG GATAGCGATG CAAACTCATC GTGCGATTTC CGGGGCACAG TTACAACAGT AGGCGGCACT GGCAGTTTAA ATGTAAGCAG CAACTATGTT TCGTCTAAGA GCCTAACAGG AACCAAACTT ACATTTAATT CAGCAAGTAA TAACAATTGT AAGTACGTTA AAGTTCGTGC TTATTAG
|
Protein sequence | MTFTRMKSSH QGACRPRSST LQRLIASSLT TACLLAASTF ADVAPLTVDG NRILSGGQEA SFAGNSLFWS NNYWGGEKYY TAETVNWLKQ DWGATLVRAA MGVEDNGGYL DDKEGNKQKV KTVVDAAIAN DMYVIIDWHS HHAEDHKSEA IAFFEDMART YGNKKHVIYE IYNEPLQISW SNTIKPYAED VIRAIRAIDP DNLIIVGTPT WSQDVDVASQ DPITGYANIA YTLHFYAGTH KQSLRDKAQT ALNNGIALFA TEWGTVNANG DGAVNTTETD KWMTFFKTNH ISHANWALND KSEGASALNP GASPNGNWSN ADLTTSGKYV KNIIKNWNDG TPGGSSSSSS GGSTSSSSSS SSSNSSSGAG KVNLPARIEA ENYNSAPVET TAGNSGGSVS QCTYRGLNVD VQDASEGTCN IGWTAAGEKV TYNIGTANNT YNIALRTASL DAGKRVSVYV GNTLADTIST QGGGWQNWKT QTIPNVYIPS NSVITVEFYD GRTNLNYLNI SAASGSSSSS SSSSSSTSSS SSSSSSSSSG GGSCSSYIDI PWNTRTEVTL TSGACVRFNQ NLSGKTLQVW DSDANSSCDF RGTVTTVGGT GSLNVSSNYV SSKSLTGTKL TFNSASNNNC KYVKVRAY
|
| |