Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_0953 |
Symbol | |
ID | 3967668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 1237117 |
End bp | 1238508 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637920020 |
Product | glycoside hydrolase family protein |
Protein accession | YP_526427 |
Protein GI | 90020600 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.161663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000205432 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTATAAAA TTTCACGCCG CACAACACTC AAAGGCTTAG GCCTAACTTG CCTAGCCGGC TGCACCACCA GCCTACCCAC ACTAGAGCAA GACCCATGGG CTTTTGCACA AAACATAGCG GACAACACCA CCATCCCCAC ATTCCCAAAC AAAGAATTTA ATTTACTCGA ATTCGGCGGC AAAGAAGGGA GCGACAACAC CCTCGCCTTC AAAAAAGCGA TTGCAGCATG CAGCAAAGCA GGTGGCGGCA AGGTGGTAGT ACCCGCAGGA CGATTTGAGA CAGGCGCCAT CCACTTAGAG TCGAACGTTA ACCTTCATAT TAGCGAAGGC GCTACCATCG CCTTTTTTAC CGACCCCAAA TATTACCTGC CTGCGGTTTT CACTCGCTGG GAAGGCATGG AGTGCATGGG CTACTCACCC CTTATATACG CCTACGGCAA AACCAACATA GCCATTACCG GTAAAGGCAC CCTCGACGGT CAAGCCGACC CAACGCACTG GTGGGCATGG AAAGGCAACA AAGAATGGGG CGTAGAGGGC TACCCAAGCC AAAAGGAAAG CCGCAACCAA CTATTTGCCC AAGCAGAAGC TGGCGACCCC GTTAGAGAGC GCGTGTATGC AGACGGCCAC TACCTGCGCC CCTCGTTTGT GCAACCCTAC AAGTGCGAAA ACGTGCTGAT AGAAGACATA ACTATTATCA ACGCTCCCTT CTGGTTGCTA CACCCCACCC TTTCACAAAA CGTCACTGTA CGCGGTGTTC ACCTAGAAAG CCTAGGCCCC AACTCGGATG GCTGCGATCC TGAAAGCTGT AAGAATGTAG TTATCGAAAA CTGCTTTTTT AATACCGGTG ACGACTGTAT CGCTATTAAA TCTGGCCGCA ACAACGATGG CCGCAGGCTT GCCACACCTA CCGAGAACGT GATTATTCGC AACTGTAAAA TGGAAGCGGG TCACGGTGGC GTAGTTATAG GCTCAGAAAT TTCTGGCGGC GTGCGCAATG TGTTTGCCGA AAATAACGTA ATGAGCAGCC CCGATTTAGA GAAAGGCATT CGCATTAAAA CCAACTCTGT GCGCGGCGGA CTGCTAGAGA ACATCTATGT GCGCAACTGC ACCATAGGCG AAGTACAACA AGCCATTGTT ATTAACTTCC AATACGAAGA AGGCGATGCG GGTAAATTTG ACCCCACCGT GCGCAATGTA GAAATACGCA ATTTGGTCTG CCAGCACGCC TTACAAGTGT TTAACATCCG CGGTTTTGAG CGCGCCCCCA TTCAAAACTT TAGGATAATC GACAGCACCT TTGTGCGTGG TGACAACCCA GGCGTAATTG AACATACCAC AGGGTTAGTT ATCGACAACG TCCAAGTCAA CGGCAAAGCG TTTAACATCT AG
|
Protein sequence | MYKISRRTTL KGLGLTCLAG CTTSLPTLEQ DPWAFAQNIA DNTTIPTFPN KEFNLLEFGG KEGSDNTLAF KKAIAACSKA GGGKVVVPAG RFETGAIHLE SNVNLHISEG ATIAFFTDPK YYLPAVFTRW EGMECMGYSP LIYAYGKTNI AITGKGTLDG QADPTHWWAW KGNKEWGVEG YPSQKESRNQ LFAQAEAGDP VRERVYADGH YLRPSFVQPY KCENVLIEDI TIINAPFWLL HPTLSQNVTV RGVHLESLGP NSDGCDPESC KNVVIENCFF NTGDDCIAIK SGRNNDGRRL ATPTENVIIR NCKMEAGHGG VVIGSEISGG VRNVFAENNV MSSPDLEKGI RIKTNSVRGG LLENIYVRNC TIGEVQQAIV INFQYEEGDA GKFDPTVRNV EIRNLVCQHA LQVFNIRGFE RAPIQNFRII DSTFVRGDNP GVIEHTTGLV IDNVQVNGKA FNI
|
| |