Gene Information |
Locus tag | Slin_4196 |
Symbol | |
ID | 8727955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5051670 |
End bp | 5054678 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003388980 |
Protein GI | 284039050 |
COG category | |
COG ID | |
TIGRFAM ID | |
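
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Slin_4196.
length_bp = gene_length(5051670, 5054678)
print(length_bp)                  # 3009
print(protein_length(length_bp))  # 1002
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.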
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.690171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTTCT GTCGGTCTGC TTTTTTTAGT TTAGTTAGCG TTTTTCTGTC CGTAACGACC TTTGCGCAAA CCACATCGCC CTCCTTTCTT AAACTCAATG CCCGCCAGGC TTGCTGGGCC GATTCGGTGT TTACGAACAT GGCCCCCGAC GACCGGATTG CCCAGCTAAT TATGGTAGCC GGGTACTCGA ACCGGAAACC GGCCTACGAA GATTCGCTGA TACGGCTTGT TCAGACCAAT AAACTCGGTG GGGTCGTCAT GTTTCAGGGT GGGCCCGTTC GGCAGGCGCA GCTTACCAAT CGGTTACAGG CCGGTTCGCC CGTTCCACTC CTCATTGCGA TGGATGCCGA ATGGGGCATT GCCATGCGCC TGGACAGCAC CGTTCGCTAC CCCTACCAGA TGACCCTCGG CGCCATGCAG GGGGCCGCTT CCGATTCGCT CATTTACCAG ATGGGCGCCA ACCTGGCTAA ACAGGCCCGT AGGCTGGGGA TGCATGTGAA CTTTGCCCCT TCGGTCGATG TCAACAACAA CCCGAACAAC CCCGTCATCA ACTTCCGCTC CTTTGGCGAA GACAAATATG CCGTTGCCCG CAAAGCCCTT GCTTACATGC GCGGTATGCA GGACAACCAG TTACTAACCA GCATCAAGCA CTTTCCCGGC CACGGCGACA CTGGCACCGA CTCCCATTAC GACTTACCCC TGATTGCCAA AAGCCGGACC CAGCTCGACT CGCTCGAACT GTATCCGTTC CGGGAGTTGA TCCGGGCGGG GGCTACGGGC GTGATGATCG CGCACCTCAA CATTCCCGCT CTCGATACCA CCCGAAATCG CCCATCGACG CTTTCGCCCG CTATTGTTAC GAACCTCCTC AAGAATGAAT TAGGCTTTAA AGGGCTGGTC TTTTCAGATG CGATGAATAT GAAAGCCGTA ACCAAATTCT ACCCGTCCGG CAAAGCCGAT GAACTGGGCC TGGAAGCGGG GATGGACGTG CTCGAATTTA CCGAAGATGT TCCGGCGGCA CTGGCGCAGG TGAAACAGGC CATTGTGGAC GGACGAATCA CGCAGGCCTC CATCGACGCC CGCTGCCTGA AAGTATTACA GGCCAAAGCG TGGGCCGGGC TGGATATGTA CAAACCCATT GTAATGGAGA ACCTGGTCGA AGACCTGAAC CCGGTTCAGG ACGAGCTACT TAACCGCAAG CTTACCGAAC GCAGCCTGAC GGTTTTGAAG AACGACCGGA ATGTGCTGCC GCTACAGGGT CTGGATACAC TCCGGATTGC GTCGGTAGCG GTGGAGAGCG ACAAGATCAC GGCGTTTCAG CAGATGGCGT CCAACTACAC CAAAATCGAC CATTTCAACG TAACCTCGAA AACAGCTGAT TCGACCTGGG CGCAAATCCG TGATTCGCTC AGCAACTACA ACCTTATTCT GGTCGATGTC CACCTGAACA ATATCCGCCC CGCCGTTAAA TACGGTCTGC AACCCAAAAC GGCGGGCATC GTTGGCGAAC TGGTGGCTAC CGGCAAAGCG GTCGTTACGG TGTTCGGCAA TGTGTATGCG CTGGACAAAC TGACCTTTCC GATGGATACC GCTCAGCCCA GCCGCAACAT CGAACAGGCG CGGGCCATTG TGATGCCGTA CCAGCTGACA CCCTACACGG AAGAACTGTC GGCTCAGTTG ATTTTCGGGG CTATTGGCGC ATCGGGTAAG CTGCCCGTTA CGGTCAACCA GCGATTCCGG CTGGGCGACG GGTTGCCCGT TCAAGCCATT GGTCGGATGA AATATACCAT TCCGGAAGAG 
GTGGGTATCA GCAGTAAGTT TCTGACCCAA CAGGTCGACT CGCTGGTAAA CGTGGGCATC AGCCAGGGCG CTTTTCCGGG CTGTGTGGTC CAGATGGCGA AAGATGGTAA GGTAATTTTC CGAAAAGCCT ATGGCAAGCA TACCTACGAT GCTTCGCTGG GGGCCGAACC CCTTCCGGTG CAGCTCGACG ACCTGTATGA CATGGCCTCG GTCACGAAGG TGAGTACCTC CACCCCCGCC CTCATGCACC TGGTCGACGA CGGCAAGTTC AACCTGAACG GTAAAATGGC CGATTATCTG CCCGGTTTCA AGAAGTCGAA CAAAGCCGAT CTGCTCTGGC GTGATGTACT GACCCACCAG GCCCGGCTAA GAGCCTGGAT ACCTTTCTGG ATGGACACCA AAAACCCGGA CGGCTCGTGG AAGCCCAAAA CCTTCCAGAA GGAACGGTCT GGCCGGTACC CGATTGAAGT GACCGACAGC CTGTTTGAGT TCAAAAACTA CCCGAAAACG ATTTTTGAGC AGATCAGAGA CTCCCCGCTG AACGCGAAAA AAGAATATGT GTACTCTGAC CTGTCGTTTA TTCTATACCC CCAGATCGTG AAACGACTAA CGGGCGAAAA CTTCGAAGAC TACCTCAAAA AGACCTTCTA CAAGCCCCTT GGCGCGAGTA CGCTTACCTT CCTGCCCCGC CGGTTCTATC CGCTCACCCG GATCGTTCCT ACCGAGTACG ATTCGCTGTT CCGCAAAACG CTCATCTGGG GGCGTGTGCA CGACGAGGGC GCGGCTATGT TGGGTGGCCT ATCGGGACAT GCGGGCCTGT TTGGTAATGC GAATGACTTG ATGAAGGTTT ACGAAATGTA CCGCCAGAAA GGCAGTTACG GCGGTCAGCG GTTTATTTCA GAAAAGACAA TTGCCGAGTT TACCCGGTAT CAGTTTCCGG AGCTGGGCAA CCGGCGTGGT CTTGGCTTCG ACAAACCGTC GTTCGCCTAT ACGGGCAATG CGCCGAAGTC GGCTACCAAA GCCAGTTACG GGCACTCAGG CTATACAGGC ACGTTTGTGT GGGTAGAACC CGACCCAGCC TATAATTTAA CATACATATT TTTGTGCAAC CGCGTATATC CTACGAGAAA TAATCCTAAA CTTGGTAATC TAAACACCCG CACCAACATT GTCGAAGCAT TGTATCAGGC AACCAAACGT GGACTGTGA
|
Protein sequence | MPFCRSAFFS LVSVFLSVTT FAQTTSPSFL KLNARQACWA DSVFTNMAPD DRIAQLIMVA GYSNRKPAYE DSLIRLVQTN KLGGVVMFQG GPVRQAQLTN RLQAGSPVPL LIAMDAEWGI AMRLDSTVRY PYQMTLGAMQ GAASDSLIYQ MGANLAKQAR RLGMHVNFAP SVDVNNNPNN PVINFRSFGE DKYAVARKAL AYMRGMQDNQ LLTSIKHFPG HGDTGTDSHY DLPLIAKSRT QLDSLELYPF RELIRAGATG VMIAHLNIPA LDTTRNRPST LSPAIVTNLL KNELGFKGLV FSDAMNMKAV TKFYPSGKAD ELGLEAGMDV LEFTEDVPAA LAQVKQAIVD GRITQASIDA RCLKVLQAKA WAGLDMYKPI VMENLVEDLN PVQDELLNRK LTERSLTVLK NDRNVLPLQG LDTLRIASVA VESDKITAFQ QMASNYTKID HFNVTSKTAD STWAQIRDSL SNYNLILVDV HLNNIRPAVK YGLQPKTAGI VGELVATGKA VVTVFGNVYA LDKLTFPMDT AQPSRNIEQA RAIVMPYQLT PYTEELSAQL IFGAIGASGK LPVTVNQRFR LGDGLPVQAI GRMKYTIPEE VGISSKFLTQ QVDSLVNVGI SQGAFPGCVV QMAKDGKVIF RKAYGKHTYD ASLGAEPLPV QLDDLYDMAS VTKVSTSTPA LMHLVDDGKF NLNGKMADYL PGFKKSNKAD LLWRDVLTHQ ARLRAWIPFW MDTKNPDGSW KPKTFQKERS GRYPIEVTDS LFEFKNYPKT IFEQIRDSPL NAKKEYVYSD LSFILYPQIV KRLTGENFED YLKKTFYKPL GASTLTFLPR RFYPLTRIVP TEYDSLFRKT LIWGRVHDEG AAMLGGLSGH AGLFGNANDL MKVYEMYRQK GSYGGQRFIS EKTIAEFTRY QFPELGNRRG LGFDKPSFAY TGNAPKSATK ASYGHSGYTG TFVWVEPDPA YNLTYIFLCN RVYPTRNNPK LGNLNTRTNI VEALYQATKR GL
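
The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of that clean-up plus two basic checks: the GC fraction, and whether the sequence starts with ATG and ends with a stop codon. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11, as stated in the record.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(displayed: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, starts with ATG, ends with a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Toy input only: the first two displayed blocks of the gene sequence.
fragment = clean_sequence("ATGCCCTTCT GTCGGTCTGC")
print(fragment)
print(gc_fraction(fragment))
```

Applied to the full gene sequence, `gc_fraction` can be compared with the GC content field of the record, and `looks_like_complete_cds` confirms the ATG start and the terminal stop codon.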