Gene Slin_4196 details

Gene Information

Locus tag: Slin_4196
Symbol:
ID: 8727955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5051670
End bp: 5054678
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: glycoside hydrolase family 3 domain protein
Protein accession: YP_003388980
Protein GI: 284039050
COG category:
COG ID:
TIGRFAM ID:
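
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. Neither assumption is stated in the record, although both are consistent with the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 5051670, 5054678
length_bp = gene_length(start_bp, end_bp)
print(length_bp, "bp")                    # 3009 bp
print(protein_length(length_bp), "aa")    # 1002 aa
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.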


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.690171
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTTCT GTCGGTCTGC TTTTTTTAGT TTAGTTAGCG TTTTTCTGTC CGTAACGACC 
TTTGCGCAAA CCACATCGCC CTCCTTTCTT AAACTCAATG CCCGCCAGGC TTGCTGGGCC
GATTCGGTGT TTACGAACAT GGCCCCCGAC GACCGGATTG CCCAGCTAAT TATGGTAGCC
GGGTACTCGA ACCGGAAACC GGCCTACGAA GATTCGCTGA TACGGCTTGT TCAGACCAAT
AAACTCGGTG GGGTCGTCAT GTTTCAGGGT GGGCCCGTTC GGCAGGCGCA GCTTACCAAT
CGGTTACAGG CCGGTTCGCC CGTTCCACTC CTCATTGCGA TGGATGCCGA ATGGGGCATT
GCCATGCGCC TGGACAGCAC CGTTCGCTAC CCCTACCAGA TGACCCTCGG CGCCATGCAG
GGGGCCGCTT CCGATTCGCT CATTTACCAG ATGGGCGCCA ACCTGGCTAA ACAGGCCCGT
AGGCTGGGGA TGCATGTGAA CTTTGCCCCT TCGGTCGATG TCAACAACAA CCCGAACAAC
CCCGTCATCA ACTTCCGCTC CTTTGGCGAA GACAAATATG CCGTTGCCCG CAAAGCCCTT
GCTTACATGC GCGGTATGCA GGACAACCAG TTACTAACCA GCATCAAGCA CTTTCCCGGC
CACGGCGACA CTGGCACCGA CTCCCATTAC GACTTACCCC TGATTGCCAA AAGCCGGACC
CAGCTCGACT CGCTCGAACT GTATCCGTTC CGGGAGTTGA TCCGGGCGGG GGCTACGGGC
GTGATGATCG CGCACCTCAA CATTCCCGCT CTCGATACCA CCCGAAATCG CCCATCGACG
CTTTCGCCCG CTATTGTTAC GAACCTCCTC AAGAATGAAT TAGGCTTTAA AGGGCTGGTC
TTTTCAGATG CGATGAATAT GAAAGCCGTA ACCAAATTCT ACCCGTCCGG CAAAGCCGAT
GAACTGGGCC TGGAAGCGGG GATGGACGTG CTCGAATTTA CCGAAGATGT TCCGGCGGCA
CTGGCGCAGG TGAAACAGGC CATTGTGGAC GGACGAATCA CGCAGGCCTC CATCGACGCC
CGCTGCCTGA AAGTATTACA GGCCAAAGCG TGGGCCGGGC TGGATATGTA CAAACCCATT
GTAATGGAGA ACCTGGTCGA AGACCTGAAC CCGGTTCAGG ACGAGCTACT TAACCGCAAG
CTTACCGAAC GCAGCCTGAC GGTTTTGAAG AACGACCGGA ATGTGCTGCC GCTACAGGGT
CTGGATACAC TCCGGATTGC GTCGGTAGCG GTGGAGAGCG ACAAGATCAC GGCGTTTCAG
CAGATGGCGT CCAACTACAC CAAAATCGAC CATTTCAACG TAACCTCGAA AACAGCTGAT
TCGACCTGGG CGCAAATCCG TGATTCGCTC AGCAACTACA ACCTTATTCT GGTCGATGTC
CACCTGAACA ATATCCGCCC CGCCGTTAAA TACGGTCTGC AACCCAAAAC GGCGGGCATC
GTTGGCGAAC TGGTGGCTAC CGGCAAAGCG GTCGTTACGG TGTTCGGCAA TGTGTATGCG
CTGGACAAAC TGACCTTTCC GATGGATACC GCTCAGCCCA GCCGCAACAT CGAACAGGCG
CGGGCCATTG TGATGCCGTA CCAGCTGACA CCCTACACGG AAGAACTGTC GGCTCAGTTG
ATTTTCGGGG CTATTGGCGC ATCGGGTAAG CTGCCCGTTA CGGTCAACCA GCGATTCCGG
CTGGGCGACG GGTTGCCCGT TCAAGCCATT GGTCGGATGA AATATACCAT TCCGGAAGAG
GTGGGTATCA GCAGTAAGTT TCTGACCCAA CAGGTCGACT CGCTGGTAAA CGTGGGCATC
AGCCAGGGCG CTTTTCCGGG CTGTGTGGTC CAGATGGCGA AAGATGGTAA GGTAATTTTC
CGAAAAGCCT ATGGCAAGCA TACCTACGAT GCTTCGCTGG GGGCCGAACC CCTTCCGGTG
CAGCTCGACG ACCTGTATGA CATGGCCTCG GTCACGAAGG TGAGTACCTC CACCCCCGCC
CTCATGCACC TGGTCGACGA CGGCAAGTTC AACCTGAACG GTAAAATGGC CGATTATCTG
CCCGGTTTCA AGAAGTCGAA CAAAGCCGAT CTGCTCTGGC GTGATGTACT GACCCACCAG
GCCCGGCTAA GAGCCTGGAT ACCTTTCTGG ATGGACACCA AAAACCCGGA CGGCTCGTGG
AAGCCCAAAA CCTTCCAGAA GGAACGGTCT GGCCGGTACC CGATTGAAGT GACCGACAGC
CTGTTTGAGT TCAAAAACTA CCCGAAAACG ATTTTTGAGC AGATCAGAGA CTCCCCGCTG
AACGCGAAAA AAGAATATGT GTACTCTGAC CTGTCGTTTA TTCTATACCC CCAGATCGTG
AAACGACTAA CGGGCGAAAA CTTCGAAGAC TACCTCAAAA AGACCTTCTA CAAGCCCCTT
GGCGCGAGTA CGCTTACCTT CCTGCCCCGC CGGTTCTATC CGCTCACCCG GATCGTTCCT
ACCGAGTACG ATTCGCTGTT CCGCAAAACG CTCATCTGGG GGCGTGTGCA CGACGAGGGC
GCGGCTATGT TGGGTGGCCT ATCGGGACAT GCGGGCCTGT TTGGTAATGC GAATGACTTG
ATGAAGGTTT ACGAAATGTA CCGCCAGAAA GGCAGTTACG GCGGTCAGCG GTTTATTTCA
GAAAAGACAA TTGCCGAGTT TACCCGGTAT CAGTTTCCGG AGCTGGGCAA CCGGCGTGGT
CTTGGCTTCG ACAAACCGTC GTTCGCCTAT ACGGGCAATG CGCCGAAGTC GGCTACCAAA
GCCAGTTACG GGCACTCAGG CTATACAGGC ACGTTTGTGT GGGTAGAACC CGACCCAGCC
TATAATTTAA CATACATATT TTTGTGCAAC CGCGTATATC CTACGAGAAA TAATCCTAAA
CTTGGTAATC TAAACACCCG CACCAACATT GTCGAAGCAT TGTATCAGGC AACCAAACGT
GGACTGTGA
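
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be stripped of whitespace before any calculation. The following is a minimal sketch of how the GC content field and the terminal stop codon might be checked. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). The demonstration uses only the first block and the last line of the sequence; pasting the full sequence would work the same way.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(text: str) -> bool:
    """True if the cleaned sequence is in frame and ends with a stop codon."""
    seq = clean_sequence(text)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


first_block = "ATGCCCTTCT"   # first 10 bases of the gene sequence
last_line = "GGACTGTGA"      # final line of the gene sequence
print(gc_percent(first_block))
print(ends_with_stop(last_line))
```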
 
Protein sequence
MPFCRSAFFS LVSVFLSVTT FAQTTSPSFL KLNARQACWA DSVFTNMAPD DRIAQLIMVA 
GYSNRKPAYE DSLIRLVQTN KLGGVVMFQG GPVRQAQLTN RLQAGSPVPL LIAMDAEWGI
AMRLDSTVRY PYQMTLGAMQ GAASDSLIYQ MGANLAKQAR RLGMHVNFAP SVDVNNNPNN
PVINFRSFGE DKYAVARKAL AYMRGMQDNQ LLTSIKHFPG HGDTGTDSHY DLPLIAKSRT
QLDSLELYPF RELIRAGATG VMIAHLNIPA LDTTRNRPST LSPAIVTNLL KNELGFKGLV
FSDAMNMKAV TKFYPSGKAD ELGLEAGMDV LEFTEDVPAA LAQVKQAIVD GRITQASIDA
RCLKVLQAKA WAGLDMYKPI VMENLVEDLN PVQDELLNRK LTERSLTVLK NDRNVLPLQG
LDTLRIASVA VESDKITAFQ QMASNYTKID HFNVTSKTAD STWAQIRDSL SNYNLILVDV
HLNNIRPAVK YGLQPKTAGI VGELVATGKA VVTVFGNVYA LDKLTFPMDT AQPSRNIEQA
RAIVMPYQLT PYTEELSAQL IFGAIGASGK LPVTVNQRFR LGDGLPVQAI GRMKYTIPEE
VGISSKFLTQ QVDSLVNVGI SQGAFPGCVV QMAKDGKVIF RKAYGKHTYD ASLGAEPLPV
QLDDLYDMAS VTKVSTSTPA LMHLVDDGKF NLNGKMADYL PGFKKSNKAD LLWRDVLTHQ
ARLRAWIPFW MDTKNPDGSW KPKTFQKERS GRYPIEVTDS LFEFKNYPKT IFEQIRDSPL
NAKKEYVYSD LSFILYPQIV KRLTGENFED YLKKTFYKPL GASTLTFLPR RFYPLTRIVP
TEYDSLFRKT LIWGRVHDEG AAMLGGLSGH AGLFGNANDL MKVYEMYRQK GSYGGQRFIS
EKTIAEFTRY QFPELGNRRG LGFDKPSFAY TGNAPKSATK ASYGHSGYTG TFVWVEPDPA
YNLTYIFLCN RVYPTRNNPK LGNLNTRTNI VEALYQATKR GL