Gene Slin_4349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4349 
Symbol 
ID8728109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5272057 
End bp5275767 
Gene Length3711 bp 
Protein Length1236 aa 
Translation table11 
GC content54% 
IMG OID 
ProductGlycosyl hydrolase family 98 putative carbohydrate binding module 
Protein accessionYP_003389130 
Protein GI284039200 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.165187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGAA AATCTATTGG TGAGCACCGT CCGCGTGCCC GTTCACTACT TCTTTTTTAT 
GCTTTCCTAC TTCTTTTTGG TACTGTTTGT CAATCGCTAC AAGCGCAGAC GACCTACTAT
GTCGCTAACT CGGGGAACGA TGCCAACAGC GGCAATTCGG AGGGAGCCCC GTTCCAGACA
CTGGCGAAGG TCAACAGCCT GACGTTACAA CCGGGCGACG CTATTTTGTT TCGGCGGGGC
GATACCTTTC GTGGGAGCCT GGTTATTCGG CAGTCCGGTG CGCCGGGCAG GCCCATCGTA
ATCGACGCCT ATGGCTCTGG AAGTAAACCG GTTCTGGCCG GTTCTGTGCC ACTGACTGGC
TGGAGCAATA TTGGCAACAA CATCTGGCAG GCCAATTGCC CATCCTGCGG AAGTCAGGTA
ACGGGCGTGT ATCAAAATAC TGTGGCCTTA CCGCTGGGCC GCTACCCCAA CCCCGGTGAG
GCCAGCAAAG GCTACCTGAC CATTCAGTCG CACAGTGGCA AAACGCAGCT TACCAGTCAA
CAGGGGTTAA CAGCCAACTG GACGGGGGGC GAAGCGGTTT TGCGGCCTAA TCAGTGGATT
CTGGATCGGG CGGCTATTAC CCAGCAGAAC GGGAATACCC TGACCCTGGC TAATAACAGT
AGTTACAACC TGGCCGATGG CTGGGGCTAT TTCATCCAGA ATCACCCCGC CACGCTGGAC
CAGGTAGGGG AGTGGTATTA TAACCCGGCC GACAAAACGA TTCAGTTGTT CGACAATCAG
CGCAACCCGA ATACGCAGCT TATTACTGCC ACGACCTTCA GTGAAGGAAT AAAGCTGACA
AACGTGTCGT ACGTAACAGT GCGCAATGTT GAAATTACAG AAACGCTCAG CAGCGGCATC
GCCGTTACGG GCGGGTCGAA TTTCACCTTC TCCGGCAACG ATATTACCAA TTCCGGCGAA
GACGGCGTCA CGATTATTGG TTCGGGGAAT ACCGTAGTGG CCGAGAATAA TCTGATCGAA
GACGCCAACA GCAGTGGCTT TTACATTGGT CCTTACCAGA ACTTTACGTT TCGGGGCAAC
ACGCTTCGGC GGATAGGGAC GCTGCCTGGC CGGGGAAAAA GTGGTGATGG TACGTATTCA
GCCCTGCAAT CGCTGTGTAC GGGCAACACC CTGATTGAAA ATAACGTGGT CGACAACATC
GGCTACAACG GTATTGCCGT CGTGACCAAT GCCACGGTGC GGTATAATCA GGTGTCTAAC
TTCTGCCTGA CCAAAAGCGA CGGGGGCGGC ATTTACACCT GGAACGGTAG CGGGGGTAAC
GTAGGCGATC TGCACATTGT GTCAAACATC GTGTTCAACG GAGCCGGGGC ACCGGAGGGA
ACACCGGGCG GGGCTTATTC CGGCGCCAAC GGTATTTTTC TGGACGACTG TTCGAAAAAT
GTAGAAGTGC TTAACAACAT GTCTTTTGGG TCGAAAGGGA TGGGTATCTT CCTGCGGGGC
GTGTCCAGCA TTACCGTCAG AGGAAATACC AGTTTCAACA ATACGGAAGA ACAGCTCAAA
CTGGCGTACA ACGGGGCCTG TGCCTTGAGA AATAATATTG TCGAGAATAA CATTCTGTTT
AGTCGGCTGG CTAATCAGGT GGTAGCCGCT TATGAGTCGA ACACAAACGA CCTGACCAGC
TACGGCCAGT TTGATTACAA CTATTACGTT CGTCCATTTG AGGATTTGTT CAAGATCCGG
GCCGTTTACA ATCCGGGGTC CGGGCTAACC GGGGCCGATC TGCCCCTGAA AGCGTGGCAG
GCGCAGTTCG GTAAAGACGC GAACTCGTTC AACAGCCCGA TCACGTATAA AAGTCAGATC
GTTAGCCAGA CGGGAGCCAG CTTGTTAAAC AGTTCGTTTT CCGGCGATGC TGGGGGGTGG
AGCGTCTGGT CACCAGCGGG CAATGGCCGG GCCGACTGGG ACAATACCGG TCGACTCGAT
GGTGGCTCGC TTCGGCTGTC CTTCAGCAAT AACCAGCCGG ATTCGTACCT GCTGGCTACG
GTAAAAATCG GTGCCGTTAC CAAAGGGAAA TCGTACCAGT TGTTGTTCGA TGGGGTGGCT
TCGGCCGAGG GCAAGAAAGT GGAAGTGTAC CCCCGGCAAT TGTCTGGCAG TTATAAAGAC
CTGTCGCCCC GCACGCTGTT ACTGATGGGT ACCGGCCGCC AGACCTACGA AGCTGTGTTC
ACGGCTACGG CCGATGAGGC CAATGCGATT CTGGTGGTGC AGGTAACCGG CGATGGACAA
ACGGCCTGGA TTGACAATGT TCGCCTGGCA GACGCAACAC TCACCACGGT AAACCCCGAC
GACTACATTA AGTTGGTCTA TAACGCGACC AGTCAGGATA AAACTGTGGG GCTGAATGGC
ACGTACCGAG ATGCGAAAAA TATGGCCTAT ACCAATCAGA TAACGCTGTC TCCTTTTTCG
TCGGCGGTGC TGATGAAGGA AATCAATCCG GCTCCAACGC CCGTCGTGGA CTTGCGGGAG
CCGGAAAATC CGGCGAATGC CGTTGCGGGA CTTGACTACC AATATTACGA AGGGTACTGG
AACAACCTGC CTGACATGGC CAGTCTCACA CCCGTAAAAT CGGGCATCGT TGCCCGTGTC
GACCTCTCGG TGCGTAACAG ATCGGAGCAG TACGCCCTGC GCTATAAAGG ATACATTGAC
ATACCGGCCG ATGGTAGTTA CACCTTTTAT ACGGCTTCGG ATGATGGTAG TAAACTGCTG
ATCGGAACAA CGGAGGTGGT TGCCAACGAT GGCGTACACG GCGTGATCGA GAAATCCGGC
GTTATCGGTT TGAAAGCGGG CCGACACGCC ATTACTCTGC TTTATTTTCA GGCCGGTGGT
GGTCAGTCGA TGACCGTCAG TTACGAAGGC CCCGGCCTGA GCAAGCGGGA GGTTCCGGCA
TCGGCTTTTT ATCGGGTAGC TGCCGACGTC AGTGGCGTCT ACCTGTCTGA CCTGACCTGG
ACATCGGCCA GCAGCGGCTA TGGACCGGTG GAGAAAGATC GCAGCAATGG CGAGGCCAAC
GCGGGCGACG GGCGCACCAT AACCCTCAAC GGCGTTACCT ATAACAAGGG GCTGGGTGTT
CATGCGTCTT CGGATATAAC GTATAGCCTG AACGGTCAGT ACACTCGTTT TTTGTCTGAT
ATTGGTATTG ATGACGAAAT CCCCAACGGA AGTTGCGGGT CGGTTACGTT TGAGGTGTAC
CTCGACAATG TGCTGACTTA TAGCAGTGTT CGTATGAATC CCGCAATGGC TACCAAAACT
ATTGATCTGG ACGTATCTGG TAAGCAGACG CTGCGTCTGG TGGTAACAAA CGCGGGCGAT
GACGCCAGCT GCGATCATGC CGACTGGGCC GGGGCCAGGC TAACTGGCTC GGGCAGTGGA
CGCATAGCCA ATCGTAATGC CGACGAGTTT TCGGCAGAAT TAGCCGTTCA GGTGTATCCC
ATACCTGCCC GCGACGAAGT TCAGGTACGC TACGGTACGG CTTTGTCCGG CATGGTAAGT
GTACAACTGC TCAACACAGC GGGTATACCG GTCATTCAGA CGCGCCAGTC GGTGTCAGCG
GGTGAGAACT TGATTCGGCT ATCCGTTGGT GAACTTACGC GTGGTTTTTA CGTGCTGACG
GTTGTTCAGG ATGGGAAACG GTTCTCCCGG AAAGTGATCC TGGCGGAGTA A
 
Protein sequence
MLRKSIGEHR PRARSLLLFY AFLLLFGTVC QSLQAQTTYY VANSGNDANS GNSEGAPFQT 
LAKVNSLTLQ PGDAILFRRG DTFRGSLVIR QSGAPGRPIV IDAYGSGSKP VLAGSVPLTG
WSNIGNNIWQ ANCPSCGSQV TGVYQNTVAL PLGRYPNPGE ASKGYLTIQS HSGKTQLTSQ
QGLTANWTGG EAVLRPNQWI LDRAAITQQN GNTLTLANNS SYNLADGWGY FIQNHPATLD
QVGEWYYNPA DKTIQLFDNQ RNPNTQLITA TTFSEGIKLT NVSYVTVRNV EITETLSSGI
AVTGGSNFTF SGNDITNSGE DGVTIIGSGN TVVAENNLIE DANSSGFYIG PYQNFTFRGN
TLRRIGTLPG RGKSGDGTYS ALQSLCTGNT LIENNVVDNI GYNGIAVVTN ATVRYNQVSN
FCLTKSDGGG IYTWNGSGGN VGDLHIVSNI VFNGAGAPEG TPGGAYSGAN GIFLDDCSKN
VEVLNNMSFG SKGMGIFLRG VSSITVRGNT SFNNTEEQLK LAYNGACALR NNIVENNILF
SRLANQVVAA YESNTNDLTS YGQFDYNYYV RPFEDLFKIR AVYNPGSGLT GADLPLKAWQ
AQFGKDANSF NSPITYKSQI VSQTGASLLN SSFSGDAGGW SVWSPAGNGR ADWDNTGRLD
GGSLRLSFSN NQPDSYLLAT VKIGAVTKGK SYQLLFDGVA SAEGKKVEVY PRQLSGSYKD
LSPRTLLLMG TGRQTYEAVF TATADEANAI LVVQVTGDGQ TAWIDNVRLA DATLTTVNPD
DYIKLVYNAT SQDKTVGLNG TYRDAKNMAY TNQITLSPFS SAVLMKEINP APTPVVDLRE
PENPANAVAG LDYQYYEGYW NNLPDMASLT PVKSGIVARV DLSVRNRSEQ YALRYKGYID
IPADGSYTFY TASDDGSKLL IGTTEVVAND GVHGVIEKSG VIGLKAGRHA ITLLYFQAGG
GQSMTVSYEG PGLSKREVPA SAFYRVAADV SGVYLSDLTW TSASSGYGPV EKDRSNGEAN
AGDGRTITLN GVTYNKGLGV HASSDITYSL NGQYTRFLSD IGIDDEIPNG SCGSVTFEVY
LDNVLTYSSV RMNPAMATKT IDLDVSGKQT LRLVVTNAGD DASCDHADWA GARLTGSGSG
RIANRNADEF SAELAVQVYP IPARDEVQVR YGTALSGMVS VQLLNTAGIP VIQTRQSVSA
GENLIRLSVG ELTRGFYVLT VVQDGKRFSR KVILAE