Gene Slin_4900 details

Gene Information

Locus tag: Slin_4900
Symbol:
ID: 8728664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5965155
End bp: 5968337
Gene Length: 3183 bp
Protein Length: 1060 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: glycosyl hydrolase BNR repeat-containing protein
Protein accession: YP_003389677
Protein GI: 284039747
COG category:
COG ID:
TIGRFAM ID:
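
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is an untranslated stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5965155, 5968337)
print(length)                           # 3183
print(expected_protein_length(length))  # 1060
```

Both values agree with the Gene Length and Protein Length fields of this record.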


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGATA AAATAACCGA AAACAGTTAT TTGATGAGTT ATACGCAGCC CGATACGCTA 
AAAATCTATT CTTCCATGCA TAAGTTTTTC CTTTACTTCT TTCTAACGCT GTTTGTTAAC
CAGCCTGCCA TAGGCCAGGC CACGAAAGCA CAGGCCACGG TTTCCTTTGA GCCATCAACG
TACGAGGGCT TACGCTGGCG TGAATTAGGT CCATATCGCG GAGGCCGCTC TTGTACAGTT
ACCGGCGTAC CCAACAATCC CAATCTATAT TACATGGGTA CCGTAGGCGG TGGGGTCTGG
CGCACGACCG ATGGCGGTCA GACCTGGGGC AGCATTACCG ATAATTACTT TGGCGGCACC
ATTGGCGCGG TGGCCGTTGC CGAAAGTGAC CCCAATGTTA TTTATGTAGG CGAGGGCGAA
CAGACGTTAC GCAATAACGT GGCATCGGGC ACCGGCATGT GGCGCTCCAC CGACGCGGGC
ACCAGTTGGA AGCGCATCGG CCTGACCGAC TCGAAACACA TTGCCCGCAT TCGGATTCAC
CCCAAAAACC CGGATGTAGT TTACGTTGCC GCTATGGGTA ACCTCTGGAA ACCCAACGAG
ATGCGGGGCA TCTTTCGCAG CACCGACGGT GGGCAAACCT GGAAAAAAGT GCTGTACATC
AACGACCAGG CGGGTGCCGC CGATCTGATG CTCGACCCCA ATAACCCGCG AATAATGTAT
GCCAGCACCT GGAACATGAA GCGCAACGGC TACCGCATGG ATAGCGGTGG TCCCGACTCC
AAACTGTGGA AAAGCACCGA CGGGGGCGAC ACCTGGGAAA ACCTGTCCGA TAAGCCCGGC
ATGCCGTCGG GAATTAACGG CATCATCGGC GTGACCGTTT CGCCAAAAAA CTCCAGCCGG
GTGTGGGCCA TCATCGAGAA TAAAGACGCC GGTGGCGTTT ACCGCTCCGA CGACGCCGGG
AAAACCTGGA CTAAAATCAA TCAGGACCGC GCCCTGCTGC AACGGGCCTG GTACTACTGC
CGCATTTACG CCGACTCACA AAACGAGGAC ATCGTGTATG TCATGAACGT GAGTTACGGC
GTGTCGAAAG ATGGCGGTAA AACTTTCGAG TTAAAAAATG CACCCCACGG CGACCACCAC
GACCTCTGGA TTGATCCCAA CAACAACAAA CGCATGGCCA TTGCCGATGA TGGGGGCGCT
CAGATTTCGA CCGATGGCGG CAACAACTGG ACAACCTACC ACAACCAACC AACAGCGCAG
TTCTACCGCG TTTCGACCGA CAACCATTTT CCGTACCGCA TTTATGGTGC CCAGCAGGAC
AATACCAGCG TCCGGATTTC GCACCGGACG GGCAGCGCAT CTGTCACCGA AAAAGATTGG
GATGCGCTGG CCATTGGCGA AAGTGCTCAC CTGGCTGCCG ATCCGCTCAA CAGCGAGGTG
GTTTTTGGTG GCGATTATAA AGGCTATATG ACCATGCAGG ACCTGGCTAC CGGTCAGGAG
CGATCGACCA ACATCTACCC CGATCTGCCC GCTGGTTCAG GTGCCGATGC CATGAAATAC
CGCTTCAACT GGAATTACCC GGTCTTTTTC AGCCCGCACA ACCCCAAGAA ACTCTACGCC
GGATCGAACC ATCTGCACGT CAGTTACACA GGGGGCGAAA GCTGGGAGGT GATCAGCCCC
GATTTGTCGA GAGGAGAACC CGAAACCATC AAGTCGTCGG GTGGGCCGAT CACGCAGGAC
AATACCGGAG CCGAATACTA CGCCAACTTG TTTGCCGCCA CCGAGTCGCC TTATACCGAA
GGTGAAATCT GGACGGGGTC GGACGATGGG CTGGTGCATG TGACTACGGA TGGCGGCAAG
AACTGGAAAA ACGTTACCCC GCCTATGTCG CCAAAGTACA ATATGATGAA CTCGGTGGAG
GTTAATCCAT TCGTGAAAGG GGGTGCTTAC ATTGCGGCTA CGTCGTATAA GTTTGGCGAT
TATATGCCTT ACATCTACAA AACGCTGGAT TACGGCAAAA CCTGGACGGT GATGACCAAA
GGTATTCCCA AAGATGAGTT TGTACGGGTC GTTCGGGCCG ACCCCAAGCG AAAGGGGCTG
CTCTATGCCG GTACCGAGCG GGGCGTGTGG GTGTCGTTCG ACGATGGAGA AAGCTGGTCG
AAGCTTCAGC TTAACCTCCC GCCGGTACCT ATTCATGACC TGGCTATCAA GAATGATAAC
CTCATTGCGG CCACGCACGG CCGTAGTTTC TGGCTAATTG ACGATCTTAC TCCGCTTCAC
CAGCTCAAAC CCGAGTTAGC GTCGAAAGAT GTTATTCTGT ATCAGCCAAT GCCCACCTAC
CGCATGGCCG GAAGCGACAA GCGCGAGGCC GCTCTGGAAG GCGAAAATCA CCCCAATGGT
GTGATGATTC ATTACTTCAT CAAAAAAGCC GACCCGGCTG CGGAGGTTAA GCTGGAAATT
CTGGAAAGCA ACGGCAACCT CATTCGTTCG TTCAGCAGCA AAGCCAAAGA GAAAGCCGAC
CTGCTGAAGG TGAAATCTGG TGGCGACCGC TTCGTGTGGG ATATGCGCTA TCCGGGCTAC
AAGACCTTCC CCGGCATGGT GTTTTACGGC TCGCCCAACC TCGGCCCCAA AGCGGTACCG
GGCAACTACC AGGTTAGGCT GACCGTAAAT GGGCAATCCC AGACCCAGCC GTTCGAGATC
ATCAAAGACC CGCGTCTAAA AACAACGCCG GAAGATTACC AGGCCCAGTT CAGCTTCCTG
ATGAAAGTAC GGGACAAAGT AACGGAAGCT AATGAAGGCA TTATCGATCT GCGGAAAATA
AAAGAAGACC TGACGTATCT GAAGAACAAA ATGGGGTCTG ACGAGAAGAA CAAGGACATA
AACGAGGTCA TCAAGAAGTT TGAAGACGAC CTCAAAACCA TTGAGAACGA CATTCATCAG
ACCAAAAACA TGAGCGTTCA GGACCCGCTC AACTACGGAA TTAAGCTCAA CAACCGGCTG
GCGCACCTCA TGAGCGAGCA GGCCCAGGGC GATTTTCGCC CAACGAAACA GGGCGAAGAC
GTTCGCAGTA AACTCACCAA AGAGGTCGAC GAACAGCTAG TCAAATTGAA AGTGACGATT
GAAACTAACC TGCAACGCAT CAACCAGATG GCTAAGGATA AAGGGGTTGT CCTGGTTAAC
TAG
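
The gene sequence above is laid out in blocks of 10 bases, so it has to be stripped of spaces and line breaks before any analysis. The sketch below shows one way to do that and to compute the GC fraction, which could be compared with the GC content field of this record. The helper names are hypothetical, and the sample uses only the first and last lines of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First and last lines of the gene sequence, used here as a small sample only.
sample = """ATGTTTGATA AAATAACCGA AAACAGTTAT TTGATGAGTT ATACGCAGCC CGATACGCTA
TAG"""

seq = clean_sequence(sample)
print(len(seq))                   # number of bases in the sample
print(seq[:3], seq[-3:])          # start codon and stop codon
print(f"{gc_content(seq):.0%}")   # GC fraction of the sample, not of the full gene
```

Pasting the complete gene sequence into `sample` would give the GC fraction of the whole CDS.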
 
Protein sequence
MFDKITENSY LMSYTQPDTL KIYSSMHKFF LYFFLTLFVN QPAIGQATKA QATVSFEPST 
YEGLRWRELG PYRGGRSCTV TGVPNNPNLY YMGTVGGGVW RTTDGGQTWG SITDNYFGGT
IGAVAVAESD PNVIYVGEGE QTLRNNVASG TGMWRSTDAG TSWKRIGLTD SKHIARIRIH
PKNPDVVYVA AMGNLWKPNE MRGIFRSTDG GQTWKKVLYI NDQAGAADLM LDPNNPRIMY
ASTWNMKRNG YRMDSGGPDS KLWKSTDGGD TWENLSDKPG MPSGINGIIG VTVSPKNSSR
VWAIIENKDA GGVYRSDDAG KTWTKINQDR ALLQRAWYYC RIYADSQNED IVYVMNVSYG
VSKDGGKTFE LKNAPHGDHH DLWIDPNNNK RMAIADDGGA QISTDGGNNW TTYHNQPTAQ
FYRVSTDNHF PYRIYGAQQD NTSVRISHRT GSASVTEKDW DALAIGESAH LAADPLNSEV
VFGGDYKGYM TMQDLATGQE RSTNIYPDLP AGSGADAMKY RFNWNYPVFF SPHNPKKLYA
GSNHLHVSYT GGESWEVISP DLSRGEPETI KSSGGPITQD NTGAEYYANL FAATESPYTE
GEIWTGSDDG LVHVTTDGGK NWKNVTPPMS PKYNMMNSVE VNPFVKGGAY IAATSYKFGD
YMPYIYKTLD YGKTWTVMTK GIPKDEFVRV VRADPKRKGL LYAGTERGVW VSFDDGESWS
KLQLNLPPVP IHDLAIKNDN LIAATHGRSF WLIDDLTPLH QLKPELASKD VILYQPMPTY
RMAGSDKREA ALEGENHPNG VMIHYFIKKA DPAAEVKLEI LESNGNLIRS FSSKAKEKAD
LLKVKSGGDR FVWDMRYPGY KTFPGMVFYG SPNLGPKAVP GNYQVRLTVN GQSQTQPFEI
IKDPRLKTTP EDYQAQFSFL MKVRDKVTEA NEGIIDLRKI KEDLTYLKNK MGSDEKNKDI
NEVIKKFEDD LKTIENDIHQ TKNMSVQDPL NYGIKLNNRL AHLMSEQAQG DFRPTKQGED
VRSKLTKEVD EQLVKLKVTI ETNLQRINQM AKDKGVVLVN
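
The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal standard-library sketch, not the database's own method. It assumes the amino acid assignments of NCBI translation table 11, which match those of the standard code and differ only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acid string
# used for the standard code (same assignments as table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string up to the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTTTGATA AAATAACCGA AAACAGTTAT"))  # MFDKITENSY
```

The output matches the first ten residues of the protein sequence listed above.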