Gene Slin_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1024 
Symbol 
ID8724754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1242306 
End bp1244240 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content54% 
IMG OID 
Productglycosyl hydrolase family 88 
Protein accessionYP_003385874 
Protein GI284035944 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTTA GTCTTTTGGG CAGTCTGTTA CTTGTCGGCT ACGCAACGCT GGCGCAGCCG 
ACGGTTAACA AAGCCGCGCG CGATACCCCC TGGTCACAAC GAATGGTGGC TACGATCATG
GCGAATAATG CCGATTCCAT TGCTTATGTC AGGGAAGGTA AAGACGCGCG CTGGGAGTAT
GAAATGGGGG TGCTTCTCGA AGCTGTGGAA CAGGTCTGGT ACCGCACGGC TGATGATCGG
TATTTTCAGT ACATCCAGAA AAATATGGAC CGGTATGTAA ACGCCGACGG CGACATCCGA
ACCTATAAAA TGGACGATTA CAATATTGAT TTTGTAACCC CCGGCCGGGC TTTGCTTCTG
CTTGCGCAAC AATCGCTGCC GGGCAATGCA AAATACCGGA AAGCCGCTGA GGTGCTCCGG
AAGCAGCTTG CTCAACAGCC GCGTACGAAG GAAGGCGGCT TCTGGCACAA AAAGCGCTAT
CCGTATCAGA TGTGGCTCGA TGGCCTCTAC ATGGCCGAGC CGTTCTATGC GGAGTACACC
CGCCTGTATG GCGATGAAAA GAACTTTGAT GATATCGTCA ACCAGTTTGT GTGGATGGAC
AACCACGCCC GCGACGAAAA AACGGGTTTG CTCTATCATG GCTGGGACGA AAGCCGGGAG
CAGAAATGGG CCAACAAGCA AACGGGCAAG TCGCCCAACT TCTGGAGCCG TTCTATAGGC
TGGTACGTAA TGGCGCTCGT CGACGCGCTC GACTATGTAC CGCAGTCGCA CCCGCGCCGG
GGTGAACTGG TGGCGATATT GCAGCGTGTA ATGCCTGCCA TCGTAAAATA CCAGGACCCA
AAAACTGGCT GCTGGTATCA GGTGACCGAC CGGCTTGGCG ATAAAGGAAA CTACATCGAA
GCATCAGGAA CGTCGATGTT TGTGTATGCG CTGGCCAAAG GTGTTCGGAT GGGGTATCTG
CCAGCATCGC TGGCGGCACC GGCTCAAAAG GGGTATGCCG GTATTCTGAA AAACTTTATC
ACAACCGATG CCCAGGGCCA GATTCACCTC GAAAAGACAG TTCTGGTAAG CGGCCTTGGC
GGGAATCCGT ACCGCGACGG GAGCTATGAA TATTATCTGA GTGAGCCACT GCGTCAGGAT
GATTTGAAAG GCGTTGGTCC GTTCATTATG GCCAGCGTGG AAATGGAAAT AGCCGCTGAG
CGAGGCATCG CCAAAGGGAA AACCGTGGCC GTCGACAACT ACTTCAATCA CGAGTTCCGG
AAAGGAGTAA ATGGTGAGCA GGAACCGTTT CATTATACCT GGGAAGACCA GATGCATTCG
GGCTTTTACT GGTGGGGAAC CATCTTCCGG AATCTGGGTG CTAAAACCGC TACTATTTCG
GGGGCGCCAA CGGCCGCATC CCTGAAGGGA GTAGACGTCT ACATCATCGT TGATCCCGAT
ACGCCAAAGG AGACGGCGAA ACCCAATTAC GTTCGCGATG CAGATATCAA CGCTATTGCC
GATTGGGTGC AGGCCGGTGG TACGCTGGTG CTGATGGCCA ACGACACGTC GAACTGCGAG
CATGTGAACT TCAACAAGCT GGCCGCTCGT TTTGGCATGC AGTTTCTGCC CAAAAACCGG
AACATGGTAC AGGGTACGCA GTGGGATCAG GGGACAATCT CGATTCCCGT CGGAGGGCAG
TCTATTTTTC CGACCACCCG GACGGTCTAC ATCAAAGAAC TGGCCCCCCT CGCTGTGAAG
GCTCCTGCCA AAGCCGCTGT GACCGACGGT GGCGACATTA TCATGGGTGT GGCTACAGTG
GGCAAGGGTA GGGTCTTTGC CGTGGGTGAT CCCTGGCTCT ACAATGAGTA TGTCGACGGT
CGTCGTATAC CTGCCAAATT TCAGAACTTC CAGGCTGCCA AAGAATTGGC TACCTGGCTG
GTGGGTGGTA AATAG
 
Protein sequence
MKFSLLGSLL LVGYATLAQP TVNKAARDTP WSQRMVATIM ANNADSIAYV REGKDARWEY 
EMGVLLEAVE QVWYRTADDR YFQYIQKNMD RYVNADGDIR TYKMDDYNID FVTPGRALLL
LAQQSLPGNA KYRKAAEVLR KQLAQQPRTK EGGFWHKKRY PYQMWLDGLY MAEPFYAEYT
RLYGDEKNFD DIVNQFVWMD NHARDEKTGL LYHGWDESRE QKWANKQTGK SPNFWSRSIG
WYVMALVDAL DYVPQSHPRR GELVAILQRV MPAIVKYQDP KTGCWYQVTD RLGDKGNYIE
ASGTSMFVYA LAKGVRMGYL PASLAAPAQK GYAGILKNFI TTDAQGQIHL EKTVLVSGLG
GNPYRDGSYE YYLSEPLRQD DLKGVGPFIM ASVEMEIAAE RGIAKGKTVA VDNYFNHEFR
KGVNGEQEPF HYTWEDQMHS GFYWWGTIFR NLGAKTATIS GAPTAASLKG VDVYIIVDPD
TPKETAKPNY VRDADINAIA DWVQAGGTLV LMANDTSNCE HVNFNKLAAR FGMQFLPKNR
NMVQGTQWDQ GTISIPVGGQ SIFPTTRTVY IKELAPLAVK APAKAAVTDG GDIIMGVATV
GKGRVFAVGD PWLYNEYVDG RRIPAKFQNF QAAKELATWL VGGK