Gene Slin_4520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4520 
Symbol 
ID8728284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5478561 
End bp5481281 
Gene Length2721 bp 
Protein Length906 aa 
Translation table11 
GC content54% 
IMG OID 
Productglycoside hydrolase family 9 
Protein accessionYP_003389299 
Protein GI284039369 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.815518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAT TATTCGGAAT GATGCCGAAT ATCAACGTAT CAATGATTAC ATCGAATCAA 
ATCCCGAAAA TTGGGAAAAG GATAAATTTT ATGACGCCCT CTAAGATTAT GCGTTCTGCT
CTATTTCTTT CCATTCTACT CAGTCTTATT TCAGTTAACA TACCCAACGA ACCTTCCGCT
GTTATCCGCA TTAACCTGCT CGGTTACCGA CCTGATAGCC CGAAGGTCGC CGTTTGGGGA
AGCCTGACCG ACGGGCAAAT TAATACGTTC GAACTGGTCG ATGAACAGAC GAACGGGGTG
CAGCAACAGC TACCGGCCGG GCGGGCGTTT GGCGGGTATG GACCGTTTAA ACAATCCTAT
CGGCTCGACT TTTCGGCGGT TCAGAAACCC GGTCGGTATT ACCTGCGCAC AGCCGACGGT
GCCCGATCGC CGGGGTTTCG GATTGGCGAC GATGTGTATG CGGGCGCGGC CGATTTTTGT
CTGCGCTACA TGCGCCAGCA ACGAAGCGGC TTCAATCCCT TTCTGAAAGA CTCCTGCCAT
ACGCACGATG GCTATACCAT GTACGGTCCC ATGCCCGACA GTACGCATAT CGACGCGTCG
GGTGGCTGGC ACGATGCCTC AGATTACCTG CAATACGTAA CCACCTCAGC CAATGCCACC
TATCACCTGC TGGCCGCCCT GCGCGACTTC CCCACTGCTT TCGGCGATCA GCACCAGTTA
AATGGCCTGG CGGGTGCCAA TGGCTTGCCC GATGTGCTGG ACGAAGCCCG TTGGGGACTC
GACTGGCTCC TGAAAATGCA CCCCACCGAT ACCTGGCTCT TCAACCAGCT CGGCGACGAC
CGCGACCATT CGAGTATGAG AATTCCAAAA CTCGACAGCA TGTACGGCAA AGGGGCTGAA
CGGCCGGTTT ATTTCGCAAC TGGCCAGCCA CAGGGGCTGT TCAAATACAA AAACCGCGCA
ACGGGCGTGG CCTCTACGGC TGGTAAAGTG AGCAGCGCAC TGGCACTGGG CTACCAACTA
ACCCGTAAAA AAGACCCTGC CTATGCCGGT AAACTCTGGC AACGGGCGCA GTCGGCGTAT
AAACTTGGGC TCGAAAAGCC GGGCAATTGC CAGACCGCGC CGGGTCGGGC TCCTTATTTT
TATGAGGAAG ACAACTACGC CGACGATATG GAACTGGCTA CCGTTGAACA ACTAAATGCA
ACGGGAGCTA CCGGAACACA AGCAGCCAGA TTACAAACTA CCGCACTGGA TTTTGCCCGG
ATGGAGCCTG TTACGCCCTG GATTCTGAAC GATACGGCCC GACATTATCA ATGGTATCCG
TTTGTCAACG TCGGTCATGC GGAGCTGGCC AAGCAGTTAC CGGCCCGCGA TCGGAAAATC
GTTACTGATT ACTACAAGTC GGGCATAGAA ATTATCTGGA AACGGGCGAG CCAGAACGCG
TTTTACCGGG GTGTGCCGTT TACCTGGTGT AGCAATAACC TCACAACTTC TTTCGCCACG
CAGTGCTTCT GGTATCGGCA GCTCACCGGC GACACGCAGT TTGCCGCGCT GGAACAGGCC
AATTTCGACT GGCTGTTTGG CTGTAATCCG TGGGGCACAA GTTTCGTTTA CGGCCTTCCG
GCCAACGCCG ACACCCCATC GGACCCGCAT TCGTCCTTCA CCCATTTGAA AAATTACCCC
ATCGACGGTG GACTGGTGGA TGGTCCTGTT CGAGGCAGCA TTTACAGCAG GCTGATCGGC
ATTACCCTGC ACGAACCCGA CGAGTACGCA CCGTTCCAAA GCAACGTAGC CGTATACCAC
GACGATTATG GCGATTACAG CACGAACGAG CCAACCATGG ATGGCACGGC CTCGCTGGTT
TACCTGCTGG CCGCCAAACA TGCCGAGAGC TATAAACCAG CCAAGGCAAA TACCTCCCGG
AAACCTGAAA AACCGGCTAA GACACTAAAA CCATCGACGA AAACCGACCT CAAATCAACC
TATTTCAAGG GAGCCAAAAT AAGGGGTGAT ACCAGCGCCC GCCGATTGGC GCTGGTGTTT
ACCGGCGATG AGTTTGCCGA TGGCGGCTCC ACCATAGCCC GAACCCTGCA AAAGCACCAG
GTTCGGGCTT CGTTTTTTCT GACCGGTCGG TTTCTGCGCA ACCCCGCTTT TACCGCGCTC
ACAAAACAAC TTGCGCGGGA AGGCCACTAC CTCGGCCCCC ACTCCGACCA GCATTTACTT
TACTGCGACT GGACAAAGCG CGATAGCCTG CTCTTAACCC GGCAGCAGTT TGTGGATGAT
TTACGGGCTA ATTACGCGGC TCTTTCGGGG GTATTGGGCG GGATGAATCA AACGCCATAT
CTTGAAGATA TGGCGTTTGA TTCATCCCGC CCAATACCAA AAACCAGTAA ACTATTTCTA
CCCCCTTACG AGTGGTATAA CGACAGCATT TCCGTCTGGG CCAACGCAGA AGGCGTTCAA
CTCATCAACT ACACGCCCGG TACACTAAGC CACGCCGACT ACACCACCCC TCAGGATAAA
AACTACCGCA ACAGTGCCAC CATACTGCAA TCCATACAAA CCTACGAGCA GAAAAAACCG
GCTGGCCTGA ACGGCTTTAT CTTACTGATG CACATTGGCG TAGCCCCAAA TCGAACGGAT
AAACTGTACG ATCATCTGGA TGAATTAATC GCTGAACTTC GGCAAAAAGG GTATGCCTTT
GTGCGAGTCG ATGCCTTGTA A
 
Protein sequence
MTTLFGMMPN INVSMITSNQ IPKIGKRINF MTPSKIMRSA LFLSILLSLI SVNIPNEPSA 
VIRINLLGYR PDSPKVAVWG SLTDGQINTF ELVDEQTNGV QQQLPAGRAF GGYGPFKQSY
RLDFSAVQKP GRYYLRTADG ARSPGFRIGD DVYAGAADFC LRYMRQQRSG FNPFLKDSCH
THDGYTMYGP MPDSTHIDAS GGWHDASDYL QYVTTSANAT YHLLAALRDF PTAFGDQHQL
NGLAGANGLP DVLDEARWGL DWLLKMHPTD TWLFNQLGDD RDHSSMRIPK LDSMYGKGAE
RPVYFATGQP QGLFKYKNRA TGVASTAGKV SSALALGYQL TRKKDPAYAG KLWQRAQSAY
KLGLEKPGNC QTAPGRAPYF YEEDNYADDM ELATVEQLNA TGATGTQAAR LQTTALDFAR
MEPVTPWILN DTARHYQWYP FVNVGHAELA KQLPARDRKI VTDYYKSGIE IIWKRASQNA
FYRGVPFTWC SNNLTTSFAT QCFWYRQLTG DTQFAALEQA NFDWLFGCNP WGTSFVYGLP
ANADTPSDPH SSFTHLKNYP IDGGLVDGPV RGSIYSRLIG ITLHEPDEYA PFQSNVAVYH
DDYGDYSTNE PTMDGTASLV YLLAAKHAES YKPAKANTSR KPEKPAKTLK PSTKTDLKST
YFKGAKIRGD TSARRLALVF TGDEFADGGS TIARTLQKHQ VRASFFLTGR FLRNPAFTAL
TKQLAREGHY LGPHSDQHLL YCDWTKRDSL LLTRQQFVDD LRANYAALSG VLGGMNQTPY
LEDMAFDSSR PIPKTSKLFL PPYEWYNDSI SVWANAEGVQ LINYTPGTLS HADYTTPQDK
NYRNSATILQ SIQTYEQKKP AGLNGFILLM HIGVAPNRTD KLYDHLDELI AELRQKGYAF
VRVDAL