Gene Slin_4381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4381 
Symbol 
ID8728141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5313452 
End bp5316667 
Gene Length3216 bp 
Protein Length1071 aa 
Translation table11 
GC content55% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003389161 
Protein GI284039231 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.380441 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.425173 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCACT TTTTATTCCA TCAGCTTCGG CTCCTGGTCA TCGGGAGTAT GTTAAGCGTA 
TTGACGGTTG CTCATAGCGT ATCTGCTCAG TCAGCTAAAG GGCTGGTGAG TGGTAAAATC
ACCGCCGAGG AAGACGGTGA AGCCTTGGTT GGGGCTACCG TTACCGAGAA AGGAACCACC
AATGGCACTA CCTCGGATGT GAACGGCAAT TTCAAGCTGA ACGTAGCGGG CAATGCAACG
CTGGTAATCA GCTTTATTGG GTACGCACCC CAGGAACTGC CCGTCAGCAA CGGAAACGGC
CAGCCGCGCA CTAACTTGAC TATCGCCCTG AAAACTGACC AGCAGCAGTT GCAGGATGTG
GTTGTGGTGG GATATGGTAC CCAGCGCAAA AAGGATCTGA CGGGGTCCAT CGTCAACCTG
ACCAGCAAAG ACCTGGTGCC CGTACCCTCG GCAACGAGCG TCGACCAGAT GATGCAGGGC
AAAGTGGCGG GGGTACAGAT TACTCAGACG TCGGGTGCGC CGGGCGGCAA TGTCAACGTG
ATCATTCGGG GGATCAGCTC CATTACAGGC GGTAACTCAC CGCTGTATGT TGTGGATGGA
TACGCCATTG GTACCGGCGG GGGCGGCTCC GACCTGAGCA GTTTCGGTGC CAATTCCTAT
ACGGCCAGCG GACTTGCCAG TAGCAGTTCG ACTAACCGGA TTAATCCGCT CAGCATCATC
AACCCCGCCG ATATTGAGTC GATTCAGGTG CTGAAAGATG CGTCGGCCAC GGCTATTTAT
GGCTCAAGGG GTTCCAACGG CGTGATTATT ATAACGACAA AACGGGGTAA GCTCGGCAAG
CCTACCATCA GTTTCGAGCA TTCGACGGGT ATGCAGGAAC TGGCCCGAAA AATGAAGTTG
CTGACGCCCC GTCAGTACGC CGAATTTGTT GCCGAAGGGC GCGACAACGC CTGGGTATTT
GCGGGTGGTA AAGCGACCGA CCCAAACAAC ATTCGCAGTA CAGCCACGCA GGTGAAGCCC
GACTTTCGTA ACCCCGGCCA GTTTGCCGAT GCCGGTTACG GCACCGACTG GCAGGACGTG
ATTTTCCGAA AAGGGATGGT TCAGAATTAC CAGTTGTCGG CCAGCGGCAC GAGTCGGGAC
GTTAGCTATT ACGTTTCCGG CGGCTTTTTC AACCAGAAGG GCATCATCAT CGGGTCGGAT
TTTAACAAGT TCACCCTCCG TACCAACATT GACGCCCAGC TCACCCCCCG CCTGAAAATC
GGCGCGTCAT TTTCGGGCGC TCATTCGTAC GGCAATTTCG CGAGGGCTGA GGGACACCTG
CAATTCCGGG GTCTGATCTC GGCGGCCCTC GCCAGCGACC CGACCATTCC GGTCACTAAT
CCCGATGGTA CGCCTTACTC CGAATTCTCC AGTCCAACGG GCATTCCCGT CGAAAATCCG
CTGATCATTG CCGCTGAGTT TTTCGATAAA CGCAACAATA CCAATGTGTT CACCAATAAC
TACCTGCAAT TCGATCTGGC ACCGGGGCTT GTCCTGAAAA CGTCCATCGG GGTGAATTAC
TCCAACAATG TAACCCGCTT GTGGAAGTCG TCGAAGGTCG GGCTGGCGAC CAGCCGAACG
GGGGCCGCCA CCGCAGCATC GACCGAAATC AAAAGCCTGA ACTGGCTGAA CGAAAACACC
ATCAACTACC GGCATAAGTT TGGTGGCAGG CACGATATTG ATGCGCTGGC GGGCTACACC
ATCCAGAAAA ATTCGGACGA GGTGCTACAG GCCGGGGCTA CCGGCTTCTC GACCGATTAT
GTGCCGTTTC TGGCCGCAGG AACCGTTTCG ACGGGCACGA ATTACATCAG CGAATGGGCC
ATTATGTCGT GGCTGGCCAG GGTAAATTAC ACCTATAACG GTAAGTACCT GCTCACAGCG
ACGATTCGGA AAGATGGCAG CTCACGTTTT GGCTCGAAAA ATCGCTGGGG GACGTTTCCG
TCGATTTCAG CCGCCTACCG CTTGTCGGAT GAGCCCTTCA TGAAATCGGC CAGTTTCATC
AGCGATTTGA AAATCAGGGC CAGTTATGGT ATTTCGGGCA ATAACCTGAT TCCCAACTAC
GCCACGCAGG GCTTGCTGGG CGTTGCCCGA ACGGTGGCGA ATGGTCAGAT TGTGTCGGGT
ATTATCCCAA CCAGCCTGGC CAATGACGAA CTGACCTGGG AGCAATCGGT GCAGAGTAAC
GTGGGCATTG ATCTGTCGTT GTTCCAGAAC CGACTGTCGT TTACGGTCGA TGCCTATCAG
GCCTACAAAA AGAATCTGCT GCTTAACGTA ACCCTGCCTT CGGCTTCGGG CTTTGGCAGC
TCGGTTCAGA ACATCGGCGA GGTAGAAAAC AAGGGGATCG AACTGACGGT CAATTCGCAG
AACATCGCGA AAGGGCCATT CCAGTGGAAT ATGGATTTTA ATATTAGCTG GAACCGCAAC
AAAGTGCTGG CGCTCAATTC AAGTTCGGCC CGTATCGTTA CGTCCGATTA CCAGGTGGCG
CAAGTTGGCT ACCCCATTTC CAGCTTCCGA CTGCTCAACA TTCTGGGCGT TTTCCAGACC
CAGGAGGAGG TCAACAACAG CCCAAAACAG AACCCACGCG TGCAGCCGGG TGATTATAAG
TACCAGGATG CCGACGGCAA TGGCACCATC AATACATCCG ACAGAACCAT TGTCGGAAAT
CCGTGGCCCC GATATACCTG GGGACTTGGT AACCGCTTTA CTTACAAAAA TTTCGCCCTG
AGCGTGAGCC TCAATGGCAC CTACGGCAAC CAGGTTTATT TTCAGGGGGG CGAGGTCAAC
CTGAATGGGG CTGGGGTACA GAACCAACTG GCCGCTATGG CCGACCGCTG GAAATCGCCG
GAGAGTCCGG GGGCGGGCTT GTATACGCGG GCTATCCGAA ACGACTATGC TTTCGGGTTC
AGCGCGGGAA CGACCAAATA CCTGTTCGAC GGGTCATTCA CCCGCATTCG GGATGTCAAT
TTATCGTACA CCTTTCCAGC ACCGGCGGTT AGCAAGCTGA AGCTTCAGGC ACTGTCGATC
TATGCGGATG TCACGAACCT GTACACGTTT ACGAAGTATC CGGGCTATGA CCCGGAGGGG
AGTACCGGGG GCGATAATCT GGCCAAAAGT GGCGTTGACT TCTTCTCGTA TCCAAACCCA
CGGACCTACA CCGTCGGCCT GCGCGTGACT TTCTAA
 
Protein sequence
MNHFLFHQLR LLVIGSMLSV LTVAHSVSAQ SAKGLVSGKI TAEEDGEALV GATVTEKGTT 
NGTTSDVNGN FKLNVAGNAT LVISFIGYAP QELPVSNGNG QPRTNLTIAL KTDQQQLQDV
VVVGYGTQRK KDLTGSIVNL TSKDLVPVPS ATSVDQMMQG KVAGVQITQT SGAPGGNVNV
IIRGISSITG GNSPLYVVDG YAIGTGGGGS DLSSFGANSY TASGLASSSS TNRINPLSII
NPADIESIQV LKDASATAIY GSRGSNGVII ITTKRGKLGK PTISFEHSTG MQELARKMKL
LTPRQYAEFV AEGRDNAWVF AGGKATDPNN IRSTATQVKP DFRNPGQFAD AGYGTDWQDV
IFRKGMVQNY QLSASGTSRD VSYYVSGGFF NQKGIIIGSD FNKFTLRTNI DAQLTPRLKI
GASFSGAHSY GNFARAEGHL QFRGLISAAL ASDPTIPVTN PDGTPYSEFS SPTGIPVENP
LIIAAEFFDK RNNTNVFTNN YLQFDLAPGL VLKTSIGVNY SNNVTRLWKS SKVGLATSRT
GAATAASTEI KSLNWLNENT INYRHKFGGR HDIDALAGYT IQKNSDEVLQ AGATGFSTDY
VPFLAAGTVS TGTNYISEWA IMSWLARVNY TYNGKYLLTA TIRKDGSSRF GSKNRWGTFP
SISAAYRLSD EPFMKSASFI SDLKIRASYG ISGNNLIPNY ATQGLLGVAR TVANGQIVSG
IIPTSLANDE LTWEQSVQSN VGIDLSLFQN RLSFTVDAYQ AYKKNLLLNV TLPSASGFGS
SVQNIGEVEN KGIELTVNSQ NIAKGPFQWN MDFNISWNRN KVLALNSSSA RIVTSDYQVA
QVGYPISSFR LLNILGVFQT QEEVNNSPKQ NPRVQPGDYK YQDADGNGTI NTSDRTIVGN
PWPRYTWGLG NRFTYKNFAL SVSLNGTYGN QVYFQGGEVN LNGAGVQNQL AAMADRWKSP
ESPGAGLYTR AIRNDYAFGF SAGTTKYLFD GSFTRIRDVN LSYTFPAPAV SKLKLQALSI
YADVTNLYTF TKYPGYDPEG STGGDNLAKS GVDFFSYPNP RTYTVGLRVT F