Gene Information |
Locus tag | Slin_4381 |
Symbol | |
ID | 8728141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5313452 |
End bp | 5316667 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003389161 |
Protein GI | 284039231 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.380441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.425173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCACT TTTTATTCCA TCAGCTTCGG CTCCTGGTCA TCGGGAGTAT GTTAAGCGTA TTGACGGTTG CTCATAGCGT ATCTGCTCAG TCAGCTAAAG GGCTGGTGAG TGGTAAAATC ACCGCCGAGG AAGACGGTGA AGCCTTGGTT GGGGCTACCG TTACCGAGAA AGGAACCACC AATGGCACTA CCTCGGATGT GAACGGCAAT TTCAAGCTGA ACGTAGCGGG CAATGCAACG CTGGTAATCA GCTTTATTGG GTACGCACCC CAGGAACTGC CCGTCAGCAA CGGAAACGGC CAGCCGCGCA CTAACTTGAC TATCGCCCTG AAAACTGACC AGCAGCAGTT GCAGGATGTG GTTGTGGTGG GATATGGTAC CCAGCGCAAA AAGGATCTGA CGGGGTCCAT CGTCAACCTG ACCAGCAAAG ACCTGGTGCC CGTACCCTCG GCAACGAGCG TCGACCAGAT GATGCAGGGC AAAGTGGCGG GGGTACAGAT TACTCAGACG TCGGGTGCGC CGGGCGGCAA TGTCAACGTG ATCATTCGGG GGATCAGCTC CATTACAGGC GGTAACTCAC CGCTGTATGT TGTGGATGGA TACGCCATTG GTACCGGCGG GGGCGGCTCC GACCTGAGCA GTTTCGGTGC CAATTCCTAT ACGGCCAGCG GACTTGCCAG TAGCAGTTCG ACTAACCGGA TTAATCCGCT CAGCATCATC AACCCCGCCG ATATTGAGTC GATTCAGGTG CTGAAAGATG CGTCGGCCAC GGCTATTTAT GGCTCAAGGG GTTCCAACGG CGTGATTATT ATAACGACAA AACGGGGTAA GCTCGGCAAG CCTACCATCA GTTTCGAGCA TTCGACGGGT ATGCAGGAAC TGGCCCGAAA AATGAAGTTG CTGACGCCCC GTCAGTACGC CGAATTTGTT GCCGAAGGGC GCGACAACGC CTGGGTATTT GCGGGTGGTA AAGCGACCGA CCCAAACAAC ATTCGCAGTA CAGCCACGCA GGTGAAGCCC GACTTTCGTA ACCCCGGCCA GTTTGCCGAT GCCGGTTACG GCACCGACTG GCAGGACGTG ATTTTCCGAA AAGGGATGGT TCAGAATTAC CAGTTGTCGG CCAGCGGCAC GAGTCGGGAC GTTAGCTATT ACGTTTCCGG CGGCTTTTTC AACCAGAAGG GCATCATCAT CGGGTCGGAT TTTAACAAGT TCACCCTCCG TACCAACATT GACGCCCAGC TCACCCCCCG CCTGAAAATC GGCGCGTCAT TTTCGGGCGC TCATTCGTAC GGCAATTTCG CGAGGGCTGA GGGACACCTG CAATTCCGGG GTCTGATCTC GGCGGCCCTC GCCAGCGACC CGACCATTCC GGTCACTAAT CCCGATGGTA CGCCTTACTC CGAATTCTCC AGTCCAACGG GCATTCCCGT CGAAAATCCG CTGATCATTG CCGCTGAGTT TTTCGATAAA CGCAACAATA CCAATGTGTT CACCAATAAC TACCTGCAAT TCGATCTGGC ACCGGGGCTT GTCCTGAAAA CGTCCATCGG GGTGAATTAC TCCAACAATG TAACCCGCTT GTGGAAGTCG TCGAAGGTCG GGCTGGCGAC CAGCCGAACG GGGGCCGCCA CCGCAGCATC GACCGAAATC AAAAGCCTGA ACTGGCTGAA CGAAAACACC ATCAACTACC GGCATAAGTT TGGTGGCAGG CACGATATTG ATGCGCTGGC GGGCTACACC ATCCAGAAAA ATTCGGACGA GGTGCTACAG GCCGGGGCTA CCGGCTTCTC GACCGATTAT 
GTGCCGTTTC TGGCCGCAGG AACCGTTTCG ACGGGCACGA ATTACATCAG CGAATGGGCC ATTATGTCGT GGCTGGCCAG GGTAAATTAC ACCTATAACG GTAAGTACCT GCTCACAGCG ACGATTCGGA AAGATGGCAG CTCACGTTTT GGCTCGAAAA ATCGCTGGGG GACGTTTCCG TCGATTTCAG CCGCCTACCG CTTGTCGGAT GAGCCCTTCA TGAAATCGGC CAGTTTCATC AGCGATTTGA AAATCAGGGC CAGTTATGGT ATTTCGGGCA ATAACCTGAT TCCCAACTAC GCCACGCAGG GCTTGCTGGG CGTTGCCCGA ACGGTGGCGA ATGGTCAGAT TGTGTCGGGT ATTATCCCAA CCAGCCTGGC CAATGACGAA CTGACCTGGG AGCAATCGGT GCAGAGTAAC GTGGGCATTG ATCTGTCGTT GTTCCAGAAC CGACTGTCGT TTACGGTCGA TGCCTATCAG GCCTACAAAA AGAATCTGCT GCTTAACGTA ACCCTGCCTT CGGCTTCGGG CTTTGGCAGC TCGGTTCAGA ACATCGGCGA GGTAGAAAAC AAGGGGATCG AACTGACGGT CAATTCGCAG AACATCGCGA AAGGGCCATT CCAGTGGAAT ATGGATTTTA ATATTAGCTG GAACCGCAAC AAAGTGCTGG CGCTCAATTC AAGTTCGGCC CGTATCGTTA CGTCCGATTA CCAGGTGGCG CAAGTTGGCT ACCCCATTTC CAGCTTCCGA CTGCTCAACA TTCTGGGCGT TTTCCAGACC CAGGAGGAGG TCAACAACAG CCCAAAACAG AACCCACGCG TGCAGCCGGG TGATTATAAG TACCAGGATG CCGACGGCAA TGGCACCATC AATACATCCG ACAGAACCAT TGTCGGAAAT CCGTGGCCCC GATATACCTG GGGACTTGGT AACCGCTTTA CTTACAAAAA TTTCGCCCTG AGCGTGAGCC TCAATGGCAC CTACGGCAAC CAGGTTTATT TTCAGGGGGG CGAGGTCAAC CTGAATGGGG CTGGGGTACA GAACCAACTG GCCGCTATGG CCGACCGCTG GAAATCGCCG GAGAGTCCGG GGGCGGGCTT GTATACGCGG GCTATCCGAA ACGACTATGC TTTCGGGTTC AGCGCGGGAA CGACCAAATA CCTGTTCGAC GGGTCATTCA CCCGCATTCG GGATGTCAAT TTATCGTACA CCTTTCCAGC ACCGGCGGTT AGCAAGCTGA AGCTTCAGGC ACTGTCGATC TATGCGGATG TCACGAACCT GTACACGTTT ACGAAGTATC CGGGCTATGA CCCGGAGGGG AGTACCGGGG GCGATAATCT GGCCAAAAGT GGCGTTGACT TCTTCTCGTA TCCAAACCCA CGGACCTACA CCGTCGGCCT GCGCGTGACT TTCTAA
|
Protein sequence | MNHFLFHQLR LLVIGSMLSV LTVAHSVSAQ SAKGLVSGKI TAEEDGEALV GATVTEKGTT NGTTSDVNGN FKLNVAGNAT LVISFIGYAP QELPVSNGNG QPRTNLTIAL KTDQQQLQDV VVVGYGTQRK KDLTGSIVNL TSKDLVPVPS ATSVDQMMQG KVAGVQITQT SGAPGGNVNV IIRGISSITG GNSPLYVVDG YAIGTGGGGS DLSSFGANSY TASGLASSSS TNRINPLSII NPADIESIQV LKDASATAIY GSRGSNGVII ITTKRGKLGK PTISFEHSTG MQELARKMKL LTPRQYAEFV AEGRDNAWVF AGGKATDPNN IRSTATQVKP DFRNPGQFAD AGYGTDWQDV IFRKGMVQNY QLSASGTSRD VSYYVSGGFF NQKGIIIGSD FNKFTLRTNI DAQLTPRLKI GASFSGAHSY GNFARAEGHL QFRGLISAAL ASDPTIPVTN PDGTPYSEFS SPTGIPVENP LIIAAEFFDK RNNTNVFTNN YLQFDLAPGL VLKTSIGVNY SNNVTRLWKS SKVGLATSRT GAATAASTEI KSLNWLNENT INYRHKFGGR HDIDALAGYT IQKNSDEVLQ AGATGFSTDY VPFLAAGTVS TGTNYISEWA IMSWLARVNY TYNGKYLLTA TIRKDGSSRF GSKNRWGTFP SISAAYRLSD EPFMKSASFI SDLKIRASYG ISGNNLIPNY ATQGLLGVAR TVANGQIVSG IIPTSLANDE LTWEQSVQSN VGIDLSLFQN RLSFTVDAYQ AYKKNLLLNV TLPSASGFGS SVQNIGEVEN KGIELTVNSQ NIAKGPFQWN MDFNISWNRN KVLALNSSSA RIVTSDYQVA QVGYPISSFR LLNILGVFQT QEEVNNSPKQ NPRVQPGDYK YQDADGNGTI NTSDRTIVGN PWPRYTWGLG NRFTYKNFAL SVSLNGTYGN QVYFQGGEVN LNGAGVQNQL AAMADRWKSP ESPGAGLYTR AIRNDYAFGF SAGTTKYLFD GSFTRIRDVN LSYTFPAPAV SKLKLQALSI YADVTNLYTF TKYPGYDPEG STGGDNLAKS GVDFFSYPNP RTYTVGLRVT F
|
| |
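The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the source database. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that Protein Length excludes the terminal stop codon (the gene sequence above ends in TAA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the record above (Slin_4381 on NC_013730).
START_BP = 5313452
END_BP = 5316667
GENE_LENGTH_BP = 3216
PROTEIN_LENGTH_AA = 1071


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end].

    Assumption: coordinates are 1-based and inclusive.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon.

    Assumption: the CDS is complete, so its length is a multiple of 3
    and its last codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

The strand field does not affect these checks: a minus-strand gene spans the same interval, and only the reading direction of the sequence changes.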