Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1166 |
Symbol | |
ID | 8724899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 1419615 |
End bp | 1422869 |
Gene Length | 3255 bp |
Protein Length | 1084 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003386016 |
Protein GI | 284036086 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.689476 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG CTCTAATAGG AAGCTGGCTG CTATTGCTGT TGGTTGGTTT GCCTGTGTTA GCCCAGGAAA TAGCCATAAC TGGCCGGGTC ACCTCATCAG ATGATGGCTC CGCATTGCCC GGTGTGAGTG TTGTCGTCAA AGGATCGACC CGCGGCACCA CAACCGATGC CAATGGTACT TATCAGATAA ATGCAGGCTC CGCCACTACA CTGACATTCT CGTTCGTTGG CTTCAAACCA CAGGATGTAG CGGTGGCTAA TCGTACGACA ATCAACGTAG TTTTAGCCGC CGATGCCTCC ACACTGAACG AAGTCGTTGT TACAGGATTC GGGATTCGGC GCAACGAGCG CGAAATTGGC ACATCTGTTA CCAAGATCAA TAACACATTG ATCAACCAGG CAGCTCCCGT TAACCTGGCG AATGGCCTGA CCGGTAAAGT GGCCGGGCTG CAGATCAACG CGGTCAATAA CGGCGTCGGT TCGAACCCCC GCATAACCAT TCGGGGAAAT CGTTCGTTTC TGGGCAACAA CCAGGCGCTA CTGGTCGTAG ACGGTGCTCT GACGGACGTG AGCTTTCTGT CGGCCATCAA CCCCAACGAC ATTGAAAGCA GCTCCATCTT GAAAGGGCCG AGCGCAGCGG CTTTGTATGG TTCCGACGCG GCTAACGGTG TACTGGTCAT TACAACCCGG CGCGGTACAA CCAACAACAA GCCCCAAATT TCTTACACTA ACAACACCCA GTTGGAGAGT GTGTCGTACA TGCCTGATCT ACAGCGCCTG TATGGCTCCA ATGGGGGCGA AGGTGCTCCT TTTCTGGATG CCAACGGCCA GCGGCTTTAC GTACCTTATG AGAACCAGCA GTTCGGCCCT CTGTATGATG GCTCATTACA GCCGCTGGGT TACGGGGTGC AGGTCATAAA CCCTGATGGG TCTATTCGGA TCGATACGCT GAAAGTTCCC TATGCGTCGA CAGGAAAAGA CCCGCGCCGT GCGTTTTTTA ACACGGGGGT TACGCAGCAG CACGACCTGG CCTACCGGGT GGGCGATGCA CAGAATTACT TTGGACTTAG TGTGCAGCGC GTCGATCAGA AAGGGATTGT TCCTAACGAT AAATACAGCC GTACCAACTT CACGGTGAAC GGTGGCCGGG CGGTAGACCG CTTTACGGCC AATGCCAAGA TGCAGTTCAC TTACGAAAAT ACGGATCAGG AGAACGGCGA TTTCGGCCAG GGACGCCCGC TATACTGGAA CCTGCTGAAC CAGCCCGCCC ATGCACCACT CACCGATCCG CGTATTAAGG ATATTAACTC CCCTTACGGC GATGTGAACG GCTATTTCAA CGCCTACTAC CCAAACCCCT GGTGGCAGGT AACCGGCGAC AACTCGCGGG CCGTTACCAA CAAATACTCG ATTCAGGGGA CGGCCGATGT TGGCTACAAA TTCACCGATT GGCTCAATGT TACCTATCGG GTGAGTGGTC AGGTATCCAA CACACAGTTT AAATCGCACC TGGCGGCTGT TTCGTTCAGC ACCTACGCGC TCGGCGACCC CTGGGGTGCG GGGAACATTG CCTCATCGCT GAAACAGGTG AACGGTAACG TGAGTGATTA TAGTCGTACG ACCTCCCGCG TGACAGGTGA CCTGCTGATT ACGATAGCTC CCAATTTTGG TGACTTCACT ACCAAGCTGA TTCTGGGGCA GCAGGCCCGG GTCGACTATT CACGGTACAT CTCCACGACG GCAACCTCAC TCGTAGTGCC CGGTACCTAT AACATCGCCA ACCGGCTGGG TAATGTGCTT GCCAGTGAGA ACTCCTACCA GAGTCGGCTG TTAGGTTATT TTTATGATTT CACGGCCGGG TTCCGGAATT TCGCCTTCAT CAACGCCACC GGTCGTTACG ACAACACCTC GTTGCTGGCT GCTGGCAACC GGTCCTATTT TTACCCTGGT GTGAATGCGT CGGTTATCCT GACCGAAGCC ATTCCTGCCC TGAAGGGGAG TAGCGTCCTT TCCTATCTGA AGGTGCGCGG GGGTATTGCC AGAGCAGGGA ATATCAGCGT TGGGCCTTAC CAGTTGCAGA ATGTATTTAA CCCCGGATCA GGCTTCCCCT ACGGTAGCCA GCCCGGCTTC TCGCTCAGCA CTCAGCAGAA CGACCCTAAC CTGAAGCCCG AGTTCACCAC GAACAAGGAA GTAGGTGTTG AGTTTGGCCT GTTCGACCGG GTCAATGCCG AAGTCGTGTA TTACACGATG GAAACCATTA ACCAAACGGT TCCCATTCAG GTTTCGCGGG CTACGGGCTA TGGCAGCGCG CTGATCAATA CAGGTACTAT GGTCAACAAT GGGCTTGAAG TGGAGTTGAA AACCCTCCGG CCGATTGTTA ATACCGGTGG CTTTACCTGG AATGTCAATA CGAACTTTAC CTACCTCAAC AATACGGTAA CGTCCGTTTA TCCGGGCCTG GATCGCATCA ATATTACGCA GTCGAATGGG GCGCAATCGG CTAACGTGTT TGCCGCTGTC AATTATACGT ATCCAGCTTT ATTCGGTACT GATATTGCCC GTGTGCAAAA CACCGATCCC AATGCGGCCT ATTACGATGC AACGGGCCAG TTTGTTGGCC AGCCGGTTAT TAACCCATCA ACGGGCTACC CCATTCTGGA CGCCAATATC AAGTACCTGG GCAACACACA GCCAAAATAC CGGTTCGGGT TCAACAACAC TTTCGCCTTT AAAGGACTGA CACTGAACGC CCTGGTCGAG TACCGGGGGG GCAATGTGAT TTACAACCAG TTGGGTAACG CACTGGAGTT TACAGGTGCC GGTATTCGGT CGACTTACAA CGGACGGCAG AACTTCGTCT ACCCGAACTC TGTGCTGGCC ACCACCAACC CCGATGGAAC CACCACCTAT GCGCCCAATA CCAGCGTGTC GACCCGTGAT GGCAATCTGG AGTTCTGGAC GAATTCGGGC TATCACAATG CCGTGTCGAG TTACGTGACA AGTGCGGCCT TCTGGAAGCT TCGTGAAGTA GCGTTGAGCT ACAACTTCCC TACCCAGTTG TTCAGCAATA TCAAGTTTAT CCGATCACTG ACCCTTGGCT TAACAGGCCG CAACCTGCTG ATGCTTCGGC CGAAAACAAA CGTATTCACG GACCCCGAGT TTTCGGTGGA CAACAGCAAT GCCCAGGGCG TTACGAACGA ATACCAGACA CCACCAACCC GCCAGTACGG TTTCCGGCTG AGCGTTGGGT TTTAA
|
Protein sequence | MKKALIGSWL LLLLVGLPVL AQEIAITGRV TSSDDGSALP GVSVVVKGST RGTTTDANGT YQINAGSATT LTFSFVGFKP QDVAVANRTT INVVLAADAS TLNEVVVTGF GIRRNEREIG TSVTKINNTL INQAAPVNLA NGLTGKVAGL QINAVNNGVG SNPRITIRGN RSFLGNNQAL LVVDGALTDV SFLSAINPND IESSSILKGP SAAALYGSDA ANGVLVITTR RGTTNNKPQI SYTNNTQLES VSYMPDLQRL YGSNGGEGAP FLDANGQRLY VPYENQQFGP LYDGSLQPLG YGVQVINPDG SIRIDTLKVP YASTGKDPRR AFFNTGVTQQ HDLAYRVGDA QNYFGLSVQR VDQKGIVPND KYSRTNFTVN GGRAVDRFTA NAKMQFTYEN TDQENGDFGQ GRPLYWNLLN QPAHAPLTDP RIKDINSPYG DVNGYFNAYY PNPWWQVTGD NSRAVTNKYS IQGTADVGYK FTDWLNVTYR VSGQVSNTQF KSHLAAVSFS TYALGDPWGA GNIASSLKQV NGNVSDYSRT TSRVTGDLLI TIAPNFGDFT TKLILGQQAR VDYSRYISTT ATSLVVPGTY NIANRLGNVL ASENSYQSRL LGYFYDFTAG FRNFAFINAT GRYDNTSLLA AGNRSYFYPG VNASVILTEA IPALKGSSVL SYLKVRGGIA RAGNISVGPY QLQNVFNPGS GFPYGSQPGF SLSTQQNDPN LKPEFTTNKE VGVEFGLFDR VNAEVVYYTM ETINQTVPIQ VSRATGYGSA LINTGTMVNN GLEVELKTLR PIVNTGGFTW NVNTNFTYLN NTVTSVYPGL DRINITQSNG AQSANVFAAV NYTYPALFGT DIARVQNTDP NAAYYDATGQ FVGQPVINPS TGYPILDANI KYLGNTQPKY RFGFNNTFAF KGLTLNALVE YRGGNVIYNQ LGNALEFTGA GIRSTYNGRQ NFVYPNSVLA TTNPDGTTTY APNTSVSTRD GNLEFWTNSG YHNAVSSYVT SAAFWKLREV ALSYNFPTQL FSNIKFIRSL TLGLTGRNLL MLRPKTNVFT DPEFSVDNSN AQGVTNEYQT PPTRQYGFRL SVGF
|
| |