Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_2101 |
Symbol | |
ID | 8725839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 2532200 |
End bp | 2535316 |
Gene Length | 3117 bp |
Protein Length | 1038 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003386935 |
Protein GI | 284037005 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.693236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.298537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTTA CCCGATTTGT CGGATTTTGG ATTGTGTGGA CATGCTTTTT ATTAAGCAGT CAACCAGTTT TAGCGCAGGA AAGCCATACC GTAAAAGGAT TGGTACTCGA CGAAAACCAG AAACCGCTAT TTGGTGCCTA TGTGATCCTG AAGAACACAA CAACCGGCAC GACAACGGAT GTCAATGGAG AATTTGCCCT GAAAGTACCC GCCGGTAAGC AAACGCTGGC CATCTCTTAT CTTGGGTCGA AGCCACAGGA CATTACGGTG GAAAAGGGGG GGAATGTTAA GGTTGTACTC TCGGGCGGTG ATATTACCCT GGGCGAGACA GTCGTGGTGG GCTATGCGCA GCAAAAAAAG CAAAGCGTTG TGGGAGCCAT CTCACAAACC ACCGGGAAAG TACTCGAACG AGCCGGTGGC GTATATAGTG TAGGGTCAGC ACTGACCGGT AATGTGCCGG GGGTGATTAC TACCTCCAGC ACAGGCATGC CCGGCGAAGA AGATCCCCGC ATTCTGATTC GGGGGTTAAG CTCCTGGAAC AACAGCTCTC CGCTCATTCT GGTCGATGGG ATCGAACGCC CCATGAACAG CGTTGACATT TCCTCGATCG AAACGATTTC TGTACTGAAA GATGCGTCCG CAACGGCTGT ATTTGGCGTG AAGGGTGCCA ACGGCGTTAT TCTGATCACC ACTAAACGGG GTAAGGAAGG TAAAGCGAAC ATCAGCATAC GGGCCGACTA CACCCTAAAG GCACCCTCTA CCCTACCCAA CAAGTATGAC GCCTACGATG CCCGACGCAT CCGTAACCAG GTCATCGAAA ACGAATTAGG GCTGGAGTCG GCAGGCTGGG CAACCTACAC GCCCTACGAT ATTCTAACGA AATACCGCAA CCCGGCCAAC CTGGCCGAAG CCGAGCGCTA CCCGAACGTA AATTGGGCCA AAGAGATGTT CAAGCCGGTT ACCCAAACGT ATAGCGCCAA CCTCAACGTA TCGGGAGGAA CGTCGTTCGT CAAGTATTTC GCATCGGCCG ACTTTCTGAT GGATGGCGAC ATTTTTAAAG AGTGGGATAA CAACCGGGGG TATCAGGCGG GCTACGGATT CAACCGGATC AACGTACGGA CAAACCTTGA TTTTCAGCTG ACACCCACGA CGTTGCTGGT CACCAACCTG TCCAGCTCAA ACGGGGTCAA AAAGAGTCCC TGGGGGGCTA CCGGTGGCGA ATACGATATG TGGCAAGGTG CCTATGGCGT AGCGCCTGAT GCCATGCTGC CCCGCTATTC CGACGGTACC TGGGGCTATT ATGCGCAGGA CCCCGTGGCG GCTACCAACT CCATTCTGAA TCTGGCCCGC AGTGGTATCA TGAAGCGAAC CACGAGCCGG ATCACCACGG ATTTTACGCT GAATCAGGAT TTGAAATCAA TTGCGAAAGG CCTAAGTACA TCGGGCACCA TTTCGTGGGA TAATAGCTTT GTGGAGAGCC AACGCGGCAT CAACGACCTC TACAATGATC CGCAGACGAA GTGGATCGAC CCCGCTACGG GGGAGTCGCG CTACCGGTTT ACGTATGATG CCGCTAATAT GTTCGACTTC CAGGAGGCTA TCCGCTGGGC ACCACAGGCG GGTACAATGG ATAACGGAGC GACTTATCGG CGATTATTTT ACCAGCTCAA ACTGAATTAC AACAGGACCT GGAATAAGCA CACGGGCTCG GCTATGGGGC TGTTCAGCCG CGAAGACCGG GCGTCGGGCA GCGAGATTCC CAATTATCGG GAAGACTGGG TGTTCCGTAC CACCTACGAT TATGCCGGTA AGTACTTCGC CGAAGTCAAC GGGGCCTATA ACGGTTCCGA AAAGTTTGGT AAAGACAACC GCTTTCACTT TTTCTCGTCG GGTGCGGTGG GCTGGCTGCT ATCGGAAGAG GCATTTTTCA AGCGGGCCAA GTTTCTGGAT CTGCTCAAGC TGCGGGTGTC CTACGGAAAG ATCGGTGATG ACAACATCAA CCAGCGATGG CTCTATATGA CTCAGTGGGG ATATGACGGC CAGGCAAAAA TAGGCGAGAA CAACTCGGAT GTCAGCCCCT ATGTCTGGTA TCGGGAGTCC TCGGTGGGTA ACCCGTCGGT ACACTGGGAA ACGGTAACGA AAACGAACCT GGGAGCCGAT TTCTCCCTGT TCAACGGGAC TATTTCGGGT AGTGCGGATT ACTTCAACGA CTACCGCACG GATATTCTCA TTGCTGGTGG GTCGCGGGCT ATTCCGTCTT ACTATGGCAC CAATGCCCCC GTTGCCAACC TGGGTAAAGT GCGGGTGAAA GGCTATGAGC TGGAAGTAAA GGTAAACCAT CAGCTGACCA ACGGCATCCG GCTCTGGGCC AACGCAAACA TGACCCATGC CAAAGACCGG ATCATTGAAG CGGATAACCC AAGCCTGCTG CCCGACTACC GCAAGAGCGA GGGCAAGCAG ATCGGACAGA CCTATTCGTA CGTTAGCCAC GGCTACTACA ACAACTGGGA TGAGCTGTAT GGCAGCACCG AACAGAACAC CAACGACCGG CAAAAGCTGC CGGGTAACTT CCAGATCGTT GATTATAACG GCGATGGCAA GATCGACACT TACGACAATA TCCCATATGG TTTCCCCGAG CGTCCGCAGA ATACTTACAA CGGTACGGTT GGTTTTGAGT GGAAAGGATT CAGCGCCTTC GCGCAGTTTT ATGGCGTCAA TAACGTAACC CGTCAGGTGG TGTTCACCAG CTTTGGCTCC CGAATGAACA CTGTTTATAA CCAGGGCGAG TACTGGACGA AAGACAATCC AACGGCGAAT TCACCTCTGC CCCGTCTGAT GACGGCCACC GACCCATCGA CCTACGGCAA CTTCTATATG TACGATGGCT CCTACGTGCG CCTGAAGAAC GCCGAGGTCG CCTACACCTT CAACCGGGGC TGGATCGAAC GGTTCGGGCT GCGCACGCTG CGAATCTACG CCAACGGGAA CAACCTCTGG CTCTGGACCA AAATGCCCGA CGACCGCGAA TCGAATTTCG CGGGTACGGG CTGGGCCGCC CAGGGCGCTT ACCCGACAGT GAAGCGGTAC AATTTCGGCA TCAACATTAC CCTGTAA
|
Protein sequence | MNVTRFVGFW IVWTCFLLSS QPVLAQESHT VKGLVLDENQ KPLFGAYVIL KNTTTGTTTD VNGEFALKVP AGKQTLAISY LGSKPQDITV EKGGNVKVVL SGGDITLGET VVVGYAQQKK QSVVGAISQT TGKVLERAGG VYSVGSALTG NVPGVITTSS TGMPGEEDPR ILIRGLSSWN NSSPLILVDG IERPMNSVDI SSIETISVLK DASATAVFGV KGANGVILIT TKRGKEGKAN ISIRADYTLK APSTLPNKYD AYDARRIRNQ VIENELGLES AGWATYTPYD ILTKYRNPAN LAEAERYPNV NWAKEMFKPV TQTYSANLNV SGGTSFVKYF ASADFLMDGD IFKEWDNNRG YQAGYGFNRI NVRTNLDFQL TPTTLLVTNL SSSNGVKKSP WGATGGEYDM WQGAYGVAPD AMLPRYSDGT WGYYAQDPVA ATNSILNLAR SGIMKRTTSR ITTDFTLNQD LKSIAKGLST SGTISWDNSF VESQRGINDL YNDPQTKWID PATGESRYRF TYDAANMFDF QEAIRWAPQA GTMDNGATYR RLFYQLKLNY NRTWNKHTGS AMGLFSREDR ASGSEIPNYR EDWVFRTTYD YAGKYFAEVN GAYNGSEKFG KDNRFHFFSS GAVGWLLSEE AFFKRAKFLD LLKLRVSYGK IGDDNINQRW LYMTQWGYDG QAKIGENNSD VSPYVWYRES SVGNPSVHWE TVTKTNLGAD FSLFNGTISG SADYFNDYRT DILIAGGSRA IPSYYGTNAP VANLGKVRVK GYELEVKVNH QLTNGIRLWA NANMTHAKDR IIEADNPSLL PDYRKSEGKQ IGQTYSYVSH GYYNNWDELY GSTEQNTNDR QKLPGNFQIV DYNGDGKIDT YDNIPYGFPE RPQNTYNGTV GFEWKGFSAF AQFYGVNNVT RQVVFTSFGS RMNTVYNQGE YWTKDNPTAN SPLPRLMTAT DPSTYGNFYM YDGSYVRLKN AEVAYTFNRG WIERFGLRTL RIYANGNNLW LWTKMPDDRE SNFAGTGWAA QGAYPTVKRY NFGINITL
|
| |