Gene Information |
Locus tag | Slin_5059 |
Symbol | |
ID | 8728824 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 6169053 |
End bp | 6172196 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389833 |
Protein GI | 284039903 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
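The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein. Both assumptions are consistent with the listed values but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 6169053
end_bp = 6172196

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 3144 bp
print(protein_length, "aa")  # 1047 aa
```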
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.25287 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAACA ACTCCTACAA AATGAACCAC TTTAGTGGCT ACGTAGGTCA GGTTGGTGAA CGCCAACCCC AGTCATTCAA ACGGAAAGCG TTGGTTAGCT TTCTGTCGAT GCTGGTAATG GTGCTACTGC TGAGCAATTC GGCAGCATGG GCACAAGGCC GTACTGTTAC GGGTAAAGTG ACCGATCCGG CGGGTACAGC GTTGCCCGGT GTGAGTGTGC AGTTGAAAGG TACGCAGCGT GGCACCAACA CCGACGCTGA TGGCAAGTAT TCGCTGGCCA ACGTGCCAGA CAATGCAACG CTTGTTCTGA GCTTTATTGG TTATACCTCG CAGGAGGTTG TTGTAGGAAA CCGGTCAACG GTAGATGTAA AGTTAGCCGA CGATACCAAA GCACTCGACG AAGTAGTCGT TGTGGGTTAT GGTACCGCAA AACGGAAAGA CCTGACCGGC TCGGTTGTTC AGGTATCGGC TAAAGATTTC AACGCGGGCG TAAACCCCAA CCCACTGCAG GCCATTCAGG GTAAAGTAGC CGGTCTGGTA ATTACTTCGC CATCTGGTGA CCCCAATCAA CAACCAACCG TCCGTTTGCG CGGGTACACC TCGCTGGCGG GTGGTTCTGA CCCACTGTAT GTGGTTGATG GTATGATTGG TGTGCCCATT AGTACCATTT CCCCTTCCGA CATCGAATCG ATGGATGTAT TGAAAGATGC ATCGGCTTCT GCTATTTATG GCTCACGCGC AGCTAACGGT GTTATTCTTG TAACCACCAA GCGTGGTAAA GCGGGCAAAA CGACCGTAAC CTTTAACAAC TATGTTAGTG CCGCCGTTAT TTCAAGACGT CTTGATCTGC TTGACGGCCC GGGCTATCGC GATGCCGTAA CGTCCATCAA AGGCTCATCG GCGCTGGGCG ATTTACAGCG TTTCCCAGCG GGTAATTACA ATACCGACTG GATTAAAGAA ATCACTCGTA CCGCCATTGT GAATAACCAC GATTTGGCTA TTGCGGGTGG TTCGCCAACG TTTAGCTACC GTGGTTCGTT AAATTACATC AACAACCAGG GTATTGTTAA GAAAACGGGT TTCGATCGTA TCACAGGGCG TATTAACCTC GATCAGAAAG CGTTGGATAA CCGTCTGAAT ATCCAATATA ACCTTTCTTA TTCTGAGACA AATAAGGATT TCCCTGACAA TGGGAGCCAG GGTGCACCAA GTGTAGGTTC CTTGTTGAAT CGTGCCACTA CCTTTCTGCC AACACTGCCT ATTCGTAATG CGGACGGTTC TTACTACGAA GTAGGAGGGA GCTTCGACCT GTTCAACCCA GTTGCTATGC TCAATAATTC AGTGAATACG GCTGTGCAGC GTTATTTACA GGCGGGCGCT AACCTTCGAT ACGAAATTCT GGATGGCCTG ACCTTAGGCG TCAGCGGGCA AATTCAGCGT GACAATTCGA CAACAAGCTT TTACACCAAT CCTATTATTA AGGCGTTTTC TGCTAACAAT GGCCGTGCTG GGAGGGGCTT TTCCGAATCA AACAGCCGAC TACTGGAAAC GACGCTTAAT TATGTAAAAG GCTTTGGTAC TCAAAATAGT AACTATTCCT TATTAGCTGG TTATTCGTAT CAGCAGTTCG ACAACGACGG CTTCAACGCG TCCAACACTG GTTACTTAAC TAGTGAAATT AACTACAATA ACCTTAATTT AGGATCAGGT ACAATCATTC TGCCAGGAAG TGGTTATGTT GGCTCCTACC GTAACCAGTC GAAACTCATC TCGTTCTTCG GACGGGCCAG TGTTAACCTG 
AACGACAAGT ACAATGTAAC CGCCACGATC CGCCGGGATG GCAGTTCTAA GTTCGGGGTC AACAATAAAT GGGGTATCTT CCCATCCATT GGTGCAGGCT GGACAATCAG CAACGAGTCG TTCTTCCCAA AGGGTAATTC TCTTAATTAC CTGAAATTGC GGGCCGGTTG GGGACAAACG GGTAATTCGG AAGGTATTGC TGCCTATAAC TCGATTCAAT TGTATGGTCA GAGTGGTAGC TATTACGATG GAACGATCAG TGACTTCCTG CCAGGCTATG GTATTACCCA GAATGCTAAT CCGAACCTGA AATGGGAAGT ACTGACTCAG TCGAACGTAG GGCTGGATTT CCAACTGCTG GGTGGGCGCT TCTCGGGTAC GCTTGAGTAT TACAACAAGC TGACCAAAGA CATGCTGTAC CCCTACTCAG TACCGGCCGA TGGTAAAAAA TACTTTACCA ACATCATTCT GGCCAACGTA GGTTCGATGC GGAACAGCGG AGTTGAATTA TCGTTTGGTG GCGACGTAAT CCAGAAAGGG TCTTTCTCCT GGAATGCCCG TGTGGTGGGA GCTTACAACA AAAACACGAT TGTTAACCTG AAAAACGATG AGTTTGATTC AGGTACCGTT CGTTTCAATG CCTTCGGTGG CCGTGGCTTG TCGGACGTAT TTGCTTCGTT CATCTGGCCG GGTCAGTCGC TGGGTCAGTT CAACAACGTA CCCACCTTCA CCGGTGCTTA CTCGGCAGAT GGCCAGCCGC TGCTGAAAGC AGCTTCGGGC GATACGCCGG TAACAGACGT ATCGAAAGCT GATGCAGCCG CTGCTTTTGC TGCGGGTAGT CCGTTGAAGC AGGGTAACCC ACAGCCGTTC CTGAACGCTT CGTTCATCAA CACGTTCCGT TACAAAGGTT TTGATTTCTA CTTCCAACTG CGGGGAACCT TTGGCAACAG CATTCTGAAC AACCTGCGCT CGAATCTAAT GATTCCTGGC TCAATTCTGG AAACCAACAT GCTGAAAGAC GTAACGACAC TGCCCAAAAA CTATGGTGTG AACGTTCTGT CGACCAACTG GCTCGAAAAA GGCTCATTTG TTCGGTTCGA TAACTGGCAA ATTGGTTACA GTATCCCACT GCCAGCCAGC AAGTACATCT CGAATGCCCG CGTTTATGTA GGTGGTAATA ACCTGTTCAT CATTACCAAA TACAAAGGTA TCGATCCGGA ATTGCAGGTT AAAGGTGATC TGCCTAACAG CCTCACCCAG GCACCCAACT CGGTTGGCAT TGATGCCAGT GGTATTTATC CCAAAACGCG CACATTCCAG TTAGGCCTTA ACCTGACGTT CTAG
|
Protein sequence | MMNNSYKMNH FSGYVGQVGE RQPQSFKRKA LVSFLSMLVM VLLLSNSAAW AQGRTVTGKV TDPAGTALPG VSVQLKGTQR GTNTDADGKY SLANVPDNAT LVLSFIGYTS QEVVVGNRST VDVKLADDTK ALDEVVVVGY GTAKRKDLTG SVVQVSAKDF NAGVNPNPLQ AIQGKVAGLV ITSPSGDPNQ QPTVRLRGYT SLAGGSDPLY VVDGMIGVPI STISPSDIES MDVLKDASAS AIYGSRAANG VILVTTKRGK AGKTTVTFNN YVSAAVISRR LDLLDGPGYR DAVTSIKGSS ALGDLQRFPA GNYNTDWIKE ITRTAIVNNH DLAIAGGSPT FSYRGSLNYI NNQGIVKKTG FDRITGRINL DQKALDNRLN IQYNLSYSET NKDFPDNGSQ GAPSVGSLLN RATTFLPTLP IRNADGSYYE VGGSFDLFNP VAMLNNSVNT AVQRYLQAGA NLRYEILDGL TLGVSGQIQR DNSTTSFYTN PIIKAFSANN GRAGRGFSES NSRLLETTLN YVKGFGTQNS NYSLLAGYSY QQFDNDGFNA SNTGYLTSEI NYNNLNLGSG TIILPGSGYV GSYRNQSKLI SFFGRASVNL NDKYNVTATI RRDGSSKFGV NNKWGIFPSI GAGWTISNES FFPKGNSLNY LKLRAGWGQT GNSEGIAAYN SIQLYGQSGS YYDGTISDFL PGYGITQNAN PNLKWEVLTQ SNVGLDFQLL GGRFSGTLEY YNKLTKDMLY PYSVPADGKK YFTNIILANV GSMRNSGVEL SFGGDVIQKG SFSWNARVVG AYNKNTIVNL KNDEFDSGTV RFNAFGGRGL SDVFASFIWP GQSLGQFNNV PTFTGAYSAD GQPLLKAASG DTPVTDVSKA DAAAAFAAGS PLKQGNPQPF LNASFINTFR YKGFDFYFQL RGTFGNSILN NLRSNLMIPG SILETNMLKD VTTLPKNYGV NVLSTNWLEK GSFVRFDNWQ IGYSIPLPAS KYISNARVYV GGNNLFIITK YKGIDPELQV KGDLPNSLTQ APNSVGIDAS GIYPKTRTFQ LGLNLTF
|
| |
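The gene sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that reading, the minimal Python sketch below translates the first 30 and the last 12 nucleotides and reproduces the start and end of the listed protein. It uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The `translate` helper and the `CODON_TABLE` name are illustrative and are not part of the record.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATGAACA ACTCCTACAA AATGAACCAC"))  # MMNNSYKMNH

# Last 12 nucleotides of the gene sequence above, ending in the TAG stop codon.
print(translate("CTGACGTTCTAG"))  # LTF
```

Both outputs match the first ten and last three residues of the protein sequence in the record.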