Gene Information |
Locus tag | Slin_4801 |
Symbol | |
ID | 8728565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5855033 |
End bp | 5858257 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389578 |
Protein GI | 284039648 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
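The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 5855033
END_BP = 5858257
GENE_LENGTH_BP = 3225
PROTEIN_LENGTH_AA = 1074


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the table under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Because the gene lies on the minus strand, the Start bp and End bp values describe the position on the replicon, while the gene sequence listed in the record already reads 5' to 3' from the start codon.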
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATGA TAGCCTTTTT GTTTTATAGC TCACTATGGG CCCAGACGCC CATTCAGGGA ACCGTGCTCG ATGCCAAAAA AGAGCCACTC GTGGGCATAA ACGTATTGGT AAGAGGAACT ACGCGCGGAA CGGTTACAGA CGTGAACGGC AAGTTTACCG TCAATGCCGA CCCGGCCGCT ACGCTGATTT TCTCGGGCGT TGGGTTTGTT CGTCAGGAAG TGGCCGTTGG AAGTCGGACA GATGTGTTGG TAACGCTGGC GGAAGATAAT GCCGTACTGA GCGAAGTGGT TGTTACAGCA CTGGGTATTA AACGGGATAA GCGTTCGCTG GGGTATTCCA TTCAGGAACT GAACGGGCAA GATATTGCAA CAGCTAAAGA GGCCAATGTG GCTACGAGCC TGGCCGGGAA AATGGCCGGT GTGCAGGTGA CCCGCTCGGC CAATGGAGCC GGTGGCTCGT CGCGGGTGAT TATCCGGGGC GCTAACTCGC TCGTGGGAAA CAGCCAGCCG CTATATGTGA TCGACGGTAT TCCGATGGAC AACCAGAACC CAAGGGCTCC CGGCAGCTCG GGCGGCATCG ACTACGGCGA CGGTATTTCG AACATCAATT CCGAAGACAT TGAGACGATC TCTGTCCTGA AAGGCCCTAA CGCGGCTGCG CTATACGGGC AGCGGGGCAG TAACGGCGTA GTGCTGATCA CGACCAAGTC GGGTAAGAAC CGCAAAGGGA TCGGGGTTAA ATACGGCATT GACTACTCGC TGGGTGATGC GCTGGTACTG CCCGATTTCC AGGACGAATA CGGGCAGGGT CTGGATGGTA CGTTTACCAA CTTCCGTGGA AATGACGGTA AAATCTATAC TTGGGCAGCT GCTCAGGCCG CCGGTATTCA GGGGGTTCCC AAGATGAGCG GTGGCCGTGA CCGATTTACC CGCTCAAGCT GGGGACCGCG TATGGAGGGG CAACCTTATG AAGACCAATG GGGCAACTTG CTGAATCTGA CCCCGCAACC CAATACCTTC CAGAAGTTTT TCAATACCGA AAAACAGATG GTCAACAACC TGAGTCTGGA GGGAGGTAAC GATGCCGTGA ACTACCGGGT GGCCTATTCC AACACGAATA TCAACGGCTA TGTACCCACC AATACCCTTA ACCGGAATAA CATCAGCTTA CGGACGGTGG CTAAAGTTAC CTCGAAGTTA GAGGCCGATG TGAAGGTTAA TTACATTGCT CAGCAGGGCG TAAACCGGCC AACCGTTTCC GACGCAGCCG ATAACCCGGC CTACATCTTT ATCAGTCAGC CCCGGAGTAT GCCAATGGAT ATTCTGGCCA ATTCGGCCTG GACGGCGGCT GATATTTCCA AACAACTCGG CTACGGTACA ACGCCCTTCG TAGGCCTCGA AAAAACCTAC GCTACCAACT CGTCAACGGC GAACCCCTAC TGGACAATGT CCCGAACCCG CAACTCGGAT GAACGCCAGC GAATTATCGG ACTGGTCAGA CTGAGTTATC AGTTCAACGA CTGGATCCGG CTGACGGCCC GTACCGGTAC GGATTTCTAT ACCGATCAGC GATTCCGCTA CCGCGACAAG GGTACCTACG TGACGGCTAA TAAAAACGGG GATATTACCG AAGAGGTGAC CCGCACCCGC GAAGATAACA GCGATGTACT GCTGTCTCTT ACGCCCAAGG TTTCGGACGA CATTTCGTTC TCGTTCAACC TGGGCGCTAA CCACCAGCGT TATTACTCGC GCACAACGGG CAATACAGGT AATGAATTTA TTGCGCCTAA TCTGTTCATT 
ATCAATAATA CGCTAACCAA TTCGTATGTC TTCGGCCTGA CGGAATCGTC CATCAATTCG GTGTATGGGT CGGGGTCGGT TGGGTACAAG GAAATGGCGT TCATCGATTT CTCGGCCCGG AATGACTGGT CGTCGACGCT GTCGCCCAAA AACAACTCGT TCTTTTACCC GGCCATTAGC GGCAGCCTTA TTCTGACGGA TGCGCTGCGG TTGCAAAGTC CGACGCTGAG TTTCGTGAAG GTAAGAGCTT CCTGGGCACA GGCGGGTAGC TCGGGCAGCC CGTACCAGCT CAACGGAAAT TACTCGCTGG ACCAGTACAC CCAGGGCGGC ATTCCACTGG CTTCGTTTGC GTCGACCATT CCCGATCCAA ACCTGAAAAA TGAGCTGACT ACGTCCAATG AGTTTGGGCT GGAAGCGCGG CTGTTCAAAA ATCGGGTGGG GGTAACGGTT GCCTATTACA ATGCCAGCAC CCGAAACCAG ATTCTGAACG TACCGCTGCC GCCGTCGAGC ACCTTCACTT CCCGACTGAT CAATGCGGGT GAAATTCGTA ACCACGGCAT CGAACTGTCG GTCAATGCCA CGCCCGTTAA GCTGGCTTCC GGTTTCTCCT GGGATGCCAC ATTGAACTAC TCGCATAACC GCAACGAAGT GGTTTCGCTG GCGGAAGGGG TTTCGACCTA CATACTTGGC AGCGACCGGG GCGTGCAGGT TATAGCCACA CCCGGCAAGC CGTTTGGTAC GATTCTGGGC AACGGGTTTC AGTGGCTTCG GGATGGCTCC GGAAATCGGA TCATTGACCC CGCCACGGGC CTTCCGGTCA AAACAAATTC CAAGATCCTA TACGAAATGG GTAACGCACT GCCGAAGTGG ATTGGTGGTT TCAACAACGT ATTCCGGTAC AAAGGTCTTA CTCTATCCGG TCTGATCGAC GTTAGTCAGG GCGGCAAAGT ATACTCGCAA AGCTTGCGCG AAGAACTAGT GTACGGCACG ATCAAAAAGA CGCTGCCGGG CCGCGATGGA AGTTACGTTG CCGAAGGCGT TGTCGGTTCG AAATCGGCCG ATGGCACCTG GACCGGCACC GGGCAGGCGA ATACCAAAAC GGTACGGGCG CAGGACTATT GGAACGTCGT GGCACCGGAC AAAGACAACG TGGTAGCGGA AGAGATGCTG AACGATGCCA GCTACGTTAT CCTGCGGGAA ATGACGCTCA ATTACAGTTT GCCGGCTAAG CTGGTGAGCC ATACGCCGTT CCGCAATATC CGGGCTGGTG TGTATGGCCG GAATCTATTT TACTTACAAC GCAAGACAGA GGGCTTTGCA CCGGAAGCCT CGGCATTCAA CGTCAACAAC TCGTCGCTGG GACTCGAATC GACCGCACTG CCGTTGCTGC GGTATGTTGG GGTTAGCCTG AATGTAGAAC TGTAA
|
Protein sequence | MTMIAFLFYS SLWAQTPIQG TVLDAKKEPL VGINVLVRGT TRGTVTDVNG KFTVNADPAA TLIFSGVGFV RQEVAVGSRT DVLVTLAEDN AVLSEVVVTA LGIKRDKRSL GYSIQELNGQ DIATAKEANV ATSLAGKMAG VQVTRSANGA GGSSRVIIRG ANSLVGNSQP LYVIDGIPMD NQNPRAPGSS GGIDYGDGIS NINSEDIETI SVLKGPNAAA LYGQRGSNGV VLITTKSGKN RKGIGVKYGI DYSLGDALVL PDFQDEYGQG LDGTFTNFRG NDGKIYTWAA AQAAGIQGVP KMSGGRDRFT RSSWGPRMEG QPYEDQWGNL LNLTPQPNTF QKFFNTEKQM VNNLSLEGGN DAVNYRVAYS NTNINGYVPT NTLNRNNISL RTVAKVTSKL EADVKVNYIA QQGVNRPTVS DAADNPAYIF ISQPRSMPMD ILANSAWTAA DISKQLGYGT TPFVGLEKTY ATNSSTANPY WTMSRTRNSD ERQRIIGLVR LSYQFNDWIR LTARTGTDFY TDQRFRYRDK GTYVTANKNG DITEEVTRTR EDNSDVLLSL TPKVSDDISF SFNLGANHQR YYSRTTGNTG NEFIAPNLFI INNTLTNSYV FGLTESSINS VYGSGSVGYK EMAFIDFSAR NDWSSTLSPK NNSFFYPAIS GSLILTDALR LQSPTLSFVK VRASWAQAGS SGSPYQLNGN YSLDQYTQGG IPLASFASTI PDPNLKNELT TSNEFGLEAR LFKNRVGVTV AYYNASTRNQ ILNVPLPPSS TFTSRLINAG EIRNHGIELS VNATPVKLAS GFSWDATLNY SHNRNEVVSL AEGVSTYILG SDRGVQVIAT PGKPFGTILG NGFQWLRDGS GNRIIDPATG LPVKTNSKIL YEMGNALPKW IGGFNNVFRY KGLTLSGLID VSQGGKVYSQ SLREELVYGT IKKTLPGRDG SYVAEGVVGS KSADGTWTGT GQANTKTVRA QDYWNVVAPD KDNVVAEEML NDASYVILRE MTLNYSLPAK LVSHTPFRNI RAGVYGRNLF YLQRKTEGFA PEASAFNVNN SSLGLESTAL PLLRYVGVSL NVEL
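The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted alternative start codons, which do not matter here because this gene starts with ATG). The helper `translate` and the two excerpt variables are hypothetical names; only the first 30 and last 15 nucleotides of the record are used.

```python
# Compact codon table: bases in TCAG order, amino acids in the matching order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the Gene sequence field (start and in-frame end).
first_30_nt = "ATGACCATGA TAGCCTTTTT GTTTTATAGC"
last_15_nt = "AATGTAGAAC TGTAA"

print(translate(first_30_nt))  # MTMIAFLFYS
print(translate(last_15_nt))   # NVEL
```

The two outputs correspond to the first ten and last four residues of the protein sequence in the record, and the final TAA codon is the stop that is excluded from the 1074 aa length.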
|
| |