Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_2991 |
Symbol | |
ID | 8726742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 3616965 |
End bp | 3620213 |
Gene Length | 3249 bp |
Protein Length | 1082 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003387801 |
Protein GI | 284037871 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.40723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCATG CTTTACGACA GAGTTACCAA AAACTGCCGC TGTTTATTTT CTGCTGGCTT TTTTGCCTGG GAGCTTTTGC TCAGGAGAGA AAAATCACGG GGCGAATTAC AGATGGTAAT GACAATAGCG CACTTCCCGG TGCAAACGTA GTTGTCAAAG GCACACAAAC GGGCGTAGTG ACGGATGCCA ATGGGCAATT CTCCTTAAAC GTGGCAACCG GCCGGGACGT ACTCACAATT TCGGCCATTG GCTATGCCTC ACAGGAAGTT ACCATTGGGG CGCGCACGTC GCTGAACATT TCGTTATCCC CCGATATCAA AACGCTTAAT GAAGTTGTCG TAACGGGTTA TGGTGCACAG GCCAAACGGG ATATTACGGG TGCCGTAGCA ACTGTCGATA CCAAACAACT CCTCTCGGTT CCCTCAACCA ACGTTGGTCA GGCTCTACAA GGTCGTGTTG CGGGGGTTCA GGTAGGGAAT GAAAACTCCC CCGGTGGCGG GGTCATGGTT CGTATCCGCG GTTTCGGTAC GATCAACGAT AACTCGCCCC TGTACGTCAT TGATGGCGTG CCTACCAAAG GCAACCTGAA TACATTGAAC CTGAATGATG TAGAGAGCAT GCAGATTCTG AAAGATGCGT CTGCTGCATC TATTTACGGC TCACGCGCCG GTAATGGTGT GGTTATTATC ACAACCAAGA AAGGAAAAGC CGGAAAGCCC AAATTTACGT ACGATACGTA CTACGGTTCG CAACGGCACG GTAAGCTGCT CGATATGCTG AACACGCAGG AGTACGCCGA CCTGATCTGG GAATCCCGCA GAAACTCCGG TGTACTAGGC CCCAATGGAA ACCCGGTTCA CTCGCAGTTT GGCAATGGTG TAACCCCGGT CATTCCGGAT TTCGTGCTCC CTACTGGTGC CTCGGCCAAC GACCCTCGTT TAGCCCGAAA CCCGGATGGA ACGTATGTCA ACTACAATAA CGACATCAGC TCGCCAGGTT TTCTGCTGAT TACGCCGTCC AATAAAACAG GAACCAACTG GATGGAAGAG ATTTTTACCA CGGCCCCCAT CCAGAACCAT CAGTTAGGCG TATCGGGAGG TAGTGAAAGC GGTCGTTACG CCATGTCGCT GAATTACTTC AACCAGGATG GTATTATGAA GTATACGGGC TACAAACGCT ATTCGTTACG GGCTAATACC GAGTTTAACG TCAACAAACG GGTTCGTGTT GGCCAGAACT TCCAGGTAGC TTATGGCGAG CGCATTGGTC AGCCAAATGG TAATAACGCC GAAAGTAACC CCGTTTCGTT CGCCTACCGT ATTCAGCCGA TCATTCCGGT TTATGACGTA GCCGGAAATT TTGCAGGTAC ACGCGGGGGT GACCTCGACA ATGCCAATAA CCCGGTTGCC CTGCTGTACC GCAATAAAGA CAACGTTCAG AAAGAAGTCC GGCTATTTGG TAATGCCTTC GCTGAAGTCG ACATCCTTAA AAACCTGACA GCCCGCACTA GCTTCGGTAT TGATTACAAC CTTTATAACT ACCGAAACTA TACCATTCGG GACATCGAAT CGGCCGAAGC ACGCGGTTCG AACCAGCTCC AGACCAACAA CAATTATGAA TGGACCTGGA CCTGGTATAA TACGCTGACA TATAATGTTA ACCTCGGCGA CCGGCATCGT TTCAATGTAA TCGCCGGTAC CGAGTCGATC AAAAACTATT TTGAAACCTA CGATGCTACC CGGACAAATT TTGCGGTAGA CGACATCGAA AACCGCTACC TGAGTGCCGG TACGGGAGTT CAGACCAACA ACGGAGGTGC GTCGAACTGG CGGCTGGCAT CGGAGTTTGC TAAAGTTAAC TACGCGCTCG ACGATAAGTA TCTAATCGAC CTGACCGTCC GGCGTGACCG GTCGTCCCGT TTTGCGAAGG AATTCCGGTC GGCGGTATTC CCGGCTGCGA GCGTAGGCTG GCGTGTTTCG AAAGAAAACT TCTTTAAGCC ACTCACGCTG TTCGATGACT TGAAATTCCG CGCAGGTTGG GGCCAGACGG GTAACCAGGA GATTGGTAAC TACAACTCGT TCACCCAGTT CAGCACAAAC CCTATTACGT CGTTCTACGA CATCAACGGC ACGCGTACGT CGGCGGTGCC CGGTTATGAA CTTACCCAGT TTGGTAACGC CAAAGCTAAA TGGGAAACCA CGACCAGCCT GAACATTGGT TTCGACGCCA GCCTGCTTAA AAACAAGCTG ACCGTTGGCT TCGACTGGTA TACCCGCACC ACCTCCGACA TGCTTTTCCC TGTTCAGGCC CCGCTCACTC AGGGCGTAGC CACGGTGCCT TTCCAGAATA TTGGTTCGAT GCGTAACCGT GGTATTGACC TGATGATCAA TTACGGCGAT AAGATCGGTT CCGGCGGCCT CACGTATAAT GTAGGTGCCA ACTTCAGCAC CTACCGCAAC GTGGTAACGA AGACGAATGG TGATCCCAGC ACGCAGTACT TTGGTATTAA CGATGAGCGG ATTCAGAACT TTGTGGTCAC CCAGCAAGAC TACGCTATTT CTTCCTTCTT TGGCTATACA ATCGACGGCA TCTTCCAGAC CAACGAAGAA GCTAAAGCTG CGCCAATCCA GTTTGGTAAC GCAGCCGCAG AGAACGTAGC TGGTCGCTTT AAATTCCGCG ACATCAATGG TGATGGTAAA ATTGACACCA AAGACCTCAG CATCATTGGC AGTCCGCATC CGAAGTTCAC ATATGGCCTT AATCTCAATC TGAACTATAA AAACTTCGGA CTAACCCTGT TTGGACAAGG GGTTGAGGGC AATCAGATCT TCAATTACAC CAAATACTGG ACGGACTTCC CAACGTTTGG CGGTAACCGC AGCTCCCGCA TGCTGTATCA ATCCTGGCGG CCCGGCAAAA CGGACGCTAT TCTGCCCCAG CTTCGCTCAA GCGATCAGGT TAGTATCCAA CCGTCTACCT ATTACCTGGA AAGCGGCTCA TATTTCCGGA TGAAAAACAT CCAGCTTACC TACCAGCTGC CACAGTCACT GCTCTCGAAA CTGGGTGTTG GCGCTACCTC AATTTACATT CAGGGCCAGA ACATGTTCAC CATCACCAAA TACTCCGGCA TGGACCCTGA AATTAACCTG CGTAGCTATT CGGCCGGTAA CGACCGCCAG ATTGGCGTAG ATGGCGGCTC TTACCCGGTA GCCAAAACCG TATTAGTTGG TTTGAACCTG TCATTTTAG
|
Protein sequence | MKHALRQSYQ KLPLFIFCWL FCLGAFAQER KITGRITDGN DNSALPGANV VVKGTQTGVV TDANGQFSLN VATGRDVLTI SAIGYASQEV TIGARTSLNI SLSPDIKTLN EVVVTGYGAQ AKRDITGAVA TVDTKQLLSV PSTNVGQALQ GRVAGVQVGN ENSPGGGVMV RIRGFGTIND NSPLYVIDGV PTKGNLNTLN LNDVESMQIL KDASAASIYG SRAGNGVVII TTKKGKAGKP KFTYDTYYGS QRHGKLLDML NTQEYADLIW ESRRNSGVLG PNGNPVHSQF GNGVTPVIPD FVLPTGASAN DPRLARNPDG TYVNYNNDIS SPGFLLITPS NKTGTNWMEE IFTTAPIQNH QLGVSGGSES GRYAMSLNYF NQDGIMKYTG YKRYSLRANT EFNVNKRVRV GQNFQVAYGE RIGQPNGNNA ESNPVSFAYR IQPIIPVYDV AGNFAGTRGG DLDNANNPVA LLYRNKDNVQ KEVRLFGNAF AEVDILKNLT ARTSFGIDYN LYNYRNYTIR DIESAEARGS NQLQTNNNYE WTWTWYNTLT YNVNLGDRHR FNVIAGTESI KNYFETYDAT RTNFAVDDIE NRYLSAGTGV QTNNGGASNW RLASEFAKVN YALDDKYLID LTVRRDRSSR FAKEFRSAVF PAASVGWRVS KENFFKPLTL FDDLKFRAGW GQTGNQEIGN YNSFTQFSTN PITSFYDING TRTSAVPGYE LTQFGNAKAK WETTTSLNIG FDASLLKNKL TVGFDWYTRT TSDMLFPVQA PLTQGVATVP FQNIGSMRNR GIDLMINYGD KIGSGGLTYN VGANFSTYRN VVTKTNGDPS TQYFGINDER IQNFVVTQQD YAISSFFGYT IDGIFQTNEE AKAAPIQFGN AAAENVAGRF KFRDINGDGK IDTKDLSIIG SPHPKFTYGL NLNLNYKNFG LTLFGQGVEG NQIFNYTKYW TDFPTFGGNR SSRMLYQSWR PGKTDAILPQ LRSSDQVSIQ PSTYYLESGS YFRMKNIQLT YQLPQSLLSK LGVGATSIYI QGQNMFTITK YSGMDPEINL RSYSAGNDRQ IGVDGGSYPV AKTVLVGLNL SF
|
| |