Gene Information |
Locus tag | Slin_6451 |
Symbol | |
ID | 8730235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 7821239 |
End bp | 7824427 |
Gene Length | 3189 bp |
Protein Length | 1062 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003391207 |
Protein GI | 284041277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.402176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAT TTATATTGGG CAGTTGGCTT CTGGCCCTTC TTTCCTGCTT ACCCGCTCTG GCGCAGGATT TTACGGTGAG TGGCCGGGTA ACATCCTCCG AAGACGGGGG CGGGCTGCCC GGCGTAAGCG TCCAGCTTAA AGGAACAACG CGCGGCACCA CCACCGACGC CGAAGGCAAC TATCGGTTAA GCGCGCCCGC TACGGGCCGG TTGGTATTCA GCTTCATTGG CTATGCGTCT CAGGAGATTG CCATAGGTAA CAAATCGACC ATCGCGGTGA ACATGGTGCC CGATGCTGCT AATCTGGATG AGGTCATCGT AACCACCTTT GGTACGGCAA AACGGGCATC CTTTACGGGT TCGGCAGGGA CGCTGTCAAC AACACAGATT CAGAATCGTG GCGTCAGTAA CGTGGCACAG GCACTTTCGG GTGCGGTGTC GGGCGTTCAG ACAACGGCCG GCAGCGGTCA GCCGGGCTCC GCTCCCGAAA TTCGTATTCG GGGCTTCGGC TCGATCTCGT CGGGAAATGA CCCGCTGTAT GTGGTAGACG GCATTCCGTA TTCGGGCAAT ATTGCCAACA TCAACCCCAG CGACATCGAA AGCGTGTCGG TGCTGAAAGA TGCGGCCTCA ACCGCCCTGT ACGGTGCACG GGCGGCAAAT GGTGTGGTGG TTGTAACCAC AAAAAAAGGG CTGAAAGACC GGAGTACCAT CAACGTGCGT TATACGCAGG GCTTTAGTAG CCGGGGGCTG CCCGAATACG ACCGGGTTGG CGTTGGCGAG TACTACCCGC TGATGTGGGA AACCTACCGG AACAGCATTG CCTACCGGGC CACCAACCCC GTAGCCCTGG CAACGGCAAA CGCCGATGCC ACAAACCGAC TGGTTAGTCT GGTCGGCTAC AACGTATACA ACGTGCCCGG CAATCAGTTG GTTAATACCG ACGGGCAGTT CAACCAAAAC GCCCAGCTTC TGTTTTCGCC CGACGATCTG AACTGGGAGA AGCCGATTAC GCGTCAGGGG AACCGGCGTG AACTGAACGT AAGCTTCGCT GGCGGGCAGA AAAACTCCGA CTATTTTGTG TCTTTGGGCT ACCTCAACGA CAAAGGGTAT CTGATCCGTT CCGACTTCGA GCGGTTTACG GGCCGGATCA ACATCAACTC CCAGATGAAA CCCTGGTTTC GGGTGGGGGC TAACCTGTCG ACGACCATCT CGAAGTCCAA CCAGGCGGAT GCCGATGGCA GCACCAATTT CGTGAACCCG TTCTTTTTCT CACGGAATAT TGGTCCTATC TATCCTGTGT ATGCCTACGA CCCAACTAAT GTCGGCCAGT TTTTGACGCT GCCGAACGGT CAGCGACGGT GGGATTACGG GAACCTGACG TCCCTGGGCT TACCGGCCCG GCCCCAGTTT GGTGGTCGCC ATTCGGTTGC GGAGACGTTG CTGAACCAGA ACTTCCTGCG CCGTAACGTA CTGGGCGCAC GGGGTTTCGC CGAAGTTTCG TTCCTGAAAG ATTTCAAGTT TTCGGTGAAC GTAGGTACGG ACATTACTAA TACGAATGTG TTCACCTATG GCAACCCCGA AGTGGGTGAC GGCGCTCCGG CGGGCCGTGC GAACCACCAG TTTCAGAACA TCACCAGCTT CAACCTCAAC CAGCTACTGA ACTACAATAA GTCGTTCGGG AAAAATACGT TCGATGTGCT GCTGGGCCAC GAGAACTTCA GCATCAACGA CAACAACCTG GAAGGCTCGC GTTCGCAGCA GATTGTGGAC GGTAACTACG AGTTAGGCAA CTTCACGACC 
ACAACCTTTC TGTCGTCGGT GTACAACACG CGTCGGGTAG AAGGCTATTT CTCCCGTATC AACTACGACT ACGACCAGAA ATACTTCCTC TCTGCTTCGG TTCGGCGCGA TGGCTCCAGT AAGTTCTATC GGGATTCCCG CTGGGGTACG TTCTACTCCG TGAGTGGTGC ATGGCGTATC GACCAGGAAG ATTTTCTGCG GTCTATTCCA ACCATCAACT CCCTGAAACT ACGGGCTTCG TATGGCCAGA CCGGTAATGA TGGCGGAGGA AATACGGCGG CTTTTGCTCA GGACAATACC ATCAGTTACT ACGCCTGGCA GCCGCTGTTC GGCTTAGGGA GCTGGAACAA CGCATCGGAA GCGGGTATTC TGCAAACGAG CCTTGGCAAC CAGAACCTGG CCTGGGAATC GAGCAACTCC TTCGATGCCG CGCTGGAGTT CAGCCTGTTC AAAGGGCGCG TTTCCGGTAC GGTTGAGTAC TTCGACCGCC GGTCGTCCAA CCTGATCTTC GCCGTACCCT TGCCCCTATC GGATGGTATT TCGACGGTTA CCCGAAATAT CGGCACGATG TACAACCGGG GTATGGAGAT TGAACTGGGC ATTGAACCCA TCCGAACGAA GGACTTCACC TGGCGCATCG ACCTGAACGC CACCCGCGTG AAAAACCGGA TTACGAAGAT GCCCGATGAG AATCCCGAAA TTATTGATGG CACGAAGAAA CTGGCCGTTG GCCGGTCTAT CTACGACTAC TGGCTCCGCG AATACATGGG CGTGAACCCC ACAACGGGCG AAGCACAATA CAGGGCGGCA AACTATGTAG CATCGAACTC CCGCATTACC GAAGGCGGTG ATACGCTGAC AACCAGCGTC AACAATGCGC GCTACCATTA CAATGGCTCG TCTATCCCCA CGGTGTCGGG TGGATTTACG AATACCTTCC GCTACAAAGG CATTACCCTG TCGGCGCTGA CCGTATATCA GTTAGGCGGT AAAACCTACG ACGGAGCCTA CGCAGCCCTG ATGAGTTCGG GCGGGTATGG CAGCGCCAAA TCCGTCGATA TTCTGAACCG GTGGCGGAAC CCCGGCGACA TCACCAATGT GCCCCGTATG GATGCTGGAC GCACGTCGGA TTTTGATGCC GCATCGGACC GCTGGCTCAC GAATGCCAGC TACCTGAACC TGCGTACGGT AACGCTTTCG TACGCGCTGC CCGCTACCTT GTCGCGCAGA GCCTTCCTGG AGAATGCACA GGTGTACATC ACTGGCGAGA ACTTCCTTAT CCTGTCTCAC CGGAAAGGGA TGAACGTTCA GCAAACCTTC ACGGGGGTAA CCAGCAACGT ATTCAGCCCA GCCAAAAGCA TTATTCTGGG TGTTTCATTT ACGCTTTAA
|
Protein sequence | MGKFILGSWL LALLSCLPAL AQDFTVSGRV TSSEDGGGLP GVSVQLKGTT RGTTTDAEGN YRLSAPATGR LVFSFIGYAS QEIAIGNKST IAVNMVPDAA NLDEVIVTTF GTAKRASFTG SAGTLSTTQI QNRGVSNVAQ ALSGAVSGVQ TTAGSGQPGS APEIRIRGFG SISSGNDPLY VVDGIPYSGN IANINPSDIE SVSVLKDAAS TALYGARAAN GVVVVTTKKG LKDRSTINVR YTQGFSSRGL PEYDRVGVGE YYPLMWETYR NSIAYRATNP VALATANADA TNRLVSLVGY NVYNVPGNQL VNTDGQFNQN AQLLFSPDDL NWEKPITRQG NRRELNVSFA GGQKNSDYFV SLGYLNDKGY LIRSDFERFT GRININSQMK PWFRVGANLS TTISKSNQAD ADGSTNFVNP FFFSRNIGPI YPVYAYDPTN VGQFLTLPNG QRRWDYGNLT SLGLPARPQF GGRHSVAETL LNQNFLRRNV LGARGFAEVS FLKDFKFSVN VGTDITNTNV FTYGNPEVGD GAPAGRANHQ FQNITSFNLN QLLNYNKSFG KNTFDVLLGH ENFSINDNNL EGSRSQQIVD GNYELGNFTT TTFLSSVYNT RRVEGYFSRI NYDYDQKYFL SASVRRDGSS KFYRDSRWGT FYSVSGAWRI DQEDFLRSIP TINSLKLRAS YGQTGNDGGG NTAAFAQDNT ISYYAWQPLF GLGSWNNASE AGILQTSLGN QNLAWESSNS FDAALEFSLF KGRVSGTVEY FDRRSSNLIF AVPLPLSDGI STVTRNIGTM YNRGMEIELG IEPIRTKDFT WRIDLNATRV KNRITKMPDE NPEIIDGTKK LAVGRSIYDY WLREYMGVNP TTGEAQYRAA NYVASNSRIT EGGDTLTTSV NNARYHYNGS SIPTVSGGFT NTFRYKGITL SALTVYQLGG KTYDGAYAAL MSSGGYGSAK SVDILNRWRN PGDITNVPRM DAGRTSDFDA ASDRWLTNAS YLNLRTVTLS YALPATLSRR AFLENAQVYI TGENFLILSH RKGMNVQQTF TGVTSNVFSP AKSIILGVSF TL
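
The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal Python illustration, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a stop codon (the gene sequence above ends in TAA). The function names `gene_length`, `protein_length`, and `clean_sequence` are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced, complete CDS.

    Assumes the final codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in 10-letter blocks."""
    return "".join(blocked.split())


# Values taken from the record for Slin_6451.
length_bp = gene_length(7821239, 7824427)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)

# The first two display blocks of the gene sequence, joined into one string.
print(clean_sequence("ATGGGAAAAT TTATATTGGG"))
```

Under these assumptions the computed values agree with the Gene Length (3189 bp) and Protein Length (1062 aa) fields above. The strand field does not affect the length calculation; it only determines which strand of the replicon the sequence is read from.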