Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_5066 |
Symbol | |
ID | 8728831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 6189913 |
End bp | 6192918 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389840 |
Protein GI | 284039910 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000778295 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCTAT TCTGCGTGAT GAGCTTCAGC GCAATAGCTC AAACGAAGGT TGCCGGGAAA GTGATTGCCG ATGATAAAAA AGAGGAGTTA GCTGGTATAA GCATCGCCGT GAAAGGAAAG GTTATTGGCA CCATTACCGA CCAGAAAGGA AATTTTTCCT TTACAACCAA CACGCCAACA CCGTTCACGG TCGCTATCTC CGGGGTTGGT TTCGAAACGC AGGAGTATGT GATCAATGGC AACCGTACGG ACCTGAACGT AAGCCTGAAA GAACAGGTGA CGATTGGTCA GGAAGTGGTC GTATCAGCCT CCAGGGTCGA AGAAAGTGTC TTGAAATCAC CGGTATCCGT TGAGAAAATG GATATTCGGG CTATTCAGTC TACACCTTCC GTTAATTTTT ATGATGGCTT AGCCAACGTA AAAGGGGTCG ATGTGGCCAC ACAGGGAATG CTGTTCAAGT CGATAAACCT GCGGGGTTTT GGCGCAACGG GTAACCCAAG AACCGTGCAG TTGATCGATG GGATGGACAA CTCGGCACCG GGTCTGAACT TCCCGGTCGA TAATATCGTA GGTGTTCCGG AGCAGGACGT TGAAAGTGTC GAGATTTTGC CCGGCGCGGC TTCTGCCCTT TACGGACCCA ATGCCATTCA GGGGCTGATT CTGATTAATA GCAAAAGCCC GTTCCTGTAC CAGGGGTTGA GCGCTAACGT CAAAACGGGT ATTATGGATG CATCGAACCG GACAACGTCT ACAACGGGTT TTTATGATGC GTCCATTCGG TACGCTAAAG CGTTCAATAA CAAGTTTGCT TTCAAGATGA ACCTATCTTA CATAAAGGCA AAAGACTGGG AAGCAACGAA TTACACGAAC CTGAATGGTG CTGGAAATTC TGATCCCAAC CGGGGAGCGG GTACGGCTGT CAACTATGAT GGCGTAAACG TGTATGGCGA TGAGAACCAG CAAAATATGC GTACGGTGGG TCAGGCGTTG ATCGGGGCGG GTCTTTTGCC TGCTGCTGCC CTGAACATAC TGCCTAACGT AAACATCAGC CGTACGGGTT ATCCCGAAGT GAATCTGGTT GATTACAACA CCAAAAGCTT TAAGTTCAAC GGCGCGCTGC ACTACCGAAT CTCGGATAAG GTCGAAGCCA TTGGCCAGCT GAATTATGGT ACCGGTACCA CCGTTTATAC CGCTACCGGC CGGTATTCGT TACGGGATTT CAGCATAGCG CAGGCCAAGC TCGAACTGCG GGGCGATAAT TTTATGGTAC GTGCCTACAC AACGCAGGAG CGGTCGGGTA AATCATTCAC GGCAGGTTTG GCCAGTATTG GCTTTAACGA GGCCTGGAAA CCAAGTGCTA CCTGGTTTGG GCAATATGTT GGGGCGTATG CAGCCGCCCG GGGAGCCGGT CAGGGAGATG ATGCCGCCCA ACTGTCAGCT CGTGGCATAG CCGATCAGGG TCGCCCGATA CCCGGTACAG AAGCGTATAA GGCTTTGTAT GATAAACTAA GCTCGACGCC CATCAGTCAG GGGGGCGGGG CCTTCTCCGA CAAGTCGAAC CTGTATCATG TAGAAGGGCT GTACAATTTT AAAAACCAGA TTAAGTTTGC CGATGTACTG GTTGGAGCCA ACTACCGGCA ATACCAGTTG GCTTCGGAAG GAACGCTCTT TGCCGATCAG GCGGCCGGAC GCAACGGTAC CATTGGTATT ACGGAGTTCG GCGGGTTTAT TCAGGCCAGT AAATCCCTGT TCAGCGAACA CCTCAAGTTG ACGGCGTCGA CCCGTTACGA CAAAAACCAG AATTTCGAAG GGCAGTTTAC GCCCCGTGTA TCGGCCGTAG CAACCTTCGG GGAGCACAAT ATCCGGTTAT CATACCAGAC TGGTTTCCGT ATTCCAACCA CGCAGAATCA GTATATCGAC CTGAAAACGC CACTGGCCCG GCTGATCGGT GGTTTGCCGG AGTTTTCGGA TCGTTACAAT CTGGCGAACT CCTACTCCCG TACTGATGTA ACCGCCCTCG GCGCTGCCAT CACCGCCAGT GCTGCCAGCC CCACCGTTCA GCAGGCCGCC GTACAGCTGA TTACGCAGCA GGTGACAGCG CAGGTGACGG CGCAGGTAAC TGCTCAGGTA AATGCCGCCG TTGCCGCCGG TCAGATTCCG GCCAGTGCTG CCGCAGCAGC CATTCAAAGT GCAGTTGCCT CTACCTTAAC TGCGGTGTTG CCCGGTCAGA TTGCCGCCAA TATCAACAAT GCAGTAACAG CCGTAGCTAT TAACAGCAAT ATTGGCAACC TGAAGCCTTA CCAGCGGCAG GCATTCAAGC CGGAGCGTGT GGCGAGCTAT GAAATTGGTT ACCGAAGCGT ACTGGGTAAG CGCCTGTTTG TCGATGCCTA TTATTACTAT AGTGTGTACA CAAATTTCAT TGGCAGCGTT ATCCTGCTTC AGCCAACGGC TCCGGTGGCT GCGGGCCTGC CGCTGGCATC CGGTGTGTTA AGCGGAGGAA CGCGGAATGC GTATTCGATG CCTGCCAACA GCAGCGAAAA AATCAATACG TCGGGTTGGG CGCTGGGTCT GAATTATCAG TTACCAAAAG GTTATGGTAT ATCGGGTAAT CTGGCCAACA ACAAGCTCAA TAACTTCACG CCAACGGCGG AGCTACAGAC ATCGGGCTTC AACACGCCGG AATATCGCTG GAACTTAGGC TTCACCAAAC GGCCTATGGC TAACTCGAAT ATTGGCTTTG CCGTTGCCTT CAAACATCAG GATGCGTTCA CCTGGGAGGG CTTTGCCGTA CCTACCGAAC TGGTGCCGAA TCTGTACGAG AAAACAATTG TACCGGCTAT CAGTAACTTC GATGCGCAGG TCAATTACAA GGTGTCAAGC CTTAAGTCGA TTGTGAAAGT GGGTGCAACC AACCTGTTTG GAAAGCCTTA CTTCCAGGCC TATGGTAGTT CATACGTTGG TTCGACCTAC TACATCAGCC TGACGTTCGA TCAACTGATG AACTAG
|
Protein sequence | MGLFCVMSFS AIAQTKVAGK VIADDKKEEL AGISIAVKGK VIGTITDQKG NFSFTTNTPT PFTVAISGVG FETQEYVING NRTDLNVSLK EQVTIGQEVV VSASRVEESV LKSPVSVEKM DIRAIQSTPS VNFYDGLANV KGVDVATQGM LFKSINLRGF GATGNPRTVQ LIDGMDNSAP GLNFPVDNIV GVPEQDVESV EILPGAASAL YGPNAIQGLI LINSKSPFLY QGLSANVKTG IMDASNRTTS TTGFYDASIR YAKAFNNKFA FKMNLSYIKA KDWEATNYTN LNGAGNSDPN RGAGTAVNYD GVNVYGDENQ QNMRTVGQAL IGAGLLPAAA LNILPNVNIS RTGYPEVNLV DYNTKSFKFN GALHYRISDK VEAIGQLNYG TGTTVYTATG RYSLRDFSIA QAKLELRGDN FMVRAYTTQE RSGKSFTAGL ASIGFNEAWK PSATWFGQYV GAYAAARGAG QGDDAAQLSA RGIADQGRPI PGTEAYKALY DKLSSTPISQ GGGAFSDKSN LYHVEGLYNF KNQIKFADVL VGANYRQYQL ASEGTLFADQ AAGRNGTIGI TEFGGFIQAS KSLFSEHLKL TASTRYDKNQ NFEGQFTPRV SAVATFGEHN IRLSYQTGFR IPTTQNQYID LKTPLARLIG GLPEFSDRYN LANSYSRTDV TALGAAITAS AASPTVQQAA VQLITQQVTA QVTAQVTAQV NAAVAAGQIP ASAAAAAIQS AVASTLTAVL PGQIAANINN AVTAVAINSN IGNLKPYQRQ AFKPERVASY EIGYRSVLGK RLFVDAYYYY SVYTNFIGSV ILLQPTAPVA AGLPLASGVL SGGTRNAYSM PANSSEKINT SGWALGLNYQ LPKGYGISGN LANNKLNNFT PTAELQTSGF NTPEYRWNLG FTKRPMANSN IGFAVAFKHQ DAFTWEGFAV PTELVPNLYE KTIVPAISNF DAQVNYKVSS LKSIVKVGAT NLFGKPYFQA YGSSYVGSTY YISLTFDQLM N
|
| |