Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3420 |
Symbol | |
ID | 8727173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4144486 |
End bp | 4147716 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003388227 |
Protein GI | 284038297 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0439065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAAC TCTTACCACC TAAACTCCTG TCCGATAGGT TAATTAGAGT TTCAAGCACT CAACTCCTAC TGGCAGCTTT GTGTGTTAGT TTTACTTATG CGGGAAAGCC GGTTAATCCA AAATTCCCCG TCAATCAGTC AGTGAAGCAG GCTGACCGTA CACTCACAGG CCGGGTAACA GACGAAAAGT CCGAAGGACT TCCCGGCGTG AGTGTTATCC TGAAAGGAAC CCAGCGCGGA ACCGTAACCG ATGCCGATGG ACAGTATAAA CTTGACGTAC CCGATGGGGC CTCTACACTT GTGTTCTCCT TTGTTGGTTA CCTGCCACAG GAAGTTAGTG TTGGGAATCA AACGTCAATC AACGTCAGCC TGAAAACCGA CAGTAAAGTA CTGGATGAGA TCGTAGTCAT CGGCTATGGT ACGACGCGCA AATCCGACCT TACCGGCGCT GTCACCGGCG TGAAGGAGGC CCAGTTGCAA GAGCGCCCTG CGCCTTCGTT GAACCAGGCC CTGTCAGGTC GCATGCCCGG CGTGCAGGTC AACACCAACT CGGGACGACC CGGCGGTCGG ACCACCGTCC GTATCCGGGG CTTCAGCTCC ATCAACTCCT CCAACAACCC CCTCTACGTC GTTGATGGCG TCATGCTCCC CCAAGGTACC GGCGACCAGT TCAGTAACCC AATCGATTAC ATCAACCCCA ACGACATCGT CAACGTGGAG GTCCTGAAAG ATGCCTCTTC GACGGCTATC TACGGGGCTC GTGGCGCCAA CGGCGTTATT CTGGTACAAA CTCGTAAAGG GAAAGCCGGT GAAAGCCGGG TCACCTACGA CGGTCAGTTC AGCGTTAACA CCATCGGACC CAACAAGCCA AAGGTGCTCA ACGCCAAGGA GTACCTGGCT ACCGAAGACC TCGCCTATGC CAACATGGCC AAGTATGACC CCGTTGGCTG GGCCGCGGGT AAGTGGTCTT ACCTGGACCC GATAGCCCGG CGCAAAGCCT TCAGCGCGGC TCACCCTGGT GTGTTTGATG CCAACCTGAA CCCACTCTAC GACACCGACT GGTTCAAGGA GTCGGCTCAG AACAAGCTTT CCCAGAACCA CCAGTTAGGT TTCAGTGGCG GTAACGAGCG CACCCAGTAC TCCCTCTCGC TGAACTACCG CGACGATCAG GGTCTGATCA AGACCTCCTA CATGAAGCGT TACTCGGGTC GTTTCTCGAT CGATGATCAG GTCAAGAGCT GGCTTAAGAT TGGCGGGACA ATGAGCTACA ACTACCAGAC GGAAAACCTG GTGGACATCA ACGATGCGGT GGCCCGTCAG ATCGTCGAAG ACTTCCCCTT CCTGCCCGTA CGCTACCCGG ACACCGGCGT CTTCGCCGAG AACCGGGACT ACCCTTATGC AGAAGGCACC ATGAGTTCGG TACACCGCCT GATGGACCGT AAGTACATCC AGAACACCCA GACTATTCTG GGCAGTTTGT TTACCAACAT CACTTTCGGC AAAGGGCTGG AGATGCGTAC AGTACTGGGT ACCAACGTCC AGACGCAGGA GATTAACCAG TCGCAAACCC GTACGCTTAA CATTGGCAAT AACGGTAACG CATCGACCAA CAACAACCGG CAGAATTTCT GGTCGTTGGA GAACTACCTG ACCTACAACA AACAGTTTGG TCAGGATCAC TCCTTCACCG GACTACTAGG TCTGTCGTGG CAGGAGACTA ACACCTTTGG CATCGGTGCC AGCGTAAGCG GTTTTGCCAC CGACTACTTC GGCTTTAACA ACCTGGGCGC TGGTGCGATC AACCCATCGG TGAGTTCGGG TGCTTCACGG TTTGCCTTTA ACTCCTACTT CGGTCGGATC AACTACGGCT ACAAGAACAA GTACCTCTTC ACCGCTACCG GCCGGGCTGA TGGCTCCTCG AAGTTCGGAG AAAACCACAA GTTTGCCTTC TTCCCCTCGG CGGCTCTGGC CTGGCGGGTA TCGGAAGAAG ACTTCCTGAA AGGCAACCCC GTTATCTCGA ATTTGAAAGT GCGCACCAGC TACGGTCTGA CGGGTAACTC CGAGATTCCG CCTTACTCCT CGCTCTCGCT GTTGAGTTCA AACTACGCTA CGATCTATAA TGATACGAAG GTGAGTGGCA CGGGTATCAA CCGTCTGGCT AACCCCGACC TGCGCTGGGA AAAAACCGCT CAGACTGATG TAGGTCTGGA AGTTGGCTTC CTCAAAGGAC GCATCTCGCT GGAAGCCGAT TACTACTACC GTCTGACAAC CGACATGCTC CTGGATGCCC CCGTACCACA ATCGAGCGGC TATGCCACCA TTCGGCGTAA CGTAGGCTCG ATGGAGAACA AAGGTTTTGA GTTCGGAGTG AACACGGTTA ACATCAACCG GGGTACTTTC AGTTGGAACA CCTCCTTCAA CATCTCCCTT AACCGCAACA AAGTCCTCTC CCTGGCTACT CCGTCTGACA TCTTCAACGT AGGTGGTCCT AACTTCACTA ACCCCACCAA TGTCATCCGG GTAGGTGAAG CAGTAGGTTC GTTCTGGGGT CTGACCCGGG TAGGCGTATG GAGTGAAGCG GAGCGGGAAG AAGCGGCCAA GTTTACCAGC TACCGCAACG GTCTGACCAT TCTGCCCGGC GACATCAAGT ACCTCGACGT AAACGGCGAC AAGGCCATCA CCGATGCTGA CCGCAGCATC ATTGGCAACG GTAGTCCTAA AGGCTGGGGT GCCATGACCA ACAACATTCG TCTGGGCAAC TTCGATGCCA CCCTGGAACT TCAGTACATG TTTGGTAACG ACGTCATGCT GATGAACTTA CACCCCAGTG AAGACCGGCA GGCTCTGGCC AACAGCTACT CGTCGGTGCT CAACGCCTGG ACGCCAACCA ATCAGGGTAG CCAGATTGCT CAGGTACGCG ACACACGGGC GGGCTACGTA ACCAACGTCG ACAGTCACTG GATTAAGGAC GGTTCGTTCC TGCGGGGCCG CAACCTCCTG TTTGGCTATA CGCTGCCGGC TAACGTAACG TCTAAATTAA AGATGAACCG GTTACGGGTA TACGTTTCCG CTCAGAACTT CTTCCTGTTG CTGAAAGATC CTATTGTTGG TGATCCGGAA GTAACGCCCA CCAACCAGGG AACAGGCAAC AGCGCCTTCT CACAGGGCAT GATCTGGCAC AACTACCCTA AACCAACTAC CTATCTCCTT GGTCTGCAAA TTGGCTTGTA G
|
Protein sequence | MAKLLPPKLL SDRLIRVSST QLLLAALCVS FTYAGKPVNP KFPVNQSVKQ ADRTLTGRVT DEKSEGLPGV SVILKGTQRG TVTDADGQYK LDVPDGASTL VFSFVGYLPQ EVSVGNQTSI NVSLKTDSKV LDEIVVIGYG TTRKSDLTGA VTGVKEAQLQ ERPAPSLNQA LSGRMPGVQV NTNSGRPGGR TTVRIRGFSS INSSNNPLYV VDGVMLPQGT GDQFSNPIDY INPNDIVNVE VLKDASSTAI YGARGANGVI LVQTRKGKAG ESRVTYDGQF SVNTIGPNKP KVLNAKEYLA TEDLAYANMA KYDPVGWAAG KWSYLDPIAR RKAFSAAHPG VFDANLNPLY DTDWFKESAQ NKLSQNHQLG FSGGNERTQY SLSLNYRDDQ GLIKTSYMKR YSGRFSIDDQ VKSWLKIGGT MSYNYQTENL VDINDAVARQ IVEDFPFLPV RYPDTGVFAE NRDYPYAEGT MSSVHRLMDR KYIQNTQTIL GSLFTNITFG KGLEMRTVLG TNVQTQEINQ SQTRTLNIGN NGNASTNNNR QNFWSLENYL TYNKQFGQDH SFTGLLGLSW QETNTFGIGA SVSGFATDYF GFNNLGAGAI NPSVSSGASR FAFNSYFGRI NYGYKNKYLF TATGRADGSS KFGENHKFAF FPSAALAWRV SEEDFLKGNP VISNLKVRTS YGLTGNSEIP PYSSLSLLSS NYATIYNDTK VSGTGINRLA NPDLRWEKTA QTDVGLEVGF LKGRISLEAD YYYRLTTDML LDAPVPQSSG YATIRRNVGS MENKGFEFGV NTVNINRGTF SWNTSFNISL NRNKVLSLAT PSDIFNVGGP NFTNPTNVIR VGEAVGSFWG LTRVGVWSEA EREEAAKFTS YRNGLTILPG DIKYLDVNGD KAITDADRSI IGNGSPKGWG AMTNNIRLGN FDATLELQYM FGNDVMLMNL HPSEDRQALA NSYSSVLNAW TPTNQGSQIA QVRDTRAGYV TNVDSHWIKD GSFLRGRNLL FGYTLPANVT SKLKMNRLRV YVSAQNFFLL LKDPIVGDPE VTPTNQGTGN SAFSQGMIWH NYPKPTTYLL GLQIGL
|
| |