Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3418 |
Symbol | |
ID | 8727171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4139187 |
End bp | 4142417 |
Gene Length | 3231 bp |
Protein Length | 1076 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003388225 |
Protein GI | 284038295 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC ACTTACCTCC TCAAAGCCTG CTGGGCAGGT TAGTCAGCGT AGTAATTACT CAGCTACTAC TGACTGCAAT GTGCGTTAAT TTTACTTATG CCAAGGTCCC TCTGGCATTA AAAACAGTGG CCGACCAACG TGCCATTACA GCTGATCGTA CACTCACGGG TCGGGTGACA GATGAAAAAG ATGAAGCCTT ACCCGGTGTG AGTGTTATCC TGAAGGGAAC CCAGCGCGGA ACCGTAACCG ATGCCGATGG CCGGTATAAA GTGGATGTTC CCACGGGTGG CGCTACGCTT GTGTTCTCCT TTGTCGGATA TGTCCCTCAG GAAGTACGCG TTGGCAACCA AACCTCACTC AATATCAGCC TGAAAGCCGA CAGCAAAGTG CTCGACGAAA TCGTCGTGAT CGGGTATGGT ACCGCCAAAA AGTCTGACCT TACCGGCGCT GTCACCAGCG TGAAGGAGGC TCAGCTTCAG GAACGGCCTA CATCCTCATT GAACCAGGCC CTGTCAGGTC GCATGCCCGG CGTGCAGGTC AACACCAACT CGGGACGACC CGGCGGTCGG ACCACCGTCC GTATCCGGGG CTTCAGCTCC ATCAACTCCT CCAACAACCC CCTCTACGTC GTTGATGGCG TCATGCTTCC CCAAGGTACC GGCGACCAGT TCAGTAACCC AATCGATTAC ATCAACCCCA ACGACATCGT TAACGTAGAG GTCCTGAAAG ATGCCTCTTC GACGGCCATC TACGGAGCAC GCGGTGCCAA CGGCGTTATT CTGGTCTCCA CTAGAAAGGG GAAGGCCGGT GAAAGCCGGG TTACCTACGA CGGTCAGTTC AGCGTTAACA CCATCGGACC CAACAAGCCA AAGGTGCTCA ACGCCAAGGA GTACCTGGCT ACCGAAGACC TCGCCTATGC CAACATGGCC AAGTATGACC CCGTCGGCTG GGCCGCAGGT AAGTGGTCTT ACCTGGACCC GATAGCCCGG CGCAAAGCCT TCAGCGCGGC TCACCCTGGT GTGTTTGATG CCAACCTGAA CCCACTCTAC GACACCGACT GGTTCAAGGA GTCGGCTCAG AACAAGCTTT CCCAGAACCA CCAGTTAGGT TTCAGTGGCG GTAACGAGCG CACCCAGTAC TCCCTCTCGC TGAACTATCG CGACGATCAG GGTCTGATCA AGACCTCCTA CATGAAGCGT TACTCGGGTC GTTTCTCGAT CGATGATCAG GTCAAGAGCT GGCTCAAGAT CGGTGGTACA CTGAGTTATA ATAACCAGAC GGAAAACCTG GTGGACATCA ACGATGCGGT GGCCCGTCAG ATCGTGGAGG ACTTCCCCTT CCTACCCGTG CGCTACCCGG ACACTGGCGT CTTCGCCGAG AACCGGGACT ACCCCTATGC AGAAGGCACC ATGAGTTCGG TGCACCGCCT GATGGACCGT AAGTACATCC AGAACACCCA GACCACTTTG GGTAGCCTCT TCACCAACAT CACGTTAGGC AAAGGGCTGG AGATGCGTAC GGTATTGGGT GCCAACGTTC AGACGCAGGA GATCAACCAG TCGCAAACCC GTACGCTTAA CATCGGCGGT AACGGTAACG CATCGACCAA CAACAATAAG ACCTCGTTCT GGTCGCTGGA ACATTACCTG ACCTACAACA AACAGTTTGG TCAGGACCAC TCCTTCACCG GACTGCTGGG TCTTTCGTGG CAGGAGACTA ACACCTTTGG CATCGGTGCC AGTGTGAGCG GTTTTGCCAC CGACTACTTT GGCTTCAACA ACCTGGGTGC TGGTGCTACC AACCCATCGG TGAGTTCAAG CGCATCACGG TTTGCCTTTA ACTCCTACTT CGGTCGGATC AACTACGGCT ACAAGAACAA GTACCTCTTC ACGGCTACCG GCCGGGCCGA TGGCTCCTCG AAGTTCGGAG AGAATTACAA GTTTGCCTTC TTCCCCTCGG CGGCTCTGGC CTGGCGGGTA TCGGAAGAAG ACTTCCTGAA GGGCAATCCC GTTATCTCGA ATTTGAAGGT CCGCGCCAGC TACGGCTTGA CGGGTAACTC TGAAATTCCA CCGTATCAGT CACTGTCGTT GCTTAGCTCG AACTATTCGA CGATCTACAA CGACGGCCGC GTTGGTGGTA CGGGTATCAG CCGTTTGGCT AACCCCGACC TGCGCTGGGA AAAAACCGCT CAGACTGATG TAGGTCTGGA AGTTAGCTTC CTCAAAGGAC GCATCTCGCT GGAAGCCGAC TACTACTACC GTCTGACAAC CGACATGCTC CTGGATGCCC CCGTACCACA ATCGAGCGGC TATGCAACCA TCCGGCGTAA CGTAGGCTCG ATGGAGAACA AAGGCTTCGA GTTCGGTTTG AACACGGTCA ACATCAACCG GGGTACTTTC AGCTGGAATA CAAACTTCAA CATCTCGTTG AACCGCAACA AAGTCCTCTC CCTGGCTACT CCATCCGATA TTTTTGGGGT AGGTGGTCCT AACTTCACCA ACCAGACGAA TATCATTCGT ATTGGTGAAT CAGTAGGTTC GTTCTGGGGT CTGACCCGCG TGGGTGTATG GAGTGAAGCG GAGCGGGAAG AAGCGGCCAA GTTCACCAGC TACCGCAACG GTTTGACCAT TCTGCCCGGC GACATCAAGT ATCTCGACGT AAACGGCGAC AAGGCCATCA CCGATGCTGA CCGCAGCATC ATTGGCAACG GTAGTCCTAA AGGCTGGGGT GCCATGACCA ACAACATTCG TCTGGGCAAC TTCGATGCCA CCCTGGAACT TCAGTACATG TTTGGTAACG ACGTCATGCT GATGAACTTA CACCCCAGTG AAGACCGGCA GGCTCTGGCC AACAGCTACT CGTCGGTGCT CAACGCCTGG ACGCCAACCA ATCAGGGTAG CCAGATTGCT CAGGTACGCG ACACACGGGC GGGCTACGTA ACCAACGTCG ACAGCCACTG GATCAAGAAT GGTTCGTTCC TGCGGGGTCG TAACCTGCTA TTCGGTTACA CCTTCCCGGT TGAGATGACT AACAAGCTTA AGATGAACCG TCTGCGGATG TATGTGTCGG CTCAGAACTT CTTCCTGTCA GTTGAAGACC CCATCGTAGG TGATCCGGAA GTAACGCCCA CCAACCAGGG CTCAGGCAGC AGTGCCTTCT CACAAGGTCA AATCTGGCAT AACTACCCCA AACCAACCAC GTACATGCTG GGCCTCCAGA TTGGCTTGTA A
|
Protein sequence | MKKHLPPQSL LGRLVSVVIT QLLLTAMCVN FTYAKVPLAL KTVADQRAIT ADRTLTGRVT DEKDEALPGV SVILKGTQRG TVTDADGRYK VDVPTGGATL VFSFVGYVPQ EVRVGNQTSL NISLKADSKV LDEIVVIGYG TAKKSDLTGA VTSVKEAQLQ ERPTSSLNQA LSGRMPGVQV NTNSGRPGGR TTVRIRGFSS INSSNNPLYV VDGVMLPQGT GDQFSNPIDY INPNDIVNVE VLKDASSTAI YGARGANGVI LVSTRKGKAG ESRVTYDGQF SVNTIGPNKP KVLNAKEYLA TEDLAYANMA KYDPVGWAAG KWSYLDPIAR RKAFSAAHPG VFDANLNPLY DTDWFKESAQ NKLSQNHQLG FSGGNERTQY SLSLNYRDDQ GLIKTSYMKR YSGRFSIDDQ VKSWLKIGGT LSYNNQTENL VDINDAVARQ IVEDFPFLPV RYPDTGVFAE NRDYPYAEGT MSSVHRLMDR KYIQNTQTTL GSLFTNITLG KGLEMRTVLG ANVQTQEINQ SQTRTLNIGG NGNASTNNNK TSFWSLEHYL TYNKQFGQDH SFTGLLGLSW QETNTFGIGA SVSGFATDYF GFNNLGAGAT NPSVSSSASR FAFNSYFGRI NYGYKNKYLF TATGRADGSS KFGENYKFAF FPSAALAWRV SEEDFLKGNP VISNLKVRAS YGLTGNSEIP PYQSLSLLSS NYSTIYNDGR VGGTGISRLA NPDLRWEKTA QTDVGLEVSF LKGRISLEAD YYYRLTTDML LDAPVPQSSG YATIRRNVGS MENKGFEFGL NTVNINRGTF SWNTNFNISL NRNKVLSLAT PSDIFGVGGP NFTNQTNIIR IGESVGSFWG LTRVGVWSEA EREEAAKFTS YRNGLTILPG DIKYLDVNGD KAITDADRSI IGNGSPKGWG AMTNNIRLGN FDATLELQYM FGNDVMLMNL HPSEDRQALA NSYSSVLNAW TPTNQGSQIA QVRDTRAGYV TNVDSHWIKN GSFLRGRNLL FGYTFPVEMT NKLKMNRLRM YVSAQNFFLS VEDPIVGDPE VTPTNQGSGS SAFSQGQIWH NYPKPTTYML GLQIGL
|
| |