Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4222 |
Symbol | |
ID | 8727981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5084658 |
End bp | 5087690 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389006 |
Protein GI | 284039076 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAT TTTGTGGTAC TATTTTGGGA CTGGCGCTGC TGGTTCTGCT GCCGGGTTAT CTGTTTGCTC AGCAGCTGCG TATAACGGGT AGAGTAACAT CACAGCAGGA TGGGCTGCCG ATACCTGGTG TCAACATCTC CGTTCGGGGA ACAACAAACG GGGTCAGTAC AGATGCCAAC GGTAATTACA GCATCACCGT TTCGGGTAGT TCGGCCGTGC TGCTGCTTAC TTCCATTGGT CTGGTGCAGC AGGAGATAAC GGTGGGTAAC CGCACGGTGA TCAACGTCCA GATGAAGGAA GCCGTTAATG AGCTGAGTCA GGTTGTCGTT ACGGGTTACA ACACTACACA GCGAAAAGAT ATTACCGGCT CTATCGCATC CGTTTCTCCC GATAAATTCA AAGATATTCC CGTTGCCAGC TTCGACCAGG CTTTGCAGGG CCAGGCGGCT GGTGTGCAGG TAACACAGTC GTCGGGTACA CCCGGTGGCG GACTTACCGT GCGGGTGCGG GGCAATACGT CCATATCGGC CAGTAACCGC CCGCTGTTTA TTGTCGATGG GGTGCCCGTA TCAGACGGGG GCTTATCGGG TCGGGAGTTT GGCGGGCAAA CAGACAATGC GCTGTCGCTC TTTAACCCCA ACGACATCGA ATCCATCCAG GTACTGAAAG ATGCCTCCGC CAAAGCGATC TATGGCTCAC GGGCTGCGAA CGGCGTGGTG CTGATAACGA CCAAGCGCGG GAAGGCCCAG AAAACGAGCT TCACCGCCGA TGTGCAGCGG GGGTTAACGG ACGTGGTAAA GCGGCCGGAT CTACTTAATT CGGTCGAGCT GCTCGAATTA CAGCGCGAAG CGGTTACCAA TGCCGGGCTG GACCCCGACA AACTGGGTCT GATAAAAGGG GTTACCGACG GGCAGAATAC GGACTGGATA GATGCCGTAC TGCGAAGAGG GGTTTACCAG CAATATCAGC TGTCGACGCA GGGCGGTAAC GACCGCACAC AGTTTTACCT CAGTGGCAGC TACCGCGATG AACAGGGTGT ACAGCTGAAT AACCAGTTTA CCCGGTATAC AGGCCAGTTG AAACTGGATC ATAAAGCAAC CGACAAATTA TCGTTCGGGA CAAACGTGAC CCTGTCAAGG GCGCTGAACA AACGGGTAAA AGGCGATAAC TTTCTGGATG GTGTGTACTC TGGTGCCATG AAAAGTCTGC CGTACTATTC GCCTTACAAT GAGCAGGGGC GACTTTACGG CCCCGCCGAC GCCGAATACC CTGGATTCCC AAATTTCAAC CCCGTTGCAC AGGCTGTGCT GCCGCGATTC AACGCCTACA CGGTGAAAAT ATTGGCGGGT CTGTATGCCG AATACGAAAT CCTCCAGAAC CTCCGCTTTC GGTCGAAAGT AAATATCGAC TACAACAACG TAACCGAAGA TCAATTTGAA CCGTCCACAA CGGCAATTGG AGGGTTTCTG TCCAGCGTAG GCGGGCAGGG CTACGGGGTG TTCATCAATC AGTCGTCATC GACCTTTGTC AATACAAATA CCCTTACCTA TAATTTTCAG CTGGCTGAAA AGCACCAGTT CAACGCGCTG GCAGGGGTAG AGATTCTACA GGCTACCGCC CGGGACGGTA ATGTTCAGGG TCGATTATTT CCCAGCGATG ACTTTACCTA CATAAATTCA GCGGGTATTG TCGATCAGGG GGGCTCTTCC GTAACGAACA ACGGCCTGCT GTCGACCTTC GGCGAAGTCC GCTACAGTTA CGATGAAAAA TACCTGGCCA CGATTACCGC CCGTTACGAT GGATCGTCGC GTTTTGGGCA GAGCCGCCGG TTCGGGGTGT TTCCGTCAGC CTCCTTTGCC TGGCGTATTT CGAGCGAGAA ATTCATGGAA CGCTTCCGGT TCCTGAGCGA CCTGAAGTTA CGGACGAGCT ACGGCTTTAC GGGCAACGAG CGCATTGGCG ATTTTCAGTT TCTGGGCACT TGGGCATCAG TTACCTACAG TGGCGCAACG GGCGTGGGTC CGGCCACGCT GGCTAATGCA AACCTGCAAT GGGAGCGCAC CCGCGAAGCG AACATAGGCC TAGATGCTTC GTTCTTTAAC GGGCGGCTTA ATTTTATCGT TGATGCCTAT GATAACCTGA CGGATAAACT CCTGTTTGCC CAGCCGATTC CGCAAACCAC TGGCTTCAGC ACCGTGCAGG GCAACATCGG GAAAGTATCC AACAAAGGCC TTGAACTAAC CATTTCGACG GTGAACGTCA ATAAGGCTGT TCGCTGGAGT ACCGATTTAA ACCTGTCCCA CAATGTAAAC AAAGTGGTGG AACTGGCCAG TACAGAGCCT GTTCTGCGGG GCTATCAGGG CAATGGGGTA GCCACCACCA ACGTGGTAAT ACCAGGTCAG CCACTGGGTA CATTCTGGGG GTTGAAATTC CTGGGAGTTG ACCCCGCTAC CGGCGACGCG ATCTATGATG ATAAAAACGG CGATGGGCGT ATTACTCCCG CCGACGGACA GGTTATTGGC AATGCCCAGC CCAAGGTGTA TGGCGGGTTG ACCAACAAGA TTTCCTGGAA AGGGATTGAC CTGAGTGCGC TGCTTCAGTT TTCGTACGGG AACAGCATTC TCAACTTCTC GAACCAAACG CTCCTAAACT CGGGTGCCGA CATTCAGAAT AACCAGACGC GGCAGGCACT CAAACGCTGG CGTAAAGAAG GCGATATCAC GAGCGTACCC CGTTACGAAT ACCAGAATAC CTATAATAAC TACACCAGCA GCCGGTTTGT GGAAGACGGG TCTTATCTGC GGCTGAAAAA CGTTTCGCTG GGCTACAACA TTCCCAAGAC CTGGATCAAT AAATACAAAG TGGCCAACGC CCGTCTGTAC GTCTCGGCTA CGAACATCCT AACCTGGAGC CGGTATTCTG GCGCAGATCC GGAAGTAAGC ACGCTCGATG GCTCTACCAC GGCGCAGGGC ATTGACTTTT TCACCTTCCC TCAGATCAAA ACGGTATTGG TAGGGGCAAC CCTTAGCTTT TAA
|
Protein sequence | MRKFCGTILG LALLVLLPGY LFAQQLRITG RVTSQQDGLP IPGVNISVRG TTNGVSTDAN GNYSITVSGS SAVLLLTSIG LVQQEITVGN RTVINVQMKE AVNELSQVVV TGYNTTQRKD ITGSIASVSP DKFKDIPVAS FDQALQGQAA GVQVTQSSGT PGGGLTVRVR GNTSISASNR PLFIVDGVPV SDGGLSGREF GGQTDNALSL FNPNDIESIQ VLKDASAKAI YGSRAANGVV LITTKRGKAQ KTSFTADVQR GLTDVVKRPD LLNSVELLEL QREAVTNAGL DPDKLGLIKG VTDGQNTDWI DAVLRRGVYQ QYQLSTQGGN DRTQFYLSGS YRDEQGVQLN NQFTRYTGQL KLDHKATDKL SFGTNVTLSR ALNKRVKGDN FLDGVYSGAM KSLPYYSPYN EQGRLYGPAD AEYPGFPNFN PVAQAVLPRF NAYTVKILAG LYAEYEILQN LRFRSKVNID YNNVTEDQFE PSTTAIGGFL SSVGGQGYGV FINQSSSTFV NTNTLTYNFQ LAEKHQFNAL AGVEILQATA RDGNVQGRLF PSDDFTYINS AGIVDQGGSS VTNNGLLSTF GEVRYSYDEK YLATITARYD GSSRFGQSRR FGVFPSASFA WRISSEKFME RFRFLSDLKL RTSYGFTGNE RIGDFQFLGT WASVTYSGAT GVGPATLANA NLQWERTREA NIGLDASFFN GRLNFIVDAY DNLTDKLLFA QPIPQTTGFS TVQGNIGKVS NKGLELTIST VNVNKAVRWS TDLNLSHNVN KVVELASTEP VLRGYQGNGV ATTNVVIPGQ PLGTFWGLKF LGVDPATGDA IYDDKNGDGR ITPADGQVIG NAQPKVYGGL TNKISWKGID LSALLQFSYG NSILNFSNQT LLNSGADIQN NQTRQALKRW RKEGDITSVP RYEYQNTYNN YTSSRFVEDG SYLRLKNVSL GYNIPKTWIN KYKVANARLY VSATNILTWS RYSGADPEVS TLDGSTTAQG IDFFTFPQIK TVLVGATLSF
|
| |