Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_2729 |
Symbol | |
ID | 8726479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 3298537 |
End bp | 3301536 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003387543 |
Protein GI | 284037613 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0287081 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAC TTTTATTTGC GGGAGTAGTG TGGCTACTCT CGTGCGGAGG ATTGCTGGCA CAATCGACCG CAGCGCGTGT ATCCGGCACC GTTAAAACCG ACAACGGAGA ACCTCTGCCG GGGGCCAACG TGGTCATCAA AAATCAGACA AAAGGCGCTA CAACCGACGC CAACGGCCTT TTCAGCCTAG ATGCTCGCTC CGGCGATGAG TTAATGATCT CGGCCATTGG TTACCAGAGC ACTCAGGTTA AAATCGGCAC AAAAAATACC CTTGAAATTT TTCTGCGCGA ATCGGCATCC CAACTCAACG AGGTCGTTGT GGTTGGTTAC GGTACACAGG ATCGAAAAAA TCTGGTCGGC TCCGTTACAC AGGTCAACGC CGATGAGATC AAGAATCGTC CCGTAGCTAG TTTCGACCAA CAGTTGCAGG GTCGGGCAGT TGGTGTTCAG GTGGCGGCTA ACACGGGCGT TCCGGGCGAC GGTATTTTCT TTCGTATTCG GGGTACCACA TCCATCAACG CCAGCAACGA CCCGCTGTAT GTGGTCGACG GGGTATTCGT TAATAATCAA TCGCTCCAGA AAATCACCAC ACAGGGGCAG GCCAACAATC CGCTGGCTGA CATCAACCCC GCTGATATTG AGTCGATTTC GATTTTGAAA GATGCCGAAG CGACGGCTAT TTACGGGGCG CGGGCGGCCA ATGGCGTTGT GCTGATCACC ACCAAACGAG GCAGCTATAA CAGCAAAACC AAAGTTAGTT TGAATGCATC CGTCGGGCAG GCATGGGCTC CCAAACTGTG GGATCTGGTA ACCGGTCCCG AGCACGCGAC CATCATCAAC GAAGCCTGGA TAAACGACGG CAAACCAGCC GCTACCCGAC CCTTCCGGCC GATCTCGGAA GGCGGTCGCG GATTACCGGA AGAGCAACCC ACCTACGACC GGCTGCACGA TATTTTCCGG ACGGGCGCCC TGCAAAACTA CGATCTGGCC GTTTCGGGCG GCACCAAACA AACCCGCTTT TACATCGGTG GTGGGTACAC AAGCCAGCAG GCAACGCTGC GTACCAACGA CTTTTCGCGG GCGAGTTTCA AGCTGAACCT GGATCAGGAC ATTACGGATA AAATCCGCAT CGGCACCAGC AATATACTCT CTCAGTCGAA CCGGACCAAT GCACGGGTTG GCGATGGACC ACAGGGCGGT ATTTTACAGG CTGCTTTACA CACGCCGACC TACCTGCCAA AATTCAATAC AGACGGCTCC TACGCCAAAT GGGCCGGTTT CGACAACCTC GATGTGCTGA TCAACAATAC GGATATGCAC TCGACCAGCA CCCGCTACAT CGGCAACATT TATGGTGAAT ACGACATTAT CAGCGGCCTG AAACTACGCA GTAGCTGGAG CATCGACTAC AACGATTACA ACGAATACGA GTACTGGAAC ACGCTAACCA ACCGGGGTAG CGCGAGCAAA GGACTGGCTA CATCGAGCGT TAGTAAAAAT ACGATCTGGA TCAATGAACA GACATTGTCG TACCGACGGT CATTCGGTAC CCAGCACAAT TTCGGCGCGC TGGTTGGGAA TACCTTGCAG GGAAACGTAT CAACGCAGAC GCTGGCTCAG GGCACCAATT TCCCGTCCGA TGCATTTAAG CAGATCGCGT CGGCATCGGT AACGACCGCT TCTTCTAACC GGAATCAGTA TAACCTGGTC TCGTTCTTCG GGCGGGTCGA TTATAATTTC TCGAAAAAGT ATTTTCTGGA AGCCAGCCTT CGGGCCGATG CGTCGTCCAA GTTTGCCGAG GGGCATCGCT GGGGCTATTT CCCCTCGGCG GGGGTGGCCT GGCAGCTTAA GCAGGAGAAT TTTTTGCGGG ATGTCAATTT CCTGAGCGAC CTAAAAATTA GAGCCAGTGT GGGCTGGACG GGCAACCAAA ACGGCATCGG CAACTATGCC TCCCGCGGCT TGTGGGGTGG CGGCAACAAC TACCTCGACA ATCCGGGAAC GGTACCCGTT CAACTGGCCA ACCCGGAACT GAAGTGGGAA ACCACCCGCC AAACCAACGT AGGATTGAAC GTCGGTCTAC TGAGCAACCG CATTGGCCTG GAAATAAACG CCTATTCCAA ATACACCTAC GACCTCCTGC TTCAGGTGCC ACTGGCGCAG AGTTCGGGTT TCTCCAGTAT CTACCGGAAC GATGGCGAAA TCAGCAACCG GGGGCTTGAA TTTGGTATCA ATACCCAGAA CATCAACAAA AGCAGCTTCC AGTGGAATAC CAGCTTTAAC ATTGCCGCTA ACGTAAACCG CATCGAAAAG CTTTCCATCC CGGTCGATGC CAGCTATGCG GCCGAACGCA TGGCGCAGGG ACAGGCGTTT CACTCCTTTT ACGTCTACCG ACAACTGTAT GTCGATCCCA AAACGGGCGA CGCGGTCTAT GACGATGTCA ATAAAGACGG CAAAATCACC GTGGCCGACC GACAATTTTA CGGCAGTGCG TTGCCCAAAT TTTTTGGTGG GCTGAACAAC ACCTTCGCTT ACAAAGGGTT CGATCTGTCG GTATTTTTCA ATTTCAGCTA CGGCAGTAAA GTCTTTAACA ACAACCGCTT CTTCCACGAG TCGGGCGGGA CGCGGGATGA CCGGCGGGCC ATCAACAAGA ATCAGCTAAA GCGCTGGCAG AAAGAGGGCG ATATCACGGA TGTGCCTCGC GTGACGACCA TTGGCAACAA CTACAATCTG AGCCCCACCA GCCGATTTGT AGAAGACGGG TCGTTTCTCC GACTGAACTC GCTCGTGTTG GGCTACACCA TTCCAAAAGC CGTTTTGCGC AAAGTGGGCA TTTCGTCGGC GCGGGTGTAC TACAGCGGCT CCAACCTGTG GCTGCTGAGC AACTACCAGG GTCCTGACCC TGAGGTAAAC GTCACCGCCG ACCCTACCAC CCAGGGATAT GATCTGGGCA CCCCTCCACA ACCCCGAACG GCACAATTTG GCATCAACCT CACTCTCTGA
|
Protein sequence | MQKLLFAGVV WLLSCGGLLA QSTAARVSGT VKTDNGEPLP GANVVIKNQT KGATTDANGL FSLDARSGDE LMISAIGYQS TQVKIGTKNT LEIFLRESAS QLNEVVVVGY GTQDRKNLVG SVTQVNADEI KNRPVASFDQ QLQGRAVGVQ VAANTGVPGD GIFFRIRGTT SINASNDPLY VVDGVFVNNQ SLQKITTQGQ ANNPLADINP ADIESISILK DAEATAIYGA RAANGVVLIT TKRGSYNSKT KVSLNASVGQ AWAPKLWDLV TGPEHATIIN EAWINDGKPA ATRPFRPISE GGRGLPEEQP TYDRLHDIFR TGALQNYDLA VSGGTKQTRF YIGGGYTSQQ ATLRTNDFSR ASFKLNLDQD ITDKIRIGTS NILSQSNRTN ARVGDGPQGG ILQAALHTPT YLPKFNTDGS YAKWAGFDNL DVLINNTDMH STSTRYIGNI YGEYDIISGL KLRSSWSIDY NDYNEYEYWN TLTNRGSASK GLATSSVSKN TIWINEQTLS YRRSFGTQHN FGALVGNTLQ GNVSTQTLAQ GTNFPSDAFK QIASASVTTA SSNRNQYNLV SFFGRVDYNF SKKYFLEASL RADASSKFAE GHRWGYFPSA GVAWQLKQEN FLRDVNFLSD LKIRASVGWT GNQNGIGNYA SRGLWGGGNN YLDNPGTVPV QLANPELKWE TTRQTNVGLN VGLLSNRIGL EINAYSKYTY DLLLQVPLAQ SSGFSSIYRN DGEISNRGLE FGINTQNINK SSFQWNTSFN IAANVNRIEK LSIPVDASYA AERMAQGQAF HSFYVYRQLY VDPKTGDAVY DDVNKDGKIT VADRQFYGSA LPKFFGGLNN TFAYKGFDLS VFFNFSYGSK VFNNNRFFHE SGGTRDDRRA INKNQLKRWQ KEGDITDVPR VTTIGNNYNL SPTSRFVEDG SFLRLNSLVL GYTIPKAVLR KVGISSARVY YSGSNLWLLS NYQGPDPEVN VTADPTTQGY DLGTPPQPRT AQFGINLTL
|
| |