Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3993 |
Symbol | |
ID | 8727751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 4801958 |
End bp | 4805203 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003388782 |
Protein GI | 284038852 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.356607 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00078443 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGAAC GATTATTTGC ACTTCTCTTC TTACTTCCTT TGCTTTCTTT TGCCCAGCAG CGCGTTGTGC GTGGTAAGGT AAGCGATGCT AATGGGAAAG AATCTCTACC CGGTACCACG GTAACGGTGA AAGGTGGTAC AGCGGGTACA GTAACCGACG CACAGGGAAC TTATCAGATA AATGTACCCG ATAAAGCAGC AACGCTGGTT TTCTCGTCTG TCGGCTTTCG ACTACAAGAG ATAGTTGTTG GTAATCAGCA GGTTATCGAT GTGTCGCTCA TTGCCGATAC CAAGCAGCTC AGTGAAGTGG TGGTTACCGC AGCGGGTATC AAACGGGATA AAAATGCGCT CGGTTATTCG GTATCAACGC TGGATGCCAA CAAGCTGGCT CAGCGGTCAG AACCGGATCC ACTCCGGGCA CTTACCGGTA AAGTCGCGGG TGTTAATGTG CAGGGTTCAG GGGGGGCGGC CGGTGGTGCA ACGAATATTA CTATTCGGGG AAATTCGTCG CTGGGCAATA ACAACCAGCC ACTGTTTGTG GTCGATGGGG TTCCCTTTGA TAATTCCAGT TTCGGCAGTA CGGACGGATT TGTTGGCGGG TCGACCGTTA CCAACCGGGC TTTTGACCTG GACCCGAACA ATATTCTTAC CATGACGGTG CTGAAAGGAG CCGCTGCGGC TGCCCTTTAC GGCTCCAGAG CGGCTAACGG AGCCATCATC GTGACCACAA AAGCCGGTAA AAGCACCAGC CGGAAAGGGC TTGAAATCAC GTATAGTTCC GGCTACTCGA CAGAAACGGT GGCCGGTTTG CCGGATTACC AAACCAAGTA TGGACAGGGA ACGAACTTCG ATTATCGGAG TGGTGTATTC GGCTCGTGGG GACCGCCGTA CGCAGGCTAT ACCAGCCTGA TTCCCACCCG GGCTACCATT CCGCACCCGC TCACGACGAA TAACCGCTAT CCATCAACGG TATTTCCGCA GTTTTATCAG GCCGATGGCA CAACGCCCAT TCAGGTACCT TACCAGTCGT ATTCGCAGGG CAACGCCAAA AATTTCTTTC GGACAGGAAA CGTCTTTGAA AACGCCCTCT CGGTATCGAC AGGTGGCTCG AAGGGGAATT TTACGGTGGG TCTTTCCCGC ACGGGTAATC AGGGTGTAGT GCCCGAGAAC CAGATCACCC GTACCGGTAT CAACATTGGT GGTAACGCCC AGCTGGATAA TAAGTTCTAC GTGAGCGGTG CCCTGAACTA CGTCAATACC GAGCAGGTAT CTCCGCAGGT GACGGCGGCC AATGGCAGCG GCAGTTCCAT TATGGACATC CTGATGTTTG TGCCTACCAG CTTCGACCTG ACGGGTTACC CGAACACCAA TCCGCTGAAC GGCAATAATG TATACGACCG CGTCGGTACC GATAACCCTT ACTGGTCGGT CAAAAACAGT CCGACAATCA GTAAGGTTGA TCGCTATTAC GGTAATGTCG TGCTGGGGGT CGATCCGTTG CCCTGGCTGA ACGTGCAGAA TACCGTCGGC TTCAATGCGT ATACCGACCG GCGTGTGGTC GTCAATGGAA AAGGCGGGGA CTACTTCCCG AACGGCAACA TCACGAACGA TAATATCTAT CGGCAGGAAC TGGACAACAC CCTGCTCGTA ACGGCAACTA AACCGCTTTC GGAAAACATT GGCCTGAAAG TAATTCTGGG AAACAACGTT AACCAGCGCG TAACGGAGCG GCAGGTTGTG TTTGGCGACG GGATCATTTT TCGGGGTATC AACTCGCTGA ACAACACCAG CGTGACCATT CCCCGCGTGT TGCCCAATAA CCGCAATAAT TTCAAACAGC GATACTTTGC ATTCTTCACC GATATTTCGC TGGACTACAA AAACTACGCG TTTTTAAACC TCGTAGCCCG TAACGACGTA TCGTCTACAC TGCCCGCCAG CAACCGAAGC TACCTGTATG GCGGGGCCAG TGCCTCCCTG ATCTTTACCG AAGCGTTGAA ACTGCCTAAA AACGTTCTTT CGTTTGGTAA GCTTCGGGCT GGCTACACCC GCGTAGGTAA CGAAGCAACG CCTTACCAGA CGCAAACGGT GTACATCGCC AACCCTGTTC TGGGCGTTGG GTCGGGCACG GGATCTATCT CATCGCCGTT CAGCGGCCAA AGTACGCTGT CCGAATCAGA CTTGCTGGCC AATTCAGAGT TGAAGCCCGA GTTCATTACC GAGCTTGAGG TGGGTACGGA GTTACAGTTT TTCAATAACC GGATAGGCCT GGACATCACG TACTATAACA AGATCAGTAC GTCGCAGATT TTTACGGTCA ACGCCACACC CTCGTCGGGG TACACGCAGC GGGTAATTAA TCTGGGGCGT TCGTCTAACG AAGGCATCGA GATTGGTCTG ACGGCAACGC CCGTAAAACT CAAGAACGGG TTTAGCTGGG ATATCTCATC GGCCTTCACC ATGAACCGCA ACATTGTACT GGACATAGGT TCCCTGAAAG AACTGCCCTA CGGTGGTTTC TCGGACCTAG GTAGTGTACA CATTGCCGGT CAGCCCTACG GACAGATTCG CGGGTCGACG TATGCCAGAG ATGAGGTGGG CAATATCCTG GTCAATCCCA ACACGGGAAA GCCGATCCTG AGCGGTAAAA CGGCCGCTAT CGGTAATCCA AACCCCGATT TTATTCTGGG CGTAACCAAT ACCTTTAGTT ATAAAGGGCT TACGCTGAGC GCGTTGTTCG ACTGGAAGAA AGGGGGCGAC ATGTACTCCT TTACGGCTTA CGAACTGCTA AGCCGGGGTG CAACCAAAGA CACCGAAGAG CGGGAGGCCA TTCTGGTGGG ACCGGGTGTG CTGGGCGATG TGAACACGCT GAAACCCCTG CTGGATGGCG AAGGCAAAAA GATTCCGAAC AACATTGGTA TTGCCGTAGC TGATTACTAT TTCACGGGCG GCTTTGGACC GGGTGGCGCG GGCGAAACGA ATATTTTCGA CGCCACCATT TTCCGGCTTC GTGAAGTATC GCTGGGCTAC CAGTTTCCGA AAAAATGGCT GGCGAAAACA CCCTTTGGCG GGGCGTTCCT ATCCGTCAGT GGGCGCAACC TATGGTACCT GGCTCCCAAC TTTCCCAAGT ACCTGAATTT TGATCCCGAG GTTAGCTCAT TAGGTGCCGG TAATTCACAG GGCTTCGATT TCATCGGTAT ACCGACAACC CGCCGACTGG GTGTCAACCT GCGGTTCAGC TTCTGA
|
Protein sequence | MKERLFALLF LLPLLSFAQQ RVVRGKVSDA NGKESLPGTT VTVKGGTAGT VTDAQGTYQI NVPDKAATLV FSSVGFRLQE IVVGNQQVID VSLIADTKQL SEVVVTAAGI KRDKNALGYS VSTLDANKLA QRSEPDPLRA LTGKVAGVNV QGSGGAAGGA TNITIRGNSS LGNNNQPLFV VDGVPFDNSS FGSTDGFVGG STVTNRAFDL DPNNILTMTV LKGAAAAALY GSRAANGAII VTTKAGKSTS RKGLEITYSS GYSTETVAGL PDYQTKYGQG TNFDYRSGVF GSWGPPYAGY TSLIPTRATI PHPLTTNNRY PSTVFPQFYQ ADGTTPIQVP YQSYSQGNAK NFFRTGNVFE NALSVSTGGS KGNFTVGLSR TGNQGVVPEN QITRTGINIG GNAQLDNKFY VSGALNYVNT EQVSPQVTAA NGSGSSIMDI LMFVPTSFDL TGYPNTNPLN GNNVYDRVGT DNPYWSVKNS PTISKVDRYY GNVVLGVDPL PWLNVQNTVG FNAYTDRRVV VNGKGGDYFP NGNITNDNIY RQELDNTLLV TATKPLSENI GLKVILGNNV NQRVTERQVV FGDGIIFRGI NSLNNTSVTI PRVLPNNRNN FKQRYFAFFT DISLDYKNYA FLNLVARNDV SSTLPASNRS YLYGGASASL IFTEALKLPK NVLSFGKLRA GYTRVGNEAT PYQTQTVYIA NPVLGVGSGT GSISSPFSGQ STLSESDLLA NSELKPEFIT ELEVGTELQF FNNRIGLDIT YYNKISTSQI FTVNATPSSG YTQRVINLGR SSNEGIEIGL TATPVKLKNG FSWDISSAFT MNRNIVLDIG SLKELPYGGF SDLGSVHIAG QPYGQIRGST YARDEVGNIL VNPNTGKPIL SGKTAAIGNP NPDFILGVTN TFSYKGLTLS ALFDWKKGGD MYSFTAYELL SRGATKDTEE REAILVGPGV LGDVNTLKPL LDGEGKKIPN NIGIAVADYY FTGGFGPGGA GETNIFDATI FRLREVSLGY QFPKKWLAKT PFGGAFLSVS GRNLWYLAPN FPKYLNFDPE VSSLGAGNSQ GFDFIGIPTT RRLGVNLRFS F
|
| |