Gene Information |
Locus tag | Slin_5933 |
Symbol | |
ID | 8729714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 7190835 |
End bp | 7192304 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | protein of unknown function DUF1501 |
Protein accession | YP_003390694 |
Protein GI | 284040764 |
COG category | |
COG ID | |
TIGRFAM ID | |
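
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record. The function name `check_gene_record` is hypothetical and used only for illustration.

```python
def check_gene_record(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the coordinates and lengths agree with each other.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the gene length includes the stop codon.
    """
    # Inclusive span on the replicon.
    span = end_bp - start_bp + 1
    if span != gene_length_bp:
        return False
    # An unspliced CDS should be a whole number of codons.
    if gene_length_bp % 3 != 0:
        return False
    # One codon is the stop codon, which is not translated.
    return gene_length_bp // 3 - 1 == protein_length_aa


# Values copied from the record for Slin_5933.
print(check_gene_record(7190835, 7192304, 1470, 489))  # True
```

Under these assumptions, 7192304 - 7190835 + 1 = 1470 bp, and 1470 / 3 - 1 = 489 aa, so the listed fields are mutually consistent.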
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTC AGGACGAAAT ACATGACCAA CTCAGCCGCC GGACCTTTCT AGGGCAATCT AGTGCTGGTC TGGGTGCCAT TGCGTTAGCA TCACTGCTAA ACCCGACAAA TCTGTTCGGT GGCGCATCGT CACCCGGCAC CTCCATGCCG GGAGAAAATC CGGCGGTAGG CAAGCCGCAC TTTCCACCGA AAGTGAAACG GGTAATTTAT TTATTTCAGA GTGGAGCGCC GTCGCAACTC GAATTGTTCG ATTACAAGCC AAAGCTCGAA GCCATGTGGG GGCAGGATTT ACCGGCTTCG GTACGCAACG GCCAACGCCT GACGGGCATG AGTGCCGGAC AAAGCCGGTT TCCATTGGCG GCTTCTAAGT ATAAGTTCGC GCAGTACGGA CCCGGTCGCA TGTGGCTTAG TGAATTGTTG CCGCATACGG CGAAAATTGC CGGGGATTTA ACCTTTGTGC GCTCCCTGCA TACCGAGGCC ATCAACCACG ACCCGGCTGT TACCTTTTTT CAGACGGGAA GCCAACAAGC CGGGCGACCC AGTTTCGGCT CCTGGATCAG TTACGGACTA GGCTCAGACA ATCAGAATCT TCCATCCTTT GTAGTACTTC TGTCCAAAGG GCGCGATGGC GACCAGCCGT TATATGCCAA ACTCTGGAGT AATGGATTTT TACCATCTGT GCATCAGGGC GTGGTGTTCC GGTCGGGCCC TGACCCGGTG TATTACCTTA ACAACCCGCC GGGAGTCGAT AAAACCAGCC GTCGGCGGAT GCTCGATTAT TTGGATAAAC TGCATCAGGA ACAATTCAAA CACGTACTGG ACCCGGAAAT AAACAACCGG ATGGCACAGT ACGAAATGGC GTATCGGATG CAGACATCGG TTCCCGAAAC GCTCGACATT TCGAAAGAGC CGGACTATAT CTTCGACATG TATGGTCCCG ACAGCCGCAA GCCGGGCACG TTTGCTGCCA ACTGCCTGCT GGCCCGCAAA CTGGTCGAAA AAGATGTTAA GTTTATCCAG TTGTATCATC AGGGATGGGA CCAGCACGGC AATCTGCCCA ACGATATTAA AATACAAACA AAAAGCGTTG ACCAGCCCTC GGCCGCACTG ATCATGGACC TCAAACAGCG TGGTTTACTG GACGATACGC TCGTGATCTG GGGCGGAGAA TTTGGCCGTG GGGCATACTC ACAGGGAAAA CTCACCCGCG ATAATTACGG GCGAGACCAC CATCCACGAG CGTTTTCGGT CTGGATGGCG GGGGCCGGGG TTAAAAAAGG TATGGTCTAC GGCGAAACCG ATGATTTCGG CTATAACGTT GTCAAAGACC CTGTTCACGT GCATGATTTC CAGGCGACGG TCCTGCATCT GCTCGGAATC GACCACGAAA AACTGACCTT CAAAAGCCAG GGACGACGGT ATCGACTAAC CGACGTGCAT GGCAAAGTAG TGAAGCCGAT ATTAGCATAA
|
Protein sequence | MDIQDEIHDQ LSRRTFLGQS SAGLGAIALA SLLNPTNLFG GASSPGTSMP GENPAVGKPH FPPKVKRVIY LFQSGAPSQL ELFDYKPKLE AMWGQDLPAS VRNGQRLTGM SAGQSRFPLA ASKYKFAQYG PGRMWLSELL PHTAKIAGDL TFVRSLHTEA INHDPAVTFF QTGSQQAGRP SFGSWISYGL GSDNQNLPSF VVLLSKGRDG DQPLYAKLWS NGFLPSVHQG VVFRSGPDPV YYLNNPPGVD KTSRRRMLDY LDKLHQEQFK HVLDPEINNR MAQYEMAYRM QTSVPETLDI SKEPDYIFDM YGPDSRKPGT FAANCLLARK LVEKDVKFIQ LYHQGWDQHG NLPNDIKIQT KSVDQPSAAL IMDLKQRGLL DDTLVIWGGE FGRGAYSQGK LTRDNYGRDH HPRAFSVWMA GAGVKKGMVY GETDDFGYNV VKDPVHVHDF QATVLHLLGI DHEKLTFKSQ GRRYRLTDVH GKVVKPILA
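
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be processed, the code below removes the grouping, computes a GC fraction, and translates the DNA up to the first stop codon. The helper names (`ungroup`, `gc_fraction`, `translate`) are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares; table 11 differs from the standard table in its permitted start codons, and that difference is not modeled here.

```python
# Standard codon table built from the compact NCBI-style layout (base order TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    dna = ungroup(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = ungroup(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
gene_start = "ATGGATATTC AGGACGAAAT ACATGACCAA"
print(translate(gene_start))  # MDIQDEIHDQ
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence field to the same functions would allow the listed protein and GC content to be compared in the same way.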