Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_6052 |
Symbol | |
ID | 8729833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 7342913 |
End bp | 7344274 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003390813 |
Protein GI | 284040883 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCTC TACTCCTCCT GTTCCTCGTC CTCTTCTGCC AACCGGTTCG GGTGGTTGGT CAAACGAAGC CTCCGGTAAA AAAACCCAAC ATCCTCCTCA TTTTAGCCGA TGACCTGGGC TATGGCGATT TGAGCAGTTA CGGGGCGCCT GATATCCGAA CACCCCACAT TGATTCGCTC GTTCGGGCGG GGATGCGGTT CAGCCACTTC TACGCGAACT CGTCCGTTTG TTCACCGTCA CGCGCTGCCC TATTGAGCGG GCGGTATCCC GAGCAGGTGG GCGTACCGGG CGTTATCCGA ACCATGCCCG ACGACAACTG GGGCTATCTG TCGCCAAGCG CCGTTCTGCT GCCTTCGATA TTGAAGAAAA ACGGGTACTA TACGGCCCTG GTTGGCAAAT GGCATCTGGG TCTGGAGCCG CCAAACCTGC CCAACGACCG CGGGTTCGAC CTGTTTCACG GCTTCGAGGG CGATATGATG GACGACTACT ACACACATTT ACGCCATGAC CGGAACTACA TGCGGCTCAA TCGGCAGACC ATCAATCCGC AGGGACACGC CACCGATCTG TTCACACAGT GGGCAACGGA TTACCTTGAG CAACGCGCCG GTCAATCAAA TCCTTTTTTC CTGTATCTGG CTTACAATGC CCCGCACGAC CCCATTCAGC CCCCCGCCGA CTGGCTGGCA AAAGTAAAAG CGCGTCAGCC GGGCATCAGT GAGAAACGCG CTAAGCTGGT AGGGTTGATT GAACACATGG ACGACGGCAT TGGCAAGGTC ATTCAAACCT TACGGGCAAA AGGCCTATAT GAAAATACGC TGATTGTGTT TGTCAGCGAC AACGGCGGAA AGCTGTTCGA TGGGGCAACT AATGGGCCAC TGCGTAGCGG AAAAGGACAC ATGTACGAAG GGGGCATTCG CATACCGGCC TGCGTAGTCT GGCCCGGTAA AGTTGCCGCT CAAAGTCAGT CGCAGCAACC GCTTTTATTG ATGGATATCT TCCCAACACT GGCTGAGGCT ACGGGTACAG TGATAAATTA CCCGATTGAC GGGCGGAGCT TCCTATCCAT TTTACGAGGA GAACGTCAGC TGTTAGCTGC CGAACGGCCT CTTTTCTTCA TTCGGCGCGA AGGTGGCAGC GAATACAATG GCAAAACAAT CGACGCGGTT CGGCTCGGCG ACTGGAAACT GCTTCAGGAC AGCCCATACA GCCCGTTGGA ATTATACAAT CTGAAAGAAG ATCCGCAGGA AAAAACAAAC CGGGCAAGCG ACCGGCCGGA GGAATTCAGA CGGCTGGAAA AACTTATGCG CGAACACACA CGCCAGGGTG GAGCCATACC ATGGGAAAAG GAAGGCTTAT AA
|
Protein sequence | MKPLLLLFLV LFCQPVRVVG QTKPPVKKPN ILLILADDLG YGDLSSYGAP DIRTPHIDSL VRAGMRFSHF YANSSVCSPS RAALLSGRYP EQVGVPGVIR TMPDDNWGYL SPSAVLLPSI LKKNGYYTAL VGKWHLGLEP PNLPNDRGFD LFHGFEGDMM DDYYTHLRHD RNYMRLNRQT INPQGHATDL FTQWATDYLE QRAGQSNPFF LYLAYNAPHD PIQPPADWLA KVKARQPGIS EKRAKLVGLI EHMDDGIGKV IQTLRAKGLY ENTLIVFVSD NGGKLFDGAT NGPLRSGKGH MYEGGIRIPA CVVWPGKVAA QSQSQQPLLL MDIFPTLAEA TGTVINYPID GRSFLSILRG ERQLLAAERP LFFIRREGGS EYNGKTIDAV RLGDWKLLQD SPYSPLELYN LKEDPQEKTN RASDRPEEFR RLEKLMREHT RQGGAIPWEK EGL
|
| |