Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_5003 |
Symbol | |
ID | 8728767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 6093848 |
End bp | 6095449 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003389779 |
Protein GI | 284039849 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGGC CCAAACAAGC GTTCGTATCG TCCCCTCAGA CCACGTTGAA AACATTGTTT TTTCTGGCTC TGCTCGCTAT TATAAGCAGT CAGTTAGCCA TTGGGCAGTC GGTCAAACGC CCGAATATCC TGTACATTCT GGCCGACGAT ATGGGCTTTT CGGACATTGG CTGCTACGGG GGCGAGGTCA ACACGCCGAA TCTTGATAAA CTGGCGGCTG GCGGTATCAA GCTGCGGAGT TTTTATAACA ACGCCCGCTG CTGCCCAACC CGAGCCTCTT TGCTCACGGG GCAGTATCCG CACACCGTTG GCATGGGCCT GATGGTGACC ATGCCCAACG CAGCCATTCA GCCGGGGAGT TATCAGGGAT TTCTGGATGC GCGTTACCCG ACTATTGCCG AGCGACTGAA AGAAACGGGC TATAGCACCT ACATGCTCGG CAAGTGGCAC GTGGGCGAGC GCCCCGAGCA TTGGCCCCTG AAGCGGGGTT TCGAGCACTA CTTCGGCCTG ATCTCCGGCG CATCGAGCTA TTACGAAATC ATTCCTGCCG AGAAAGGCAA GCGGTTCATT GTCCTCGACG ATAAGGAGTT TACCCCGCCC GCCGACGGTT TTTACATGAC CGACGCCTTC ACCGATTACG CCGTTCAGTA CCTCAACCAA CAGAAGCAGG AACAGGCCGA CAAACCGTTT TTTATGTACC TGGCCTACAC TGCGCCCCAC TTTCCACTGC ACGCGTATGA GTCGGACATT GCCAAATACG AGAAACTGTA TGCGCAGGGG TGGGATGTGA CCCGTACTAA ACGCTACCAG AAAATGCAAC AGCTTGGGCT GATCGACAAG CGTTACCAAC TGACGCCCCG CCCTGCTAAC GTACCCGCCT GGAATTCGGC CACCGATAAA GCGCAGTGGA TTCGGAAAAT GGCCGTGTAT GCTGCCATGA TCGACCGGAT GGACCAGAAT ATTGGTCGGC TTATTAAAAC CCTGAAAGCC AACGGCCAGT ACGACAATAC GCTCATCGTG TTCATGTCGG ACAACGGGAG TTCGAACGAA AATATGGAAA GCCGGAAGCT GAACGACCCC ACCAAAAAGA TCGGTGAACG CGGTTCTTAC GTCACCTACG ATACGCCTTG GGCCAACGTG TCGGTTACGC CGTTTCGGAA GTACAAGCGG TTTCTGCACG AGGGCGGCAT GATTACACCC TGCATTATGC AATGGCCCCG CAACATTCGG CCAGCCGCTG GCTATGTGGA TGGCATTGGC CACGTCATGG ACCTGCTGCC TACAAGTCTT GAATTAGCGG GCTTGTCGGC CAACGATTTG CCCGGCAAAA GCTTGTCGTA TCTATGGACA CCTAAAAAGA CCGAACCACG CACCTATTGC TGGGAACACG AAGGCAACAA AGCCATCCGA AAAGCTGACT GGAAACTGGT AAAAGATACC GAAGACGCCG ATTGGGAACT GTACAACATC AAAACTGACC CCTGCGAAAC CAACGATTTA GCCAGAAACC AACCCCAACG CGTGGCCAGT ATGCGAACCG AGTTCGATAC ATGGGCACAA CGGGTGGGCG TTCGCGAACG ACCGGCCGGG AAGTCGGAAT AG
|
Protein sequence | MKRPKQAFVS SPQTTLKTLF FLALLAIISS QLAIGQSVKR PNILYILADD MGFSDIGCYG GEVNTPNLDK LAAGGIKLRS FYNNARCCPT RASLLTGQYP HTVGMGLMVT MPNAAIQPGS YQGFLDARYP TIAERLKETG YSTYMLGKWH VGERPEHWPL KRGFEHYFGL ISGASSYYEI IPAEKGKRFI VLDDKEFTPP ADGFYMTDAF TDYAVQYLNQ QKQEQADKPF FMYLAYTAPH FPLHAYESDI AKYEKLYAQG WDVTRTKRYQ KMQQLGLIDK RYQLTPRPAN VPAWNSATDK AQWIRKMAVY AAMIDRMDQN IGRLIKTLKA NGQYDNTLIV FMSDNGSSNE NMESRKLNDP TKKIGERGSY VTYDTPWANV SVTPFRKYKR FLHEGGMITP CIMQWPRNIR PAAGYVDGIG HVMDLLPTSL ELAGLSANDL PGKSLSYLWT PKKTEPRTYC WEHEGNKAIR KADWKLVKDT EDADWELYNI KTDPCETNDL ARNQPQRVAS MRTEFDTWAQ RVGVRERPAG KSE
|
| |