Gene Information |
Locus tag | Slin_1251 |
Symbol | |
ID | 8724984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 1528602 |
End bp | 1530059 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003386100 |
Protein GI | 284036170 |
COG category | |
COG ID | |
TIGRFAM ID | |
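The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1528602, 1530059)
print(length_bp)                  # 1458
print(protein_length(length_bp))  # 485
```

Under these assumptions the computed values agree with the listed gene length (1458 bp) and protein length (485 aa).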
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.601083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.331628 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGAACGAAC AACCTCATTT CCAACCCTTT CCTAAAAAAG TATTGAAACG ACTTCTGCTA TCCCTTATCG TCCTGGCCGG GGCGCTCGGC CTTCGGCAGC CGATTGACCC GGCATCGCGC CCGCCCACGG CTCCCAATAT CATTTTCCTG CTGGCCGATG ACCAGCGGTG GGATGCCCTG GGTGTTGCCG GAAATAAAAC AATCCAGACG CCTAACCTCG ACCGGCTGGC GCGGGAGGGG TTCTATTTCC GGCGTTCGTA CGTGACTACG CCTATCTGCT GCATCAGCCG GGCCAGTATC CTGAGCGGGC AGTACGCCCG CCGGCACGGG ATTGTGGATT TTGTGACGCC CTTTACCGAT TCTGCTCTGG CGCAGACCTA CCCGGCGCTG CTCCGGAAAG CCGGTTACCG AACGGGATTC ATTGGTAAGT ATGGCGTGGG AAATGTGATG CCCATCAATG AATATGATTA CTGGCGGGGT TTCGATGGGC AGGGTAACTA TGCGGCCAAG GATGCGCAGG GGAAGCCGAT TCACCTGACC GATTTAATGG GCCAGCAAAT GGACGAGTTT CTTCAGGGAA ATCCGGCCGG AAAGCCGTTC TGCTTATCGG TGAGCTTCAA AGCGCCCCAC GCACAGGATG CGGCCAACCC TGAATTCCCC TATGCCGAAC GGTTCACCGA CCTCTACCGC GACCAGACGC TAAAACGCCC CGCTGCCGCC GATGATAAAT ACTACCGACA GTTTCCAGAC TGGTTTCGGC ATAACGACCA GAACGAGTCC CGCATTCGCT GGAGCCGCCG GTTCGCTACG GATTCGATGT TTCAGCAGAC CACCAAATCG TATAACCGGC TGATTACGGG TATTGATGAC GTCGTGGGTA ACCTCCGCCG AACCTTACAG GAGCGGGGAC TCGCCGACAA TACCATCATC ATATACACCA GCGATAACGG GTTTTACGAA GGCGAATATG GCTTTGCCGA CAAATGGTAC GGCCATGAGT TGTCGATCCG GGTACCGCTC ATCATTTACG ATCCCCGCCA ACCGAATCGG CAAGGTCGCA CCACCGACAA GTATACGCTC AACATTGATT TCGCTCCTAC CCTGCTCACG CTGGCGGGGG TACCGGTGCC GGGCCGGATG CAGGGGCGCA GCCTTACGCA ACTGATGGAC GCCCGCGACG GCGCAGCACT CAAAACACCC TGGCGAACGG CCTTCTATTT TGAGCACATG TTTAATACGC CTGCCGTATT TATTCCTCAA TCCGAAGGGG TGCTGAGCGC CGATAGAAAG TACGTTCACT ACTACAATCT CCGCGAACCG GCAGACAGTT ACGAAGAAGT ATACAACCTG AAAACCGACC CGCTGGAACT TCGTAATCTG GCGGTTGAGC CGACAGGAAA AGCAGCAAAG AAGTCACTGC TGCCCATTTT TGACCAACTC AAAGAAGCCG CCAGATAA
Protein sequence | MNEQPHFQPF PKKVLKRLLL SLIVLAGALG LRQPIDPASR PPTAPNIIFL LADDQRWDAL GVAGNKTIQT PNLDRLAREG FYFRRSYVTT PICCISRASI LSGQYARRHG IVDFVTPFTD SALAQTYPAL LRKAGYRTGF IGKYGVGNVM PINEYDYWRG FDGQGNYAAK DAQGKPIHLT DLMGQQMDEF LQGNPAGKPF CLSVSFKAPH AQDAANPEFP YAERFTDLYR DQTLKRPAAA DDKYYRQFPD WFRHNDQNES RIRWSRRFAT DSMFQQTTKS YNRLITGIDD VVGNLRRTLQ ERGLADNTII IYTSDNGFYE GEYGFADKWY GHELSIRVPL IIYDPRQPNR QGRTTDKYTL NIDFAPTLLT LAGVPVPGRM QGRSLTQLMD ARDGAALKTP WRTAFYFEHM FNTPAVFIPQ SEGVLSADRK YVHYYNLREP ADSYEEVYNL KTDPLELRNL AVEPTGKAAK KSLLPIFDQL KEAAR
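The sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be used. The sketch below shows one way to do that, compute a GC fraction, and translate a coding sequence. It assumes that translation table 11 uses the standard amino-acid assignments and that the first codon is an initiator read as methionine (the listed gene starts with TTG while the protein starts with M). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is a valid initiator and is emitted as 'M'.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
head = "TTGAACGAAC AACCTCATTT CCAACCCTTT"
print(translate(head))            # MNEQPHFQPF
print(gc_fraction("TTGAACGAAC"))  # 0.4
```

The translated fragment matches the first ten residues of the listed protein sequence. Although the gene lies on the minus strand, the gene sequence as listed is already in coding orientation, so no reverse complement is applied here.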