Gene Slin_1251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1251 
Symbol 
ID8724984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1528602 
End bp1530059 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content56% 
IMG OID 
Productsulfatase 
Protein accessionYP_003386100 
Protein GI284036170 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.601083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.331628 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACGAAC AACCTCATTT CCAACCCTTT CCTAAAAAAG TATTGAAACG ACTTCTGCTA 
TCCCTTATCG TCCTGGCCGG GGCGCTCGGC CTTCGGCAGC CGATTGACCC GGCATCGCGC
CCGCCCACGG CTCCCAATAT CATTTTCCTG CTGGCCGATG ACCAGCGGTG GGATGCCCTG
GGTGTTGCCG GAAATAAAAC AATCCAGACG CCTAACCTCG ACCGGCTGGC GCGGGAGGGG
TTCTATTTCC GGCGTTCGTA CGTGACTACG CCTATCTGCT GCATCAGCCG GGCCAGTATC
CTGAGCGGGC AGTACGCCCG CCGGCACGGG ATTGTGGATT TTGTGACGCC CTTTACCGAT
TCTGCTCTGG CGCAGACCTA CCCGGCGCTG CTCCGGAAAG CCGGTTACCG AACGGGATTC
ATTGGTAAGT ATGGCGTGGG AAATGTGATG CCCATCAATG AATATGATTA CTGGCGGGGT
TTCGATGGGC AGGGTAACTA TGCGGCCAAG GATGCGCAGG GGAAGCCGAT TCACCTGACC
GATTTAATGG GCCAGCAAAT GGACGAGTTT CTTCAGGGAA ATCCGGCCGG AAAGCCGTTC
TGCTTATCGG TGAGCTTCAA AGCGCCCCAC GCACAGGATG CGGCCAACCC TGAATTCCCC
TATGCCGAAC GGTTCACCGA CCTCTACCGC GACCAGACGC TAAAACGCCC CGCTGCCGCC
GATGATAAAT ACTACCGACA GTTTCCAGAC TGGTTTCGGC ATAACGACCA GAACGAGTCC
CGCATTCGCT GGAGCCGCCG GTTCGCTACG GATTCGATGT TTCAGCAGAC CACCAAATCG
TATAACCGGC TGATTACGGG TATTGATGAC GTCGTGGGTA ACCTCCGCCG AACCTTACAG
GAGCGGGGAC TCGCCGACAA TACCATCATC ATATACACCA GCGATAACGG GTTTTACGAA
GGCGAATATG GCTTTGCCGA CAAATGGTAC GGCCATGAGT TGTCGATCCG GGTACCGCTC
ATCATTTACG ATCCCCGCCA ACCGAATCGG CAAGGTCGCA CCACCGACAA GTATACGCTC
AACATTGATT TCGCTCCTAC CCTGCTCACG CTGGCGGGGG TACCGGTGCC GGGCCGGATG
CAGGGGCGCA GCCTTACGCA ACTGATGGAC GCCCGCGACG GCGCAGCACT CAAAACACCC
TGGCGAACGG CCTTCTATTT TGAGCACATG TTTAATACGC CTGCCGTATT TATTCCTCAA
TCCGAAGGGG TGCTGAGCGC CGATAGAAAG TACGTTCACT ACTACAATCT CCGCGAACCG
GCAGACAGTT ACGAAGAAGT ATACAACCTG AAAACCGACC CGCTGGAACT TCGTAATCTG
GCGGTTGAGC CGACAGGAAA AGCAGCAAAG AAGTCACTGC TGCCCATTTT TGACCAACTC
AAAGAAGCCG CCAGATAA
 
Protein sequence
MNEQPHFQPF PKKVLKRLLL SLIVLAGALG LRQPIDPASR PPTAPNIIFL LADDQRWDAL 
GVAGNKTIQT PNLDRLAREG FYFRRSYVTT PICCISRASI LSGQYARRHG IVDFVTPFTD
SALAQTYPAL LRKAGYRTGF IGKYGVGNVM PINEYDYWRG FDGQGNYAAK DAQGKPIHLT
DLMGQQMDEF LQGNPAGKPF CLSVSFKAPH AQDAANPEFP YAERFTDLYR DQTLKRPAAA
DDKYYRQFPD WFRHNDQNES RIRWSRRFAT DSMFQQTTKS YNRLITGIDD VVGNLRRTLQ
ERGLADNTII IYTSDNGFYE GEYGFADKWY GHELSIRVPL IIYDPRQPNR QGRTTDKYTL
NIDFAPTLLT LAGVPVPGRM QGRSLTQLMD ARDGAALKTP WRTAFYFEHM FNTPAVFIPQ
SEGVLSADRK YVHYYNLREP ADSYEEVYNL KTDPLELRNL AVEPTGKAAK KSLLPIFDQL
KEAAR