Gene Slin_5003 details


Gene Information

Locus tag: Slin_5003
Symbol:
ID: 8728767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6093848
End bp: 6095449
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: sulfatase
Protein accession: YP_003389779
Protein GI: 284039849
COG category:
COG ID:
TIGRFAM ID:
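
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 6093848
END_BP = 6095449


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1602
print(expected_protein_length(length_bp))  # 533
```

Both results agree with the recorded Gene Length (1602 bp) and Protein Length (533 aa).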


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGGC CCAAACAAGC GTTCGTATCG TCCCCTCAGA CCACGTTGAA AACATTGTTT 
TTTCTGGCTC TGCTCGCTAT TATAAGCAGT CAGTTAGCCA TTGGGCAGTC GGTCAAACGC
CCGAATATCC TGTACATTCT GGCCGACGAT ATGGGCTTTT CGGACATTGG CTGCTACGGG
GGCGAGGTCA ACACGCCGAA TCTTGATAAA CTGGCGGCTG GCGGTATCAA GCTGCGGAGT
TTTTATAACA ACGCCCGCTG CTGCCCAACC CGAGCCTCTT TGCTCACGGG GCAGTATCCG
CACACCGTTG GCATGGGCCT GATGGTGACC ATGCCCAACG CAGCCATTCA GCCGGGGAGT
TATCAGGGAT TTCTGGATGC GCGTTACCCG ACTATTGCCG AGCGACTGAA AGAAACGGGC
TATAGCACCT ACATGCTCGG CAAGTGGCAC GTGGGCGAGC GCCCCGAGCA TTGGCCCCTG
AAGCGGGGTT TCGAGCACTA CTTCGGCCTG ATCTCCGGCG CATCGAGCTA TTACGAAATC
ATTCCTGCCG AGAAAGGCAA GCGGTTCATT GTCCTCGACG ATAAGGAGTT TACCCCGCCC
GCCGACGGTT TTTACATGAC CGACGCCTTC ACCGATTACG CCGTTCAGTA CCTCAACCAA
CAGAAGCAGG AACAGGCCGA CAAACCGTTT TTTATGTACC TGGCCTACAC TGCGCCCCAC
TTTCCACTGC ACGCGTATGA GTCGGACATT GCCAAATACG AGAAACTGTA TGCGCAGGGG
TGGGATGTGA CCCGTACTAA ACGCTACCAG AAAATGCAAC AGCTTGGGCT GATCGACAAG
CGTTACCAAC TGACGCCCCG CCCTGCTAAC GTACCCGCCT GGAATTCGGC CACCGATAAA
GCGCAGTGGA TTCGGAAAAT GGCCGTGTAT GCTGCCATGA TCGACCGGAT GGACCAGAAT
ATTGGTCGGC TTATTAAAAC CCTGAAAGCC AACGGCCAGT ACGACAATAC GCTCATCGTG
TTCATGTCGG ACAACGGGAG TTCGAACGAA AATATGGAAA GCCGGAAGCT GAACGACCCC
ACCAAAAAGA TCGGTGAACG CGGTTCTTAC GTCACCTACG ATACGCCTTG GGCCAACGTG
TCGGTTACGC CGTTTCGGAA GTACAAGCGG TTTCTGCACG AGGGCGGCAT GATTACACCC
TGCATTATGC AATGGCCCCG CAACATTCGG CCAGCCGCTG GCTATGTGGA TGGCATTGGC
CACGTCATGG ACCTGCTGCC TACAAGTCTT GAATTAGCGG GCTTGTCGGC CAACGATTTG
CCCGGCAAAA GCTTGTCGTA TCTATGGACA CCTAAAAAGA CCGAACCACG CACCTATTGC
TGGGAACACG AAGGCAACAA AGCCATCCGA AAAGCTGACT GGAAACTGGT AAAAGATACC
GAAGACGCCG ATTGGGAACT GTACAACATC AAAACTGACC CCTGCGAAAC CAACGATTTA
GCCAGAAACC AACCCCAACG CGTGGCCAGT ATGCGAACCG AGTTCGATAC ATGGGCACAA
CGGGTGGGCG TTCGCGAACG ACCGGCCGGG AAGTCGGAAT AG
 
Protein sequence
MKRPKQAFVS SPQTTLKTLF FLALLAIISS QLAIGQSVKR PNILYILADD MGFSDIGCYG 
GEVNTPNLDK LAAGGIKLRS FYNNARCCPT RASLLTGQYP HTVGMGLMVT MPNAAIQPGS
YQGFLDARYP TIAERLKETG YSTYMLGKWH VGERPEHWPL KRGFEHYFGL ISGASSYYEI
IPAEKGKRFI VLDDKEFTPP ADGFYMTDAF TDYAVQYLNQ QKQEQADKPF FMYLAYTAPH
FPLHAYESDI AKYEKLYAQG WDVTRTKRYQ KMQQLGLIDK RYQLTPRPAN VPAWNSATDK
AQWIRKMAVY AAMIDRMDQN IGRLIKTLKA NGQYDNTLIV FMSDNGSSNE NMESRKLNDP
TKKIGERGSY VTYDTPWANV SVTPFRKYKR FLHEGGMITP CIMQWPRNIR PAAGYVDGIG
HVMDLLPTSL ELAGLSANDL PGKSLSYLWT PKKTEPRTYC WEHEGNKAIR KADWKLVKDT
EDADWELYNI KTDPCETNDL ARNQPQRVAS MRTEFDTWAQ RVGVRERPAG KSE
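
As a consistency check between the two sequence blocks, the sketch below translates the first 60 nucleotides of the gene sequence and compares the result with the first 20 residues of the protein sequence. It assumes the gene sequence is given 5' to 3' on the coding strand, and it builds the codon table by hand; for elongation codons, translation table 11 uses the same assignments as the standard code. The `translate` helper is hypothetical, written only for this illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGAAAAGGC CCAAACAAGC GTTCGTATCG "
    "TCCCCTCAGA CCACGTTGAA AACATTGTTT"
)
print(translate(first_line))  # MKRPKQAFVSSPQTTLKTLF
```

The output matches the first two 10-residue groups of the protein sequence (MKRPKQAFVS SPQTTLKTLF). Passing the full gene sequence to the same function would be the natural way to check the remaining residues.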