Gene Slin_6052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6052 
Symbol 
ID8729833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7342913 
End bp7344274 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content55% 
IMG OID 
Productsulfatase 
Protein accessionYP_003390813 
Protein GI284040883 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCTC TACTCCTCCT GTTCCTCGTC CTCTTCTGCC AACCGGTTCG GGTGGTTGGT 
CAAACGAAGC CTCCGGTAAA AAAACCCAAC ATCCTCCTCA TTTTAGCCGA TGACCTGGGC
TATGGCGATT TGAGCAGTTA CGGGGCGCCT GATATCCGAA CACCCCACAT TGATTCGCTC
GTTCGGGCGG GGATGCGGTT CAGCCACTTC TACGCGAACT CGTCCGTTTG TTCACCGTCA
CGCGCTGCCC TATTGAGCGG GCGGTATCCC GAGCAGGTGG GCGTACCGGG CGTTATCCGA
ACCATGCCCG ACGACAACTG GGGCTATCTG TCGCCAAGCG CCGTTCTGCT GCCTTCGATA
TTGAAGAAAA ACGGGTACTA TACGGCCCTG GTTGGCAAAT GGCATCTGGG TCTGGAGCCG
CCAAACCTGC CCAACGACCG CGGGTTCGAC CTGTTTCACG GCTTCGAGGG CGATATGATG
GACGACTACT ACACACATTT ACGCCATGAC CGGAACTACA TGCGGCTCAA TCGGCAGACC
ATCAATCCGC AGGGACACGC CACCGATCTG TTCACACAGT GGGCAACGGA TTACCTTGAG
CAACGCGCCG GTCAATCAAA TCCTTTTTTC CTGTATCTGG CTTACAATGC CCCGCACGAC
CCCATTCAGC CCCCCGCCGA CTGGCTGGCA AAAGTAAAAG CGCGTCAGCC GGGCATCAGT
GAGAAACGCG CTAAGCTGGT AGGGTTGATT GAACACATGG ACGACGGCAT TGGCAAGGTC
ATTCAAACCT TACGGGCAAA AGGCCTATAT GAAAATACGC TGATTGTGTT TGTCAGCGAC
AACGGCGGAA AGCTGTTCGA TGGGGCAACT AATGGGCCAC TGCGTAGCGG AAAAGGACAC
ATGTACGAAG GGGGCATTCG CATACCGGCC TGCGTAGTCT GGCCCGGTAA AGTTGCCGCT
CAAAGTCAGT CGCAGCAACC GCTTTTATTG ATGGATATCT TCCCAACACT GGCTGAGGCT
ACGGGTACAG TGATAAATTA CCCGATTGAC GGGCGGAGCT TCCTATCCAT TTTACGAGGA
GAACGTCAGC TGTTAGCTGC CGAACGGCCT CTTTTCTTCA TTCGGCGCGA AGGTGGCAGC
GAATACAATG GCAAAACAAT CGACGCGGTT CGGCTCGGCG ACTGGAAACT GCTTCAGGAC
AGCCCATACA GCCCGTTGGA ATTATACAAT CTGAAAGAAG ATCCGCAGGA AAAAACAAAC
CGGGCAAGCG ACCGGCCGGA GGAATTCAGA CGGCTGGAAA AACTTATGCG CGAACACACA
CGCCAGGGTG GAGCCATACC ATGGGAAAAG GAAGGCTTAT AA
 
Protein sequence
MKPLLLLFLV LFCQPVRVVG QTKPPVKKPN ILLILADDLG YGDLSSYGAP DIRTPHIDSL 
VRAGMRFSHF YANSSVCSPS RAALLSGRYP EQVGVPGVIR TMPDDNWGYL SPSAVLLPSI
LKKNGYYTAL VGKWHLGLEP PNLPNDRGFD LFHGFEGDMM DDYYTHLRHD RNYMRLNRQT
INPQGHATDL FTQWATDYLE QRAGQSNPFF LYLAYNAPHD PIQPPADWLA KVKARQPGIS
EKRAKLVGLI EHMDDGIGKV IQTLRAKGLY ENTLIVFVSD NGGKLFDGAT NGPLRSGKGH
MYEGGIRIPA CVVWPGKVAA QSQSQQPLLL MDIFPTLAEA TGTVINYPID GRSFLSILRG
ERQLLAAERP LFFIRREGGS EYNGKTIDAV RLGDWKLLQD SPYSPLELYN LKEDPQEKTN
RASDRPEEFR RLEKLMREHT RQGGAIPWEK EGL