Gene Slin_1250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1250 
Symbol 
ID8724983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1527111 
End bp1528580 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content57% 
IMG OID 
Productsulfatase 
Protein accessionYP_003386099 
Protein GI284036169 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.820666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.257041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTATA TATCCCTTTG TGTAGCCGGT GCGCTGCTAA CCGGTCTGAT CGCCGGGAAA 
CCATCGCCCG CACCCATTCG GCCAACGCCC AGTCAGGCCG GTGTCCGTAA ACCCAACATC
GTGATCGTCA TGGCCGACCA GTGGCGGGCG CAGGATTTGG GCTATGCGGG CAATCGGGAG
GTGATCACGC CGAATCTGGA TAAACTGGCC CTGGAGTCCG TTAACGCCCC CCTGTGCGTG
GCCGAAGTGC CGGTCTGCTC ACCGAGCCGG GCCAGCCTGC TAACGGGGCA GCACGCTACT
ACCCACGGAG TGTTTTATAA CGACCGACCG CTACGCAACG AAGCCGTCAC CTTAGCCGAA
GTATGCCAGC AAAACGGCTA CAAAACCGGA TTCATTGGCA AATGGCATAT CAACGGCGGG
TTAGCCAAGG ACTTCGCAGC CGGTCGTCTG GCACCGATTC CCGTTGACCG CCGACAGGGC
TTCGAGTACT GGCGGGGGCT GGAATGCACC CACGACTACA ACAACTCGCC TTACTACAAC
GAGGTGAACA AGCGGTTCGT CTGGCAGCAG TACGATGCCA TCAGCCAGAC CGATTCGGCC
ATTTCGTTCA TGACCCAGTC GCGCAAGGAG CCGTTTCTAT TGGTGCTCGC CTGGGGGCCA
CCGCACGACC CGTACCAGAC GGCCCCGAAA GAATACCGAC AACGGTACGC CGACAAAACG
TTGTCCCTGC GCCCCAATGT ACCCGCCAAA GACACGATGG AAGCCAACCG GGCTCTGAAA
GGATATTACG CGCATATCAA CGCCCTCGAC GACTGCATCG GTCGGTTACA GGCTGCGCTT
AAAGGGGCTA AACTGGACGA AAACACCATT TTCGTGTTCA CCTCCGACCA CGGCGATATG
CTGTACTCGC ACGATCAGAT CAACAAACAA AAGCCCTGGG ACGAGTCGAT CCGGATACCG
TTTCTGCTCA AATACCCGGC GGGACTGAGT CGGAAAGGCC GCACGCTGGA TGTTCCCATC
ACACTTACCG ATGTAATGCC TACGGTGCTG TCGCTGAGCG GCCAGACCAT TCCGGCCAGT
GTACAGGGGC AGAACGTTGC CAGCCTGATT CGCCAGCCCC GCGCTCCCCG GCCGGACGAT
GCCGCGCTGA TTGCCTGTAT CGTACCGTTC CACCAATGGA ATTATGGGCG CGGTGGCCGC
GAATATCGGG GAATTCGTAC AGCACGGTAT ACTTACGTGC GCGACCTGAA AGGCCCGTGG
CTGTTGTACG ATAATCAGCA GGACCCTTAC CAGCTGACGA ATTTGGCCAA TGAGCCTAAA
CTGGCCGGGA CTCAGAAACA ACTTGAGGGT ATTCTAGCGC AAAAACTCCG GGCCGCCAAC
GATAACTTCC AGGCCGGAAA CGTATACATG GATAAATGGA ATTACCCCTG GGCTTACATC
GACTCGCTGG GCAATCCATA TTATAAGTAG
 
Protein sequence
MKYISLCVAG ALLTGLIAGK PSPAPIRPTP SQAGVRKPNI VIVMADQWRA QDLGYAGNRE 
VITPNLDKLA LESVNAPLCV AEVPVCSPSR ASLLTGQHAT THGVFYNDRP LRNEAVTLAE
VCQQNGYKTG FIGKWHINGG LAKDFAAGRL APIPVDRRQG FEYWRGLECT HDYNNSPYYN
EVNKRFVWQQ YDAISQTDSA ISFMTQSRKE PFLLVLAWGP PHDPYQTAPK EYRQRYADKT
LSLRPNVPAK DTMEANRALK GYYAHINALD DCIGRLQAAL KGAKLDENTI FVFTSDHGDM
LYSHDQINKQ KPWDESIRIP FLLKYPAGLS RKGRTLDVPI TLTDVMPTVL SLSGQTIPAS
VQGQNVASLI RQPRAPRPDD AALIACIVPF HQWNYGRGGR EYRGIRTARY TYVRDLKGPW
LLYDNQQDPY QLTNLANEPK LAGTQKQLEG ILAQKLRAAN DNFQAGNVYM DKWNYPWAYI
DSLGNPYYK