Gene Slin_5375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5375 
Symbol 
ID8729140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6536379 
End bp6538061 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content52% 
IMG OID 
Productsulfatase 
Protein accessionYP_003390142 
Protein GI284040212 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAC CGACCGTTTT TCGCTCGCTG CTGGGTGCAA CTGCCGTGGC TCTGCTGCTG 
GCTATACCCG GCTACACCCC TGTTCCGCTG GAGGAGCCTC AACCCAACCG TGCGGCTCAA
CAGCGACCTA ATGTGATTTT CATCATTTCT GACGATCACA CATCACAGGC CATCAGTGCG
TATGGCAGCA AACTGGCTAA AACGCCCAAC ATCGACCGAA TTGCCCGGGA AGGAGCCATT
CTTTATAACA ATGTTGTGGC CAACTCGATC TGCGGCCCCA GTCGCGCTAC CTTGCTGACT
GGCCAGTTCT CGCACCGGAA CGGATACAAA TTCAACGAAA AGGTATTCGA TATCAGCCAG
CCGGTTTTCA CTGAGGAGTT GCAAAAGAAC GGCTACCAAA CGGCCTGGAT CGGCAAAATG
CACCTGGGGA GCCTGCCCCA CGGATTCGAT TACCTGAATA TTCTGCCGGG GCATGGCAAT
TATTACAATT CGGATTTTGT CGACTCCAAT AACAAAACGA CCCGGCACAT GGGTTACGTG
ACCGATGTCG TAACGAGCCT CTCCACCGAC TGGCTCGCTC ACCGCGATAC GGCGAAACCT
TTCTTTCTGG TGGTTGGTCA CAAAGCGACC CACCGTGAGT GGATGCCCGC TGTGGAGGAT
TTAGGCGCTT ACGACAACGT CACGTTTCCT ATACCGCCCA CGTTTTATGA CGACTATGAG
GGTCGGTTAG CGGCTCAAAA GCAGGAAATG AGCATCGACA AGTCAATGAA TCTGAGAGCA
GATCTGAAAG TCGATGTTAA ATATGAGGCT GATGAAGCCA CGATGGAGCA GGAAAAGGCC
GACTTTCGGA AGGCGTTTTA CGGGTCCAAT CAACCTACCC CGGCGCAGGA AAAACAGCTG
GACACCTACG TTCGGGAAGG CTCGTACCGA CGCCTGAACC CCGAACAGAA AAAAGCCTTT
TCCAGCTATT ATGGCAAAAT CAGTAAGGAG TTTGCCGATA AAAAGCTGAC CGGTAAAGCC
CTGGCGGAGT GGAAATATCA GCGTTATCTG AAAGATTATC TGTCTACCGC CAACTCGCTG
GATCGCAACA TCGGCAAGCT GCTCGATTAC CTGGATAAAA GCGGACTGGC TAAAAATACC
GTAGTCGTAT ACACCTCCGA TCAGGGCTTT TACCTGGGCG AACACGGCTG GTTCGATAAG
CGGTGGATTT ATGAGGAATC CCTGAAAACG CCGTTTGTAA TCCGGTATCC GGGCGTTATC
AAACCGGGCA GTCAGGTGAA GCAGGTCGTA TCGAATGTCG ATTGGGCCCC CACCTTATTG
AGCCTGACGG GCACGCGTGT TCCCGACTAT GTGCAGGGCG AATCGTTCCT GCCGCTGCTG
ACCGGCGGCA AAAACGACTG GCGAAATCAG GCTTACTACC ACTATTATGA GTACCCACAG
CCGCACCATG TCTCGCCCCA TTTCGGGTTG CGTACGGCTC AGTACACGCT GGCTCGTTTT
TACGGCCCGG AAGACTTTTG GGAACTGTAC GACATCCAGA AAGACCCCCA GAATCTTAAC
AATGTGTACG GTCAGAAAGG CTACGAAAAA GTAACGGCTC TGCTGAAAAA GCAGTTGAAG
GACCAAATCA TTAAGTACAA AGATGAGGAA GCACTCAAGT TGATGGCCGC AAATCCGCAG
TAG
 
Protein sequence
MSKPTVFRSL LGATAVALLL AIPGYTPVPL EEPQPNRAAQ QRPNVIFIIS DDHTSQAISA 
YGSKLAKTPN IDRIAREGAI LYNNVVANSI CGPSRATLLT GQFSHRNGYK FNEKVFDISQ
PVFTEELQKN GYQTAWIGKM HLGSLPHGFD YLNILPGHGN YYNSDFVDSN NKTTRHMGYV
TDVVTSLSTD WLAHRDTAKP FFLVVGHKAT HREWMPAVED LGAYDNVTFP IPPTFYDDYE
GRLAAQKQEM SIDKSMNLRA DLKVDVKYEA DEATMEQEKA DFRKAFYGSN QPTPAQEKQL
DTYVREGSYR RLNPEQKKAF SSYYGKISKE FADKKLTGKA LAEWKYQRYL KDYLSTANSL
DRNIGKLLDY LDKSGLAKNT VVVYTSDQGF YLGEHGWFDK RWIYEESLKT PFVIRYPGVI
KPGSQVKQVV SNVDWAPTLL SLTGTRVPDY VQGESFLPLL TGGKNDWRNQ AYYHYYEYPQ
PHHVSPHFGL RTAQYTLARF YGPEDFWELY DIQKDPQNLN NVYGQKGYEK VTALLKKQLK
DQIIKYKDEE ALKLMAANPQ