Gene Slin_4179 details

Gene Information

Locus tag: Slin_4179
Symbol:
ID: 8727938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5031006
End bp: 5032550
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: sulfatase
Protein accession: YP_003388964
Protein GI: 284039034
COG category:
COG ID:
TIGRFAM ID:
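
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. Both assumptions match the values in this record but are not stated by it. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(5031006, 5032550)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1545 514
```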


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.10978
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCACGG GTCTGAAACC ATTATTGAGT AGTATTCTGG TATTGCTTTG TGTAGTATAT 
ACTGTGTTCG ACGCCGTTGA ACCGACCCGT TTAACCAGCA GTTCCGGGCA GAAAACGAGG
GCTGCGGACA GTCGGCCAAA CATTGTTTTG ATCGTTGCCG ATGATCATGG CCGGGAGGTT
TTAGGCTGCT ATGGCGCATT GGCTATCAAG ACGCCACACA TCGATCAGTT AGCCGCCGAC
GGGGTTCGCT TTTCCAATGC GTTTTGTACA ACGGCCAGTT GCAGTCCCAG CCGCTCTGTA
TTGTTGACGG GTTTGCAGAA TCATACCAAT GGAATGTATG GCCTTGAACA TCAGGAACAC
CACTTCGCTT CCTTCGATAC CGTACGGTCG TTACCCGTTC TGCTTGAAAG AGCGGGATAC
CGCACCGCCC GAATTGGGAA ACTGCACGTA GCGCCGGAGA AGGTGTACCA TTTTCAACAG
GTACTCAAAG GTGGTGGAGT AAACGATCCG GCATCTATCG GCCGTAGCCC GGTCGAAATG
GCCCGCTTCT GTTATCCTTT TCTGGAGGCC ACAACGCATA CCAGCGCACC AACGAACCAA
CCGAACACAG CTCAACCCTT CTTTCTTTAC TTTGCCACGG ATGATCCGCA CCGCAGCAAC
ACGGTGGCCA CCGATGGATC GCCGGTGTTT GATGGTACTA AACCAAATGT ATTCGGGAAT
CGGCCAGGGG GATATCCGCA GGTGGGCGAC CATTTCTATC AGCCTCGGGA TGTACGCGTA
CCGGCTTACT TACCCGACAC AAAAGCGTGC CGGGCCGAAC TGGCACAGTA TTATGAAGCC
ATCAGCCGAC TGGATGCGGG CGTTGGCCGA CTGATCGACT ACCTGAAAGA CACCGGGCAG
TACGATAATA CCCTGATTGT CTATCTATCC GATAATGGCG CGCCTTTTCC GGGAGCTAAA
ACAACCTTGT ATGAACCGGG TATGCGGTTA CCGTGCATTG TTAAATTGCC GAAACCAAAG
AAACGGGGAT TTGTCCAGGA TGCGATGATT TCCTGGGCCG ATATAACACC TACACTGCTG
GATTTTGCCG GTGTCCGGCC CAGAAATTCA CCAAAGCTAG GGCGATCCTT CAAGGATATT
ATCGAGCAGG AACAGGTAAC GGGTTGGGAT GAAGTATATG CCTCGCACTC GCTGCACGAA
GTGACCATGT ATTACCCCAT GCGAGTGGTA CGGGAACGTC GGTATAAACT GATTTATAAC
ATTGCTTATC AACTGCCGTT TCCTATGGCG CTCGACTTAT ACCACTCCTT TACGTGGCAG
GATGTGCTCC GCACGAAGCA GAAATTGTAC GGCAAACGAA CGGTGAACAC CTATCTGCAT
CGCCCGCGGT TCGAATTATA CGATCTGCAA ACGGACCCGG ATGAAGTGAA AAATCTGGCC
GTCAATCCCC AATTTAAAGC GGTACTGGCC CGGATGCAAG CCCGGCTCAA ACGGTTTCAG
CAGCAAACCC GCGACCCGTG GATGAGCAAA TGGAATGTTG AGTGA
 
Protein sequence
MVTGLKPLLS SILVLLCVVY TVFDAVEPTR LTSSSGQKTR AADSRPNIVL IVADDHGREV 
LGCYGALAIK TPHIDQLAAD GVRFSNAFCT TASCSPSRSV LLTGLQNHTN GMYGLEHQEH
HFASFDTVRS LPVLLERAGY RTARIGKLHV APEKVYHFQQ VLKGGGVNDP ASIGRSPVEM
ARFCYPFLEA TTHTSAPTNQ PNTAQPFFLY FATDDPHRSN TVATDGSPVF DGTKPNVFGN
RPGGYPQVGD HFYQPRDVRV PAYLPDTKAC RAELAQYYEA ISRLDAGVGR LIDYLKDTGQ
YDNTLIVYLS DNGAPFPGAK TTLYEPGMRL PCIVKLPKPK KRGFVQDAMI SWADITPTLL
DFAGVRPRNS PKLGRSFKDI IEQEQVTGWD EVYASHSLHE VTMYYPMRVV RERRYKLIYN
IAYQLPFPMA LDLYHSFTWQ DVLRTKQKLY GKRTVNTYLH RPRFELYDLQ TDPDEVKNLA
VNPQFKAVLA RMQARLKRFQ QQTRDPWMSK WNVE
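
Both sequences in this record are printed in space-separated blocks of ten characters, so the spaces and line breaks have to be stripped before any downstream use. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the sample is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, used only as a short sample.
sample = "ATGGTCACGG GTCTGAAACC ATTATTGAGT AGTATTCTGG TATTGCTTTG TGTAGTATAT"
seq = clean_sequence(sample)
print(len(seq))          # number of bases after removing the block spacing
print(gc_fraction(seq))  # GC fraction of the sample line only
```

Running the same two functions over all of the gene sequence lines should give a length that can be compared with the Gene Length field and a GC fraction that can be compared with the GC content field.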