Gene Slin_4261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4261 
Symbol 
ID8728020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5143438 
End bp5144958 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content56% 
IMG OID 
Productsulfatase 
Protein accessionYP_003389044 
Protein GI284039114 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.362858 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCAG CCCTTTTCCT TTTATTCAGT AGCTTATGCC TACTCGGCCA ATCGGCTGTG 
GTAGCGCAGC AGAAGCCCAA TATTGTGCTC ATCTACGCCG ATGATCTGGG CTATGGCGAC
ATCAGCTGCA ATGGCGCGAC AAAAATCCGT ACGCCAAACA TCGACCGGGT GGCCCGCGAA
GGATTGAACT TCACCAACGC CCACGCGTCG TCGTCGACCT GTACACCCTC GCGCTACACC
CTCCTGACGG GTGCCTACGC CTGGCGTAAA ACAGGCACCG GCATTGCGCC AGGCGATGCC
GCTCTGCTCA TCCCGACCGA CCGCGTCACG ATGCCCGGCA TCTTACAAAA AGCAGGCTAT
AAGACGGGGG TCGTTGGCAA ATGGCATCTG GGACTTGGCC CCAAAGGGGG TCCCGACTGG
AATGGCGACA TAAAACCCGG ACCGCTCGAA ATCGGCTTCA CGTACTCTTT TCTGCTACCC
GCCACCGGCG ACCGGGTGCC CTGCGTGTAT GTCGAAAATC ACCGTATCGT CAATCTGGAC
CCGGCCGACC CGGTTCAGGT AAGTTATAAA GAGCCGATCG GAACCGAGCC GACCGGCAAA
GACCATCCGG AGTTGCTTAA AATGTTGTTC TCCCACGGAC ACGACCAAAC GATCATCAAT
GGAGTTAGCC GAATTGGTTA CATGAGCGGG GGAAAGTCGG CCCGGTGGGT CGATGAGGAG
ATGGCGGATG TGCTGACGGG CAAAGTGAAC CAGTTTATCG AGACCAGCAA AAGCGGTCCT
TTCTTCGTGT ATTTCTCCAC GCACGACATT CACGTGCCGC GTATGCCCCA CTCCCGTTTT
GCGGGCAAAA GTGGGATGGG GCCGCGTGGT GACGCCATTC TGCAGCTGGA CTGGTGCGTG
GGCGAAGTCA TGAAAACCCT GGACCGGCTG GGTTTAAAAG ACAACACGAT GGTGATCATC
AGCAGTGATA ACGGCCCGGT TGTCGATGAC GGCTACAAAG ATCAGGCGGT TGAAAAACTA
AACGGCCACA AACCCGCCGG ACCTCTGCGT GGGGGTAAAT ACAGTGCGTT CGATGCCGGA
ACCCGGGTGC CGTTTATCGT ACGCTGGCCG GGGAAAGTGA AGCCTGGCAT CTCCGATGCG
CTGTTTAGTC AGGTCGACCT CGCGGCTTCT TTTGCTGAAC TAGTGGGCCA GCCATTGGCG
AAAGGAGAAG CTCCCGACAG CTTTAATAGC CTGACGACGC TCCTGGGGAC AACTAAAAAG
AGTCGTGAAT ACGTTATAGA ACATGCGATC AATGGCACGC TTTCGCTGAT ACGTGGCAAC
TGGAAATACA TCGAACCTTC TGGTGGCCCG ATACTCAACC GTGAAACCAA TATCGAAACG
GGGTATGCCC CACAGCCGCA GTTGTATAAC CTGCAAACCG ATCTTGGCGA AACGAAGAAC
CTGGCCGAGA GCAACCCACA ACTAACTTCC GAGCTGGCCG CATTACTGAA AACCATCCGC
GAAAAAGGAA ACACCAATTA G
 
Protein sequence
MKPALFLLFS SLCLLGQSAV VAQQKPNIVL IYADDLGYGD ISCNGATKIR TPNIDRVARE 
GLNFTNAHAS SSTCTPSRYT LLTGAYAWRK TGTGIAPGDA ALLIPTDRVT MPGILQKAGY
KTGVVGKWHL GLGPKGGPDW NGDIKPGPLE IGFTYSFLLP ATGDRVPCVY VENHRIVNLD
PADPVQVSYK EPIGTEPTGK DHPELLKMLF SHGHDQTIIN GVSRIGYMSG GKSARWVDEE
MADVLTGKVN QFIETSKSGP FFVYFSTHDI HVPRMPHSRF AGKSGMGPRG DAILQLDWCV
GEVMKTLDRL GLKDNTMVII SSDNGPVVDD GYKDQAVEKL NGHKPAGPLR GGKYSAFDAG
TRVPFIVRWP GKVKPGISDA LFSQVDLAAS FAELVGQPLA KGEAPDSFNS LTTLLGTTKK
SREYVIEHAI NGTLSLIRGN WKYIEPSGGP ILNRETNIET GYAPQPQLYN LQTDLGETKN
LAESNPQLTS ELAALLKTIR EKGNTN