Gene Slin_6288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6288 
Symbol 
ID8730072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7628246 
End bp7629817 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content55% 
IMG OID 
Productsulfatase 
Protein accessionYP_003391046 
Protein GI284041116 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.44956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGCTT TTCTGGCAAT GAGACAACCA CTTCTTTGGG TTAGTGCATT CCTGCTACTG 
ATGGGATGGG TGCTTGTTTC CTTCAAGCCC CCCCGGACTA CCGTATCCCG GGATGCCGTA
CCCCGGACTG CCGTATCACC CAACATCATT TACATTTATG CCGATGATTT GGGCTATGCG
GAGTTAGGTT GCTACGGCCA GCAGAAAATC CGTACGCCGA ACCTGGACAA ACTGGCCCGC
GAAGGTATTC GTTTTACGCA GCACTACACG AGTATGCCCG TTTGCGCTCC CGCCCGCTGT
ATGCTCCTGA CCGGCAAACA CAGTGGTCAT TCATACATCC GGGGTAATTA TGAGATGGGC
GGCTTCCCTG ATTCACTGGA GGGCGGGCAG ATGCCGCTTT ATCCGGGTGC TTTCACCATT
GGTCGGCTGT TACAGCAGCA GGGCTACAAA ACGGCCTGTG TCGGTAAATG GGGCATGGGT
ATGGCCAATA CCACCGGCAA CCCAAACGAG CAGGGCTTCG ATTATTTCTA CGGCTACCTC
GATCAGAAAC AGGCGCACAA CTATTACCCC ACGCACCTCT GGGAAAACGG CAAACCCGAC
AAACTCAATA ACCCCGTCAT CGACGTACAC CGGCGGCTAA CTCCCGAAAC AGCTACGCCC
GAAGCTTTTG CCTATTTCCG GGGTAACGAC TATGCCATCG ACAAACTGGC GCAGAAAGCA
CAGGCCTTTA TCCGTCAGAA TAAAAGCGGA CCGTTTTTCC TATACCTACC GTTCACCGCA
CCCCATGTAT CATTGCAGGC ACCCGAAGCC GCCGTAAAGG AATATATCGG CAAATTCGGG
GATGGAGAGC AACGCACCGA ACGTCCGTAT CTGGGGGAGC AGGGGTACGC GTCGACGCCT
TACCCACGCG CTACCTATGC GGCTATGATT ACGCACATGG ACGCGCAGAT CGGGCAGTTG
ATGCAACTGC TGAAAGACCT GAAAATTGAT GAAAACACCC TGGTCATGTT TTCCAGCGAC
AATGGGGCTA CGTTCAACGG TGGGGTGGAG GCTGCTTATT TTAACAGTGT GGGCAAGCTG
CGTGGCCTGA AAATGGATGT GTACGAAGGT GGCATTCGCG AGCCGATGCT GGCCCGCTGG
CCCGGCAGAA TCAAGCCGAA CCAGACTACC GATCACGTAT CCGTTCAGTA CGACCTGCTG
GCTACGCTGG CCGAACTGGT AGGGTATAAA CGACCCTTCG CTACAGACGG CATCTCGTTC
CTGCCGACCT TACTGGGACA ATCGTCCAGC CAGAAGCAAC ACCCGTTCCT GTACTGGGAA
TACCCCGAAA AGGGCGGTCA GCTGGCCATT CGCATGGGCA ATTGGAAAGC CGTCAAAACC
AACGTACGCA AAGACCGAAC CACCCCCTGG GAGTTATATG ACCTAAACAA AGACGTAAGT
GAAACCACCA ACATAGCCGA CAAGCACCCG GACATCATCC GTCAGGCCAA CGCCATCGTC
GCCCGCGAGC ACATACCCAC CCACATCAAC GAGTGGGAAA TCGTAACCCC CAAAACCAAA
CCAGCAAACT AA
 
Protein sequence
MHAFLAMRQP LLWVSAFLLL MGWVLVSFKP PRTTVSRDAV PRTAVSPNII YIYADDLGYA 
ELGCYGQQKI RTPNLDKLAR EGIRFTQHYT SMPVCAPARC MLLTGKHSGH SYIRGNYEMG
GFPDSLEGGQ MPLYPGAFTI GRLLQQQGYK TACVGKWGMG MANTTGNPNE QGFDYFYGYL
DQKQAHNYYP THLWENGKPD KLNNPVIDVH RRLTPETATP EAFAYFRGND YAIDKLAQKA
QAFIRQNKSG PFFLYLPFTA PHVSLQAPEA AVKEYIGKFG DGEQRTERPY LGEQGYASTP
YPRATYAAMI THMDAQIGQL MQLLKDLKID ENTLVMFSSD NGATFNGGVE AAYFNSVGKL
RGLKMDVYEG GIREPMLARW PGRIKPNQTT DHVSVQYDLL ATLAELVGYK RPFATDGISF
LPTLLGQSSS QKQHPFLYWE YPEKGGQLAI RMGNWKAVKT NVRKDRTTPW ELYDLNKDVS
ETTNIADKHP DIIRQANAIV AREHIPTHIN EWEIVTPKTK PAN