Gene Slin_0605 details


Gene Information

Locus tag: Slin_0605
Symbol:
ID: 8724333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 752045
End bp: 753493
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: Alpha-N-acetylgalactosaminidase
Protein accession: YP_003385468
Protein GI: 284035538
COG category:
COG ID:
TIGRFAM ID:
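The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene span includes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon of the span is a stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(752045, 753493))  # (1449, 482)
```

Under these assumptions the result agrees with the listed gene length of 1449 bp and protein length of 482 aa.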


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.953329
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCTA CACAAACGAA AACGAACATG GAAAACACGC GACGAAATTT CCTAAAAAAA 
GCCGGGCTAA GTGGTCTGGG GCTGGTTGCC GCCAGTCCGT TTGCCGCTTA CGCCACCACA
CCGGAAGAAC TGGCAAGCAT TGGCGAACAA GCGGCCCGAA CCCCTCCGCA AGTCTTCAAC
ATGAGCGGGT ATGCCGCGCC GAAGCTGGAT GTTGTCCGCA TCGGCATTGT CGGCATCGGC
AACCGGGGCA TGGGCGCGGT GGAGCGGATG AACAAAATCG AAGGCGTTGC CATAAAAGCC
CTTTGCGATT TGCGTCCCGA GCGGGTAAAT CTGGCACAGA AGCTCGTGGA AGCCGGTGGG
CACCGCCCGC AAACGTATTC CGGAAGCCCG GAAGCCTGGA AGAAACTCTG CGAACGAACC
GACCTCGACC TGATTTACAT TCTTACGCCC TGGGCACTGC ACACGCCCAT CGCGGTCTTC
TCCATGAACC ACGGCAAGCA TGTTTGCGTG GAGGTTCCGG CGGCCAAAAC GCTCGAAGAA
TGCTGGCAGC TGGTCGAAAC CTCGGAGCGG ACCAGGAAAC ACTGCATGAT GACCGAAAAC
TGCTGCTACG ACTTCACCGA ACTGCTCACG CTGAACATGG CGCGGCAAGG CTTCTTCGGC
GACATAGTCC ATTGCGAGGG GGCCTATATC CATAACCTGC AGGAGCTGAT CTTTTCGAAA
GAACACTTCT ATCAGATGTG GGAATTGACC GAAATGTACA AACGTACCGG CAACCTCTAC
CCGACCCACG GCCTGGGGCC AATCTGTCAG GTGCTGGATA TCAACCGGGG CGATCAGATG
GATTACCTGG TCTCCATGTC CAGTAATGAT TTTGTGCTGG GCAATATGGC CCGTTCTCTG
GCCGCCAGCG ACGATTTTTA CAAGCCCTAC GCCGGAAAAC CCTACAACGG CAACATGAAT
ACTAGCACCA TCCGGACCAA AAAAGGAAAG ACCATTTTGG TGCAGTTCGA TGTCTCCTCT
CCCCGGCCCT ATTCACGAAT TCAGCTGGTC AGCGGCACCA AAGGGGTAGC CCTGAAATAC
CCCGAGCCAG CCCGCTACTC AACGGGGCAT GAATGGATGA CGGAACAGGA GATCAAAGCC
CTTGAACAAA AATACATGCC GCCCATCGTG AAAAAAATGG GCGAAATCGC GAAGAACGTG
GGCGGTCATG GCGGCATGGA TTTTCTGATG GACTGGCGTA CCATCGACTG TCTTCGGAAC
GGGCTCCCGC TCGACCAGGA CGTTTATGAT GCCGCCCTGT GGAGTTCGAT AGGGCCGTTG
AGCGCGTGGT CAGTAGCGCA TCGCTCCAAT TCGATCGATG TCCCCGATTT TACGGGCGGG
TCCTGGCAGA AAAACAAACC CGTCGACATT TCAATGACCC GGGGTGGAAA TACGCAGGCA
AAAGTGTAG
 
Protein sequence
MASTQTKTNM ENTRRNFLKK AGLSGLGLVA ASPFAAYATT PEELASIGEQ AARTPPQVFN 
MSGYAAPKLD VVRIGIVGIG NRGMGAVERM NKIEGVAIKA LCDLRPERVN LAQKLVEAGG
HRPQTYSGSP EAWKKLCERT DLDLIYILTP WALHTPIAVF SMNHGKHVCV EVPAAKTLEE
CWQLVETSER TRKHCMMTEN CCYDFTELLT LNMARQGFFG DIVHCEGAYI HNLQELIFSK
EHFYQMWELT EMYKRTGNLY PTHGLGPICQ VLDINRGDQM DYLVSMSSND FVLGNMARSL
AASDDFYKPY AGKPYNGNMN TSTIRTKKGK TILVQFDVSS PRPYSRIQLV SGTKGVALKY
PEPARYSTGH EWMTEQEIKA LEQKYMPPIV KKMGEIAKNV GGHGGMDFLM DWRTIDCLRN
GLPLDQDVYD AALWSSIGPL SAWSVAHRSN SIDVPDFTGG SWQKNKPVDI SMTRGGNTQA
KV
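
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean_sequence` and `translate` are hypothetical. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame 1 until the first stop codon or the end of input."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence shown in this record.
first_line = "ATGGCGTCTA CACAAACGAA AACGAACATG GAAAACACGC GACGAAATTT CCTAAAAAAA"
last_line = "AAAGTGTAG"

print(translate(first_line))  # MASTQTKTNMENTRRNFLKK
print(translate(last_line))   # KV
```

The two outputs correspond to the first 20 residues and the final 2 residues of the protein sequence above. The last line happens to start on a codon boundary because the preceding 1440 bases are a multiple of 3.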