Gene Slin_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0149 
Symbol 
ID8723877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp187122 
End bp188279 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content57% 
IMG OID 
ProductSarcosine oxidase 
Protein accessionYP_003385014 
Protein GI284035084 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00237457 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCTTTG ATGCGATTGT TGTTGGATTA GGCGCTATGG GCAGTGCCAC CCTCTACCAG 
CTTGCCAAAC AGACCCCCAA TGTGCTCGGC CTCGATCAGT TTGCGCCCCC GCACACCCTG
GGCTCTACGC ACGGCGACAC CCGCATTACC CGCCAGGCCA TTGGCGAAGG AGCCCACTTC
GTACCGCTGG CCCTTCGCTC CTACGACATC TGGCGCGAAC TCGAACAGCG CACCGGCGAG
GAATTACTAA CCATTACGGG TGGGCTGTTT ATCGGGCAGG AACACTCGCC CGTTCAGATG
CACAATAAGC CCGGCTGGCT CAGCACCACC ATTCGGGCCG CCGAACAATT TGGTATCGCC
CATCGGCTGC TGGATCATGC AGCCCTTCGG CGCGAATTTC CGCAGTTCAG ATATCGCCCC
GACGATATTG GGTATTATGA AGAAGAAGCC GGTTTTTTGA AGCCGGAGCG ATGCATTTCG
GTTCAGTTGG AACAGGCCCG GCAGTATGGG GCGTCTGTCC GGACCAACGA ACGGATGGTA
GCTTTTGACG CCACGAAGAC GGGTATTACC GTTCGGACGG AACAGGGCGT TTACCAGACC
CGAAAGCTGA TTTTGACCAC CGGGTCTTGG ATTACGGAGT CGCTGCGCCA TACGCCCTAT
CAGGAGTTGC TTACCGTATA CCGTCAGGTG CTGTACTGGT TCGCTATTGA GGGCAACTAT
ACACAGTATA CGCCGGATAA GCTACCGGTA TTTATTCTGA GCGAGCGAGA CCTGTACGGA
TTCCCGGCGG TGGGCGGTCC CGCTGGTGGT TTGAAAATTG CCACCGAAAC CTACGCCCAC
GCAACGAGTC CGCAGGTTGT AGACCGCACC GTCAGCGAAG CCGAAACACG ACGCATGTAC
GAAGAGCACA TTGCGCCCAA CTTCGTGGGG GTCGGTCCGG CCTGTGTCAA GTCCGTTGTA
TGCTTGTATA CGATGACACC AAACGGCGAT TTTATCATCG ACCAGCATCC AGACCATCCC
GATGTACTGC TGGCTTCGGC CTGTTCGGGG CACGGTTTCA AACATTCGGC GGCCGTTGGG
CAGATTCTGG CTGAATTAAC CCTTCAGAAC AGGACCGGCT TTTCGATCGA GCCTTTCCAG
CTAGGCCATT CGCGTTAG
 
Protein sequence
MIFDAIVVGL GAMGSATLYQ LAKQTPNVLG LDQFAPPHTL GSTHGDTRIT RQAIGEGAHF 
VPLALRSYDI WRELEQRTGE ELLTITGGLF IGQEHSPVQM HNKPGWLSTT IRAAEQFGIA
HRLLDHAALR REFPQFRYRP DDIGYYEEEA GFLKPERCIS VQLEQARQYG ASVRTNERMV
AFDATKTGIT VRTEQGVYQT RKLILTTGSW ITESLRHTPY QELLTVYRQV LYWFAIEGNY
TQYTPDKLPV FILSERDLYG FPAVGGPAGG LKIATETYAH ATSPQVVDRT VSEAETRRMY
EEHIAPNFVG VGPACVKSVV CLYTMTPNGD FIIDQHPDHP DVLLASACSG HGFKHSAAVG
QILAELTLQN RTGFSIEPFQ LGHSR