Gene Slin_1416 details


Gene Information

Locus tag: Slin_1416
Symbol:
ID: 8725150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1720981
End bp: 1722654
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: glycoside hydrolase family 39
Protein accession: YP_003386265
Protein GI: 284036335
COG category:
COG ID:
TIGRFAM ID:
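
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1720981
end_bp = 1722654


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1674 557
```

Under those assumptions the result agrees with the listed gene length of 1674 bp and protein length of 557 aa.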


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.336596
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCG TAGTCTTACT TTACCTTTTA TTGTTCTGTT GCCGCATCGC TTTCGCGCAG 
CAGACGCCCG TTTCGATTCA GGTCGATGCC GCAAAACCCG TGGGCGACAT GAAGCCGTTC
TGGGCATTTT TCGGGTACGA CGAGCCTAAT TATACCACCC GGAAAGACGG CCAGAAACTG
CTGTCCGAAC TGCAGCAGTT AAGCCCCGTT CCGGTCTATG TTAGAGCGCA TAACCTGTTG
ACCTCTAAAG GCGAAAGCCC CGGCCCTGAC CTTAAATGGG GGTTTACGGA TGCGTATAAA
GAGGACGCCA ACGGCAAGCC CATCTATAAC TGGGTCACGG TCGACAGCAT CGTGGACACC
TACATCAAAC GGGGCATGAA GCCACTGATG GAAATTGGAT TCATGCCCAA AGACTTATCA
TCGAAACCCG AGCCATATGC CCATACCTGG AGTAAAAACG GCAACATCTG GACGGGCTGG
ACATATCCGC CCAAAGACTA TGACAAATGG CGGGAACTCG TGTACCAGTG GGGCAAACAT
TCCATTGACC GTTATGGCAA AAAAGAGGTG AGCACGTGGC TGTGGGAAGT CTGGAACGAG
CCGGACATTG GCTACTGGTC AGGTACGTTC GATGAATACT GCAAAACCTA CGACTATGCC
GCCGACGGTC TGAAACGGGC CTGTCCCGAA TGTACCATTG GCGGGCCGCA CACGACCAGT
CCCCGTAGCG ACAAAGCGTA CAGCTACCTG ACCCGGTTCA TCGAGCATTG CCTGCGGGGT
AAGAACTACG CGACTGGTAA AACCGGTACA CCCTTGCAAT ACATCGGCTT TCACGCCAAA
GGAAATCCCG AGATTGTCGA CGGGCACATT CGCATGAACA TGGGCGTGCA GTTAAAAGAC
GTGGAGCGGG CGTTTGAAGC CGTAAATTCA TTTCCGGAGC TGAAGAACAT TCCCATCATT
ATCGGCGAGT GCGACCCCGA AGGTTGTGCC GCCTGCTCGG CCACGCGTGA CCCCAAATAC
GGCTATCGCA ACGGCACCAT GTATTCCAGC TATACGGCGG CTTCGTTTGC CCGCATCTAC
GAACTGATGG ACCAGTACAA CGTGAACCTG CGCGGGGCCG TGAGCTGGTC GTTCGAGTTT
GAAGATCAGC CGTGGTTTGC CGGTTTCCGC GACCTGGCCA CACATGGCGT CGACAAGCCC
GTGCTGAACG TATTTCGGAT GTTTGGCAAG ATGAACGGCC AGCGGCTGTC GGTGCAAACC
AGCAAGGGGT TGGGTGCCGC CAACATCATT GCCAACGGTG TACGGGCCGA AAGCGACGTG
AACGCCATCG CCAGCAAAGG ACCTAACTCA GTTTGTGTGA TGGTCTGGAA TTACCACGAC
CATAACGTAC CTGGCCCTGC GGCTCCGGTA GAACTGACCG TGAGCGGACT CGGCAAAAAC
AAGGTACAGG TTCGCCATTA TCGGGTGGAT GATCAGCACA GCAATTCGTT CGAGGCATGG
AAGTCGATGG GTAGTCCACA GCAAGTTTCG ACCGATCAAT ATAAGTCGCT CGAAAAAGCG
GGACAGCTAC AGGAGCTGGC TCCTCCGACC TCGAAAGCTG TCGCCAATGG CAAAACAACA
CTAACCTTCG ACCTGCCCCG GCAGGGTGTG TCCTTTGTGC AACTGACCTG GTAA
 
Protein sequence
MKRVVLLYLL LFCCRIAFAQ QTPVSIQVDA AKPVGDMKPF WAFFGYDEPN YTTRKDGQKL 
LSELQQLSPV PVYVRAHNLL TSKGESPGPD LKWGFTDAYK EDANGKPIYN WVTVDSIVDT
YIKRGMKPLM EIGFMPKDLS SKPEPYAHTW SKNGNIWTGW TYPPKDYDKW RELVYQWGKH
SIDRYGKKEV STWLWEVWNE PDIGYWSGTF DEYCKTYDYA ADGLKRACPE CTIGGPHTTS
PRSDKAYSYL TRFIEHCLRG KNYATGKTGT PLQYIGFHAK GNPEIVDGHI RMNMGVQLKD
VERAFEAVNS FPELKNIPII IGECDPEGCA ACSATRDPKY GYRNGTMYSS YTAASFARIY
ELMDQYNVNL RGAVSWSFEF EDQPWFAGFR DLATHGVDKP VLNVFRMFGK MNGQRLSVQT
SKGLGAANII ANGVRAESDV NAIASKGPNS VCVMVWNYHD HNVPGPAAPV ELTVSGLGKN
KVQVRHYRVD DQHSNSFEAW KSMGSPQQVS TDQYKSLEKA GQLQELAPPT SKAVANGKTT
LTFDLPRQGV SFVQLTW
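
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. It assumes GC content is defined as (G + C) divided by total length, which may not be exactly how the database derives its 55% figure; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Assumption: GC content = (G + C) / total length.
    """
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from the record, as an example input.
first_line = (
    "ATGAAACGCG TAGTCTTACT TTACCTTTTA "
    "TTGTTCTGTT GCCGCATCGC TTTCGCGCAG"
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `clean_sequence` gives a string whose length can be compared with the listed gene length, and whose GC fraction can be compared with the listed GC content.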