Gene Slin_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2108 
Symbol 
ID8725846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2549102 
End bp2551054 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content54% 
IMG OID 
Productglycoside hydrolase family 43 
Protein accessionYP_003386942 
Protein GI284037012 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000141623 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGTT ACTTTAGCTT ATTTCTTATC CTGAGCACCT TGCTCTCGGG AGCCCCAGCT 
GTCCATGCAC AGCAGACGTC CAACTACACC CAGGTCAACA CCTACGTCAA CCCGGTACTT
CCCGGCGATC ATCCAGACCC AACCATGCTG CGCGTGGGGG ATGACTTTTA CCATTGCGGG
TCGAGTTTTC ATTTCACGCC CTATCTGCCC ATCTATCATT CCAAAGACAT GGTGCATTGG
GAGGTTATCA GCCGCGTAGT GCCGCCCACC GCTAGTTTTG TCTCCGACAG GCCTTCGGCG
GGTATCTGGC AAGGGGCCAT TACCTATTTC TACGGGTCGT ACTGGATTTA CTTCTCGGCC
AACGGGCAAT GGTTTTCCAA AGCCAGCAGC CCCAAAGGAC CGTGGTCGGA GCCGGTGCGG
GTGAAAGGTG ATCCCAAAAC GGGGCCACTG GGCTACGACA ATTCGATCTT TGTCGACGAC
GATGGCAAGC CGTACATGGT GATCAAGAAT GGTCAGAAAA CCAACCGATT GCAGGAGCTG
GGACGGGATG GCCAGCTGAC GGCCTCGGCC ATTGATCTCG ACTGGGTCAA CGCCAAACTA
CAATATAGCT GGGCGGAGGG GCCGGTGATG TGCAAGCGGA ATGGCTACTA CTACTATTTC
CCGGCGGGCG ACGTGTCGGG CGGGCAATAT GCCATGCGGG GCAAAGCCCT CACCGCCGAC
TCGACTCAAT GGGAACGGCT CGGCGATTTC TTCAAACCCA TCACCGATCC TAAAACGGGA
TTCCGTCGGC CCAACCACAT TTCGGCCCCG ATTCAGTTGG CCGATGGCAC CTGGTGGACA
ATTGGCCAAA GCTATGAAAA GCCGCTTGAA GGAGCCGATT GGTCGGGCAT GGGCAGGCAA
ACGGGCCTGT ATCAGGTTAT TTGGGAAGGA GACCGTCCGT GGGGCCTAGC GCCCACCACC
AAACCAGTGG CACGACCCAA TCTGCCCCAG TCGGGTATTC TGTGGCGAAG CGCACAGTCA
GACCGGTTCG ATAGCGACGT GCTTTCGCCG AACTGGCATT TTTTGAGCAA ACGGGTAGCA
GACCGGTATT CGTTGTCGGC GCGTAAAGGC TGGGTACGTC TCTCACCCGA CACCGCCCGT
GTGCACCTGA TGCAGAAAGA AACGGATCAT TATTATACCG CCGTCACCCG TGTCGACCTC
AACGCCGACG ATCCCGCTGC AAAAGCCGGT ATCTATTTGA CCAATGGCAA TCAGAAGGTG
TTTGCCCAAC TCTACAGTGG GTTCGATACG GGTAAAAAGA TCGTTTTCAG ACTCGATACG
GCCATCCGCA CGGTGCCCAA TTCAGTCGGC AGCGTTGTCT GGCTGAAAGT GGAACGGAAC
GAGCACCAGC TGACCGGCTA CTACAGCCGT GATGGAACGA AATGGACCGC CGTTGGGCCT
CCCATCAGTT CGGTAAGTCT GGACAAGGCC CAACCCAATT TCAATTCGTG GGTGGGTACC
AGTATAGGTT TATTTGCCGA AGGCAAACCC GCCGACTTTG ATTTTTTCAT CTGTAAAGAC
GGCTTTTCCA GCCTTCCGGC CGTGGGTTAC AGCAATTATT ATGGCGTCGA AAAAACAGGG
GTCACTGCCG GGTCTGGCGT TACCAATGAC TCCAGTCAGG GCGGCTGGTT TATGCTGTCG
GGCGTTGATT TAGGCATGGG TAACGACGCA GCTAAACAGG TTCAGCTCCA GGTATCGACT
ACTATGGAAG GAACGCTTGA GCTATGGCTC GACGATTTAA GTAACGGGAA AAAGATCGCC
ACCATCCCGT TTAAATCAAC TACTGGCAAA GCGACCTCGA AAACGGTAAG CCAATCCGTC
AAAGGGGTTT CAGGGCATCA CGATGTCTTT GTGAAATTTC CAGCTGGAAA AGCTAATATC
GTTACGGTGA AAGCGTTGCA ATTTCAAAAG TAG
 
Protein sequence
MNRYFSLFLI LSTLLSGAPA VHAQQTSNYT QVNTYVNPVL PGDHPDPTML RVGDDFYHCG 
SSFHFTPYLP IYHSKDMVHW EVISRVVPPT ASFVSDRPSA GIWQGAITYF YGSYWIYFSA
NGQWFSKASS PKGPWSEPVR VKGDPKTGPL GYDNSIFVDD DGKPYMVIKN GQKTNRLQEL
GRDGQLTASA IDLDWVNAKL QYSWAEGPVM CKRNGYYYYF PAGDVSGGQY AMRGKALTAD
STQWERLGDF FKPITDPKTG FRRPNHISAP IQLADGTWWT IGQSYEKPLE GADWSGMGRQ
TGLYQVIWEG DRPWGLAPTT KPVARPNLPQ SGILWRSAQS DRFDSDVLSP NWHFLSKRVA
DRYSLSARKG WVRLSPDTAR VHLMQKETDH YYTAVTRVDL NADDPAAKAG IYLTNGNQKV
FAQLYSGFDT GKKIVFRLDT AIRTVPNSVG SVVWLKVERN EHQLTGYYSR DGTKWTAVGP
PISSVSLDKA QPNFNSWVGT SIGLFAEGKP ADFDFFICKD GFSSLPAVGY SNYYGVEKTG
VTAGSGVTND SSQGGWFMLS GVDLGMGNDA AKQVQLQVST TMEGTLELWL DDLSNGKKIA
TIPFKSTTGK ATSKTVSQSV KGVSGHHDVF VKFPAGKANI VTVKALQFQK