Gene Slin_2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2106 
Symbol 
ID8725844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2543116 
End bp2544792 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content55% 
IMG OID 
Productglycoside hydrolase family 10 
Protein accessionYP_003386940 
Protein GI284037010 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.390174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.187838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCTC TAAAAACGAT AACCCTGCTA ACGCTCCTCA CGGCGCTGTC GGCGGCCTGT 
CGGACCGATA AAAGCGAGTT TGTGCTGGCT AAGCCCGAAA GCGTGGCCAC GCTGGAACCG
ATCAATAAGT ATGACGCACT GAAGACCTAT CTCAACACGG GCGACAATCC CAACTTTAAG
CTGGGGGCGG GCGTTGGATT GTCCGATTAC CTGGCGAAGG GCGTGGTGTA CCGGATGATC
AACAGCAACT TTAACGACAT CACCCTTGGT TACGAAATGA AGCACGGGGC CGTTGTCAAA
GCCGATGGTA AGCTGGATCT GACCAACGTG ACCAACCTGC TCAAGGCCGC TCAGGCGGCT
GGCGTTAGCG TGTATGGGCA TACACTCGCC TGGCACGCCA ACCAGAACGC GTCCTACCTC
AACAGCCTGT TGCTGACCGG TGTCGACTTC GACCCGGCAG ATAAACGCGT GAATTATGCG
AACGGCTCCT TCGAACAGAA CCAGACTGGG TGGAACTCCT GGGGTGGCTC AAGCACCCGC
GACATTATCA ACACGGGGCT GGTGGGCACC AAAAGCCTGC GTTTTACGCA CACATCCAAA
GCCAATCCCT GGGATGCCCA GATCGCCCTC GACTTCAGCC CGTCGCCAAT ACCCGTGGGC
GAGTACACCC TGTCGTTCTT CGTCCGGTCC GATGCGCCCG GCAAATTCCG CTGCTCAACG
GTGGGTAGCG GTGCCGACGT GCAGTACCAG CCCGACGTAA TCACGACCAC AACCTGGCAG
TATGTGGAAT GGGACATTAA ATCGGCGGGT ACCCTTTCGG CGCTACGCTA CGACATGGGG
ACCACACCCG GAACGTATTA CCTCGACGAG GTTCGGCTGA ATCCTAAGTC AACCCTTTAT
AAAAAGCCCG TTATCCTTCA GCTGAGCGCA GACGAAAAAA CGCGGATCAT CGGGGGCGCG
ATGGACAAAT GGATTTCGGA GATGGTGACC CAAACCAAGC CATATGTAAA GGCGTGGGAC
GTGGTCAATG AGCCTATGGA CGATGCCAAA CCCAGTACGC TGAAAACCGC AGCGGGTCGG
GCTGCTCTGG CCACGGACGA ATTTTACTGG CAGGATTACC TGGGCAAAGA TTACGCCGTT
CGGGCCTTTA AACTGGCTCG TCAGAATGGC AATGCAGACG ATATACTGTT TATCAATGAC
TATAACCTGG AATACGATCT GAACAAGTGC CAGGGATTGA TCGACTACGT CAGGTACATT
GAAGCCAACG GAGCTAAAGT AGACGGCATC GGCACGCAGA TGCACATGAG CCTCGATACC
AAACGGGAAA ATATCGATCA GATGTTTCGG CTGCTGGCGG CCACGGGCAA AATGATCAAG
ATCTCTGAGA TGGACATAGG TATTGGTACG GGAATCAAGA CAGCCGCAGC TACGCCAGCC
CAGTACAAAG CACAGGCCGA GTTATACGAG TACACGATCA AAAAGTACAT GGAGCTCGTA
CCGGCCAAAC AACGCTACGG CATCACGATG TGGAGCCCGA TGGATAGCCC CGACGGGTCC
TCGTGGCGGG CTGGTGAGCC AATTGGCCTC TGGAAACGCG ATTACACCCG AAAACAGGCT
TATGGTGGTT TCGCCAATGG GCTGGCGGGC CGGAATGTAA GCGCTGATTT TAAATAA
 
Protein sequence
MTSLKTITLL TLLTALSAAC RTDKSEFVLA KPESVATLEP INKYDALKTY LNTGDNPNFK 
LGAGVGLSDY LAKGVVYRMI NSNFNDITLG YEMKHGAVVK ADGKLDLTNV TNLLKAAQAA
GVSVYGHTLA WHANQNASYL NSLLLTGVDF DPADKRVNYA NGSFEQNQTG WNSWGGSSTR
DIINTGLVGT KSLRFTHTSK ANPWDAQIAL DFSPSPIPVG EYTLSFFVRS DAPGKFRCST
VGSGADVQYQ PDVITTTTWQ YVEWDIKSAG TLSALRYDMG TTPGTYYLDE VRLNPKSTLY
KKPVILQLSA DEKTRIIGGA MDKWISEMVT QTKPYVKAWD VVNEPMDDAK PSTLKTAAGR
AALATDEFYW QDYLGKDYAV RAFKLARQNG NADDILFIND YNLEYDLNKC QGLIDYVRYI
EANGAKVDGI GTQMHMSLDT KRENIDQMFR LLAATGKMIK ISEMDIGIGT GIKTAAATPA
QYKAQAELYE YTIKKYMELV PAKQRYGITM WSPMDSPDGS SWRAGEPIGL WKRDYTRKQA
YGGFANGLAG RNVSADFK