Gene Slin_2687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2687 
Symbol 
ID8726437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3252495 
End bp3254528 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content52% 
IMG OID 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003387502 
Protein GI284037572 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.547616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.221922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCTTT TTAACAAGCG TTTTTCGTAC CACTATCTGG TCCTTTCAAC GCTTACGCTG 
GCTACTTCGG GCGTGCTGGC CCAGAAACCA ACGCAGCCTG CTCTAGGCTC GCGGTCGGTC
AAACCACTAA CCGTTAACGG CTTTTCATTC AAAGACCTGA ATAAAAACGG AAAACTTGAC
AAATACGAAG ACTGGCGCTT ACCCACCGAA CAGCGCGTAC AGGATTTGAT TGGCCAGATG
ACCCTTGATG AAAAGATAGG CTTCATGCTG ATCAGCACAT CGCGAATGGC CGGCGATTTT
TCGTTTCAGC AGGGGGCTCC AAAAGCCGAA ATCACAAGTG GCTTTAATGA AGAAGACCAG
GTTCAGAGCA TGAATATGTT CACCCGGAAG CCACTCCCCT ACCCGATGAT GATGGCCGCC
GGAACAACCA AAGCCGTAAC GCAGAACCAG CTACGCCATT TTATTCTGCG GGCCAACACG
TCGGCGAAAA CCATGGCCGA ATGGCATAAC AATTTACAGG CGCTCTGCGA AAACTCCCGT
CTGGGCATTC CGGCTATTGT AGCGTCCAAC CCGAGAAATC ACATCACCAC CGATGCCGCT
GTTGGGCTTA GCGTTGGCAC AACGGTATTC TCAAAGTGGC CCGGCGAATT GGGTCTGGCG
GCCATGCGCG ACTTAAAACT TACCCGCGAA TTTGCCGACA TTGCCCGGCA GGAATGGGCG
GCTGTGGGGC TGCGCAAAGG CTATCAGTAC ATGGCCGACT TAGCAACCGA ACCGCGCTGG
CAGCGTATTG AAGGCACATT TGGCGAAGAT GCCGATCTAG CCGCCAACAT GACCCGCGAA
ATAGTACTCG GCTTTCAGGG ACCCAAGCTG GGCCTCCACT CCGTAGGACT TACCACCAAG
CACTTCCCCG GTGGCGGACC GCAGGTAGAG GGGCAGGACC CGCATTTCGA CTGGGGAAAA
GATCAGCATT ACCCCGGCAA CATGTTCGAG TATCACCTCA AGCCATTTCA GGCCGCCATT
GATGCCGGCA CATCGTCCAT CATGCCTTAC TACGCCAAAC CCATCGGCAC AAAATATGAA
GAGATAGCTT TTGCTTATAA TAAAGCCATT ATCAAAGATT TACTTCGCGG CAAAATGGGC
TTTCAGGGCA TTATCAACTC CGACACGGGG CCTATTGAAA TGATGCCTTG GGGCGTTGAG
AAGTTAAGCA TCGAGGAACG ATACCAGAAG GCTATCGAGT GCGGAGTTGA TTTGTTTTCC
GGTTCTGCCG ATCCCTCGCT GCTGATGTCG ACCGTAAAAA AAGGACTCGT GACTGAAAAG
CGGATCAACG AATCCGTAGC CCGGTTACTG CGTGAGGAAT TCGCGCTGGG CCTGTTTGAA
AACCCATACG TCGACCCGGA GGTTGCACAG AAAACGGTTG GAAAACCCGA GTTTCAGCAA
CGGGCCGATC TTGCTTTCCG GAAATCCATT GTGCTGCTGC GCAATTCGGG AAAACTGCTT
CCGCTGGCCC CAAAAACCAA AGTCTTTATT GAGTCATACT ACGACAATGG CCGCTCTAAA
GAGCCTATTA CGGTAATCAA ACCTGCAACG AACAACTGGA ATCTGGAGTT TGTCGGTAGC
AAAGAAGAAG CCGATGTTGT GGTGCTGATG CTGACGCCCA GCAGCGGTAG TTTATTCAGC
TCGAACGGCG GGCCAATTGA GTTGCAACTG TCAAAAAACA AGATCGACGT AAAGCACGTC
AATGAAGTAA CCAGTCAGAA ACCAACCGTT GTCCTGATCA ATTACACGAG TCCGTGGGTG
ATCGACGAAA TTGACAATCC AAACCTCAAA ACGGTACTGG CAACGTTTGG CACCACCCCC
GACGCCATTC TGGACGTGCT GAGCGGGAAG TTCAACCCGA CCGGCAAGAT GCCGTTCAGC
ACTCCCGTTT CCCGACAGGC CGTTCTCGAC AACCAATCCG ACGTGCCGGG CCATATGAAG
CAAAAAGGCT ATGCGCTGTT CACCTTTGGC GATGGACTGA GCTACCCGAA CTAA
 
Protein sequence
MHLFNKRFSY HYLVLSTLTL ATSGVLAQKP TQPALGSRSV KPLTVNGFSF KDLNKNGKLD 
KYEDWRLPTE QRVQDLIGQM TLDEKIGFML ISTSRMAGDF SFQQGAPKAE ITSGFNEEDQ
VQSMNMFTRK PLPYPMMMAA GTTKAVTQNQ LRHFILRANT SAKTMAEWHN NLQALCENSR
LGIPAIVASN PRNHITTDAA VGLSVGTTVF SKWPGELGLA AMRDLKLTRE FADIARQEWA
AVGLRKGYQY MADLATEPRW QRIEGTFGED ADLAANMTRE IVLGFQGPKL GLHSVGLTTK
HFPGGGPQVE GQDPHFDWGK DQHYPGNMFE YHLKPFQAAI DAGTSSIMPY YAKPIGTKYE
EIAFAYNKAI IKDLLRGKMG FQGIINSDTG PIEMMPWGVE KLSIEERYQK AIECGVDLFS
GSADPSLLMS TVKKGLVTEK RINESVARLL REEFALGLFE NPYVDPEVAQ KTVGKPEFQQ
RADLAFRKSI VLLRNSGKLL PLAPKTKVFI ESYYDNGRSK EPITVIKPAT NNWNLEFVGS
KEEADVVVLM LTPSSGSLFS SNGGPIELQL SKNKIDVKHV NEVTSQKPTV VLINYTSPWV
IDEIDNPNLK TVLATFGTTP DAILDVLSGK FNPTGKMPFS TPVSRQAVLD NQSDVPGHMK
QKGYALFTFG DGLSYPN