Gene Slin_1747 details


Gene Information

Locus tag: Slin_1747
Symbol:
ID: 8725484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2101105
End bp: 2103441
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: glycoside hydrolase family 65 central catalytic
Protein accession: YP_003386591
Protein GI: 284036661
COG category:
COG ID:
TIGRFAM ID:
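
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2101105
END_BP = 2103441


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and
    which (by default) ends with one stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 2337
    print(protein_length(length))  # 778
```

Under these assumptions, 2103441 - 2101105 + 1 gives 2337 bp, and 2337 / 3 = 779 codons, one of which is the stop codon, leaving 778 amino acids.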


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATT ATATAACTCA GGACCCCTGG AATATTGTTG AAGACGGGTT TCACCCCGAT 
TTTAATGAAA TTACGGAAAG TGTGATGTCG CTCGGCAATG GCCGGCTTGG TCAGCGGGGG
AACTTTGAAG AGAAATTCAC CGGTAAATCC CTACAGGGCA ATTATGTAGC GGGGGTGTAT
TACCCCGATA AAACAAGGGT GGGCTGGTGG AAAAATGGCT ATCCTGAATA CTTCGCTAAA
GTGCTCAATG CCGCTAACTG GATTGGTATC GACATTGATA TTGACTACGA AAGTCTGGAT
CTCAACCACT GCGAGGTCCG CAATTTTCGC CGGGTGCTCA ACATGCAGGA GGGATACCTC
GAACGCTCGT TTGTGGCCGT ACTCAAAAGC GGTAAAGAGA TTCAGGTGAA TGCCAAACGC
TTCTGCTCCA TTGTTGACGA CGAAGCTGGT GCCATTCGCT ACGCTATCAA ACCCCTGAAC
TTCGAGGCCA AGATTACCAT TACGCCCTAC ATCAACGGCG ATATTCGTAA CCGCGATGCC
AATTACGACG AAACATTCTG GGATGAAGTT CGAAAAGAAA CGGGTTATGG CGAAGCGTAT
ATCGAACTCC GCACCCGTAA AACGGGCTTT CATGTCGCTA CCGGCATGTG CGTTGAAATC
GAGCAGGATG GCGTAAAGGT CGACTATCAG TCGCAGCCCA TAAAGTACGA AAAGTACGTG
GCTAACCGGA TGACACTCGA CTGCCGGAAG GGGCAGGAAA CGGTTATTTA TAAATATGCC
GTAAACCTCT CGTCGCTCAA CTATGACCCC GATACGATCA TAAAGGATGC GCATCAGTAC
ATCCAGCGGA TTACCCGGAA GGGGTTCGAA AAAATGCTGT TCGAGCAGAA ACAGGCCTGG
GCCGACAAAT GGAAAACCAA CGATATCATT ATTGAGGGTG ATATTGCCGC TCAGCAGGGC
ATTCGGTTCA ATATATTTCA GCTGAACCAG ACCTATACGG GCGAAGATGA GCGATTGAAC
ATTGGGCCGA AAGGGTTTAC GGGCGAAAAA TACGGCGGGT CGACCTACTG GGATACCGAA
GCCTACTGCC TGCCGTTTTA CCTCGCTACC GCCGACCAGA AAGTGGCGAA AAACCTGCTG
GTTTACCGCT ACAAGCAACT GGGTAAAGCC ATCGAGAATG CGCAGAAACT CGGATTCCGG
GCCGGGGCGG CTTTGTACCC CATGGTAACC ATGAACGGCG AAGAGTGTCA TAACGAATGG
GAAATTACCT TTGAGGAAAT CCACCGCAAT GGCGCTATTG CCTATGCCAT CTTCGACTAC
GTTCGCTACA CCGGCGATGA GCAGTATCTG GTCGATTATG GCCTCGAAGT CCTCATTGCC
ATCAGCCGGT TCTGGAGCCA GCGGGTCAAC TGGTCGAAGG AGAAAGAGAA GTACGTAATG
CTGGGCGTTA CCGGACCCAA CGAGTACGAA AACAACGTCA ACAACAACTG GTACACGAAC
TATATTGCCG CCTGGACGCT CCGGTACACC ACCGAAGCCG TGGCCAGAGT GAAAGCGCTC
GATTCAGATA AATACGCGGA CCTGATTGAC CGGATTCACT TCCGCGAAGA TAAGGAACTG
GCCACGGCCC GGCAGATTAT TGATAAGATG TACCTGCCTG CTGATGCTGA AAAAGGCGTT
TTCCTGCAGC AGGAAGGCTT TTTGGATAAG GACCTGATGC CCGTTTCGGA CATTCCGAAG
GGACAGCGGC CCATCAACCA GAACTGGTCG TGGGACCGTA TTCTGCGGTC GTGCTTCATC
AAGCAGGCCG ATGTATTGCA GGGACTTTAT TTCTTCGAAG ACGAGTTCGA CACGGAGACC
CTGCGCCGGA ATTTCGATTT CTACGAGCCG ATGACGGTGC ATGAATCGTC GCTCTCGCCC
TGCGTTCACT CCATTCAGGC CTCGAAACTG GGCATGAAAG AGAAGGCCTA TGAAATGTAC
CTGCGAACCG CCCGGCTTGA CCTCGACGAC TATAACAACG ATACGGAAGA CGGGTGCCAC
ATTACAAGCA TGGCCGGTAC GTGGCTGGCG GTTGTAAAAG GATTCGGTGG TTTGCGAATT
GAACAGGCCG ACGGGGCCGA GCCTCGGGTC GTTCTGAACC CTTACTGCCC GGATAACTGG
CAGTCGCTGG CGTTTAAAAT CCGGTACCGG GGCGTACTCT TGCAGGTAAC AACCACGCAG
CAGGATGTTA CGGTGGAAAA TTTCTCGGCA CAACCCATCA CAATACACCT TCTTGGTGAA
CAAGTGGTAA TTGGTGCCGA GAGTCAGCAG ACCGTGTCTG TGAAAATAGA AGCCTAA
 
Protein sequence
MKNYITQDPW NIVEDGFHPD FNEITESVMS LGNGRLGQRG NFEEKFTGKS LQGNYVAGVY 
YPDKTRVGWW KNGYPEYFAK VLNAANWIGI DIDIDYESLD LNHCEVRNFR RVLNMQEGYL
ERSFVAVLKS GKEIQVNAKR FCSIVDDEAG AIRYAIKPLN FEAKITITPY INGDIRNRDA
NYDETFWDEV RKETGYGEAY IELRTRKTGF HVATGMCVEI EQDGVKVDYQ SQPIKYEKYV
ANRMTLDCRK GQETVIYKYA VNLSSLNYDP DTIIKDAHQY IQRITRKGFE KMLFEQKQAW
ADKWKTNDII IEGDIAAQQG IRFNIFQLNQ TYTGEDERLN IGPKGFTGEK YGGSTYWDTE
AYCLPFYLAT ADQKVAKNLL VYRYKQLGKA IENAQKLGFR AGAALYPMVT MNGEECHNEW
EITFEEIHRN GAIAYAIFDY VRYTGDEQYL VDYGLEVLIA ISRFWSQRVN WSKEKEKYVM
LGVTGPNEYE NNVNNNWYTN YIAAWTLRYT TEAVARVKAL DSDKYADLID RIHFREDKEL
ATARQIIDKM YLPADAEKGV FLQQEGFLDK DLMPVSDIPK GQRPINQNWS WDRILRSCFI
KQADVLQGLY FFEDEFDTET LRRNFDFYEP MTVHESSLSP CVHSIQASKL GMKEKAYEMY
LRTARLDLDD YNNDTEDGCH ITSMAGTWLA VVKGFGGLRI EQADGAEPRV VLNPYCPDNW
QSLAFKIRYR GVLLQVTTTQ QDVTVENFSA QPITIHLLGE QVVIGAESQQ TVSVKIEA
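
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be cleaned up, translated, and summarized is given below. It assumes the codon assignments of the standard genetic code, which translation table 11 shares for all 64 codons (the two tables differ in their permitted start codons). The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering used for
# genetic-code tables; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = "ATGAAAAATT ATATAACTCA GGACCCCTGG"
    print(translate(first_blocks))  # MKNYITQDPW
```

The first thirty bases translate to MKNYITQDPW, which matches the first block of the protein sequence in this record. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.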