Gene Slin_4791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4791 
Symbol 
ID8728555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5840152 
End bp5843496 
Gene Length3345 bp 
Protein Length1114 aa 
Translation table11 
GC content54% 
IMG OID 
Producttrehalose synthase 
Protein accessionYP_003389568 
Protein GI284039638 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0872883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACG AAAGCGTAGA ACAACTGGAT AATCTCCGCT GGTATAAAGA TGCCATCATC 
TATGAATTGC ACATCAAAGC GTTTTGTGAT GGTAATGGCG ATGGCATTGG TGATTTTCAG
GGCTTACTCG AAAAGCTGGA TTATTTGCAG GAACTGGGCG TAACGGCCAT TTGGCTGCTG
CCTTTTTACC CGTCGCCCCT GCGCGATGAT GGATACGATA TTGCCGACTA TTACACCATT
AACCCGTCGT ATGGCACCAT CGAGCAGTTT AAAACGCTCC TTCGGGAAGC GCACCAGCGT
AATCTGAAAG TAATTACCGA GCTGGTTATC AATCACTCAT CCGACCAGCA CCCGTGGTTT
CAGCGTGCCC GACGTGCACC CAAAGGCTCG CCCGAACGGG AGTACTATGT CTGGACCGAC
GACCCGACGC AGTTCAAGGA TGTACGTATC ATTTTTCAGG ATTTCGAAAC CTCTAACTGG
ACCTGGGATC AGGAAGCGCA GCAGTATTAC TGGCACCGCT TTTTCCACCA CCAGCCCGAC
CTTAATTACG ACAATCCGCT GGTGCAGGAC GAGGTGTTCA AAATGATTGA TTACTGGTGC
GAACTGGGCG TGGATGGGTT TCGGCTCGAT GCCGTTCCTT ACCTGTTTGA GCGGGAAGGG
ACCAACGGCG AAAACCTGCC CGAAACACAT GCCTTCCTCA AAAAACTGCG TAAACACGTC
GACGATCATT TCCCGGGCGT CGTCTTTCTG GCCGAAGCCA ATATGTGGCC CGAGGATTCG
GCGTCGTACT TCGGCGATGG CGACGAGTGC CACATGAACT ACCATTTTCC GGTCATGCCG
CGTATGTTCA TGTCGCTGCA AATGGAAGAC CGGTATCCCA TTACCGACAT TTTCGATCAG
ACGCCCGCCA TTCCAGACAG CTGCCAGTGG GCTATTTTCC TGCGCAACCA CGATGAGCTG
ACCCTCGAAA TGGTGACCGA CGAAGAGCGG GATTATATGT ACAAGACGTA CGCCAAAGAC
CCGAAAGCGA AGATTAACCT CGGCATCCGG CACCGGCTGG CACCGCTCAT GGGTAACAAC
CGGAAGCGGA TCGAACTGCT GAATAGTCTG CTGTTTTCGC TACCCGGCAC GCCCGTCATC
TACTACGGCG ACGAGATCGG CATGGGCGAC AACGTGTATC TGGGCGACCG GGACGGCGTG
CGAACACCCA TGCAGTGGTC GCCGGACCGC AATGCGGGTT TCTCGACAAC TAATCCGCAA
AAACTCTACC TGCCCACTAT TCTCGACCCG GAATACCACT ATGAGGCCGT AAACGTCGAA
AACCAGCGCG GCAATACCTC GTCGTTGTTC TGGTTCATGA AGCGGATGAT CAACCTGCGC
AAACAGTACA AGGCGTTTGG GCGGGGCGAC ATGAAATTCC TGAACGTCGA AAACCCGAAA
GTACTGGCCT TTACCCGAAC CTACGAAGAC CAGACGCTGC TCATCGTGGT GAATCTGTCG
AAGTACGCGC AACCGGCAGA GGTGGAGTTG AGCGGTTTTT CGGGCTATGT GCCGGTGGAA
GCGTTCAGCA AGAACCCCTT TCCGACGATT TCTAACACCG AAACGTATTT CTTTACCCTG
GCTCCGCACG ATTACCAGTG GTTTGTGCTT GAAAAAGCGG CTTCCGAAGC GGCCCGCGTG
TTCCAGCTGC CCGGCGTTCG TGTCAACGAC TGGAATGAGT TAATGAGCCA GAACACGCGC
ATGATGCTGG AAACGAAAGT GCTGCCGGAC TATCTCCTCC GGGTAGACTG GTTCGATGAT
AAAAAGCAGA CCATGCGTGG GGTGTCCATT TTGCGGAACG GTATACTGCC GCTGGCGGAT
AGCACGGCCT ATGTGCTGCT GCTTGAGGTG TCGTACGAGC GGGGCCTGCC CGAGTTGTTT
CAACTGGTGG TTGCCTTTGC TAAAGAAGCA TCGGCCGAAA AACTGATCGC CAATTGCCCG
CAGGCCGTGC TGGCCAACAT GGAAGTTGGC GACGCGTCCG GCGTTCTCTG CGACGGTATC
TATCTGACCG ATGTTCAGTT GGCTTTACTG CAGAACATGA GTGGGCCGAA ACAAAGTGGG
GTGCGTAATC TGGAGTTCCA GCATACGCCA AAGTTCGACG AGTACGTGCG CAACCACAGT
GAGGTAAAGC CCAAACTGAT GCCCGTCAAT CCCGGCTATG TGTCCATCAG CTACGACCAG
TGCTATCTGC TGAAGCTGTA TCGGCAGGTA GAAATGTCGG TCAACCCCGA CACCGAACTC
ACGCGCTTCC TGTCTGAAAC GGCCAATTTC GAATACGTAC CTGCCTTTGC CGGGTCTATT
GAACTGAGCA CGACCGAAAA ACCCGTTATG CTGGGCACCA TGCAGGAGCT GGTAGCCAGT
CATGGCGATG GAAAACGGTA TGTGCTGGAG CGGATCAATA ATTTCATCGA ACGGATTCTT
GCCCGCAATA AAACCCAGCT GGCTGCGGCC ATGAATGTGC CGGTCGGTAG CCTGAGCAAC
CCCATCCCGT TTGAGGATTT GCCTGTCGAG ACAAGAGAGT TGATCGGTCA GCGGTCGGCC
GATCAGTCAC GCTTACTGGG CACCCGCATC GGACAAATGC ACCTGGCGCT GGCATCCAGC
AAAAACCTGA AAGAGTTTGC GCCGGAAGAG TTCTCCCTGC ACTACCAGCG GTCGCTTTTC
TCGGGCTTAC AGTCGCTGGT ACGGGAAAGT TACCAGACGC AGAAACGGAA CGTGCAGCGG
CTTCCGGAGG GCGTACGGCA GGAGGTAGAA CAAATGCTGG AACGGAAAGA GGACGTGCTG
AATACCCTCA AACGGATTTA TGACCATAAG CTGGAAACCA CGAAAATCCG GAGTCATGGT
GATTTGCAAC TGGAAAAAAT CCTGCTCACC GGGAAGGATC TTGCTATTCA GGATTTCGGT
GGTGACCCCA GCCGGAGTTA TAGCGAACGT CGGCTGAAGC GGTCGCCCCT GCGCGATGTG
GCGGCCATGA TTCGCTCGTT CTATTACGTA GGTTATCAGG GCTTTCTGGA AAACAATCAG
GTGCCTAAAG AAGAGACGGT GAAGCTGTTG CCGTATGCCG GATTCTGGGC GCATTACATG
AGCAGTTTCT TCATGAAGGC CTATCTGGAA ACGGTTCAGG GCAGTTCGTT CATCCCGAAA
AACACCGACG ACCTGCAAAT GATGCTCGAA ACATACCTGC TCGAAAAGGC TATTTCAGAC
TTCAACCACG AGCTGAACTA CCGACCCGAC TGGGTGCATG TGCCGCTGCA AATTATTAAG
TCTATCGTTG TTTCGCCCGA AGTCGCCGTG CCGGAACTGG CGTAG
 
Protein sequence
MMNESVEQLD NLRWYKDAII YELHIKAFCD GNGDGIGDFQ GLLEKLDYLQ ELGVTAIWLL 
PFYPSPLRDD GYDIADYYTI NPSYGTIEQF KTLLREAHQR NLKVITELVI NHSSDQHPWF
QRARRAPKGS PEREYYVWTD DPTQFKDVRI IFQDFETSNW TWDQEAQQYY WHRFFHHQPD
LNYDNPLVQD EVFKMIDYWC ELGVDGFRLD AVPYLFEREG TNGENLPETH AFLKKLRKHV
DDHFPGVVFL AEANMWPEDS ASYFGDGDEC HMNYHFPVMP RMFMSLQMED RYPITDIFDQ
TPAIPDSCQW AIFLRNHDEL TLEMVTDEER DYMYKTYAKD PKAKINLGIR HRLAPLMGNN
RKRIELLNSL LFSLPGTPVI YYGDEIGMGD NVYLGDRDGV RTPMQWSPDR NAGFSTTNPQ
KLYLPTILDP EYHYEAVNVE NQRGNTSSLF WFMKRMINLR KQYKAFGRGD MKFLNVENPK
VLAFTRTYED QTLLIVVNLS KYAQPAEVEL SGFSGYVPVE AFSKNPFPTI SNTETYFFTL
APHDYQWFVL EKAASEAARV FQLPGVRVND WNELMSQNTR MMLETKVLPD YLLRVDWFDD
KKQTMRGVSI LRNGILPLAD STAYVLLLEV SYERGLPELF QLVVAFAKEA SAEKLIANCP
QAVLANMEVG DASGVLCDGI YLTDVQLALL QNMSGPKQSG VRNLEFQHTP KFDEYVRNHS
EVKPKLMPVN PGYVSISYDQ CYLLKLYRQV EMSVNPDTEL TRFLSETANF EYVPAFAGSI
ELSTTEKPVM LGTMQELVAS HGDGKRYVLE RINNFIERIL ARNKTQLAAA MNVPVGSLSN
PIPFEDLPVE TRELIGQRSA DQSRLLGTRI GQMHLALASS KNLKEFAPEE FSLHYQRSLF
SGLQSLVRES YQTQKRNVQR LPEGVRQEVE QMLERKEDVL NTLKRIYDHK LETTKIRSHG
DLQLEKILLT GKDLAIQDFG GDPSRSYSER RLKRSPLRDV AAMIRSFYYV GYQGFLENNQ
VPKEETVKLL PYAGFWAHYM SSFFMKAYLE TVQGSSFIPK NTDDLQMMLE TYLLEKAISD
FNHELNYRPD WVHVPLQIIK SIVVSPEVAV PELA