Gene Slin_4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4039 
Symbol 
ID8727797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4854062 
End bp4856497 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content57% 
IMG OID 
Productglycoside hydrolase family 2 sugar binding protein 
Protein accessionYP_003388828 
Protein GI284038898 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.115808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA AGCTGCTGGC TTTAAGTGTA CTGATCGCCA GTCAATCGCT GGCCCAGCCC 
GGCCCAACCC GAACGCGCCT TTTGGATGCC GACTGGTATT TTTTTAGAGA CAGTACCGCT
GCTGGCCAGC AACCTGCCTT CAAGGAGTCA ACCTGGCGAA AGCTGAGCCT GCCGCATGAC
TGGAGTATCG AGGATTTATC CCACCAGTCG CCGGATCAGG TAATGGGCCC ATTTTCGCGG
GCGAGTGTGG GTACTACTTC CACTGGCTAC ACCGTTGGCG GAACGGGCTG GTATCGTAAA
ACATTTGTCC TGAATCGCTA CGACGCCGGT AAAATAGCCC GTATTCAGTT TGATGGTGTG
TACATGGAGT CTGATGTGTG GCTCAACGGC CACCATCTCG GGTATCACCC GAACGGTTAC
ACCTCATTCA GCTACGAGCT GACACCCTGG CTCCTGCCCA CGGGACAAGC CAACACACTG
GTTGTTCGAG TGAAAAACCT CGGCCAGAAT ACTCGTTGGT ATACGGGCTC GGGCATCTAC
CGGCACGTGT GGTTAATCCT GACCGAAGCC GTGCACATCG CGCCGATGGG CGTAACGATT
ACGACCCCGC AGGTGTTGGC CCGGTCTGCT CAGATAGTGG TCAGGACACA AATTGACAAT
ACGCAAAACG CAGCTTTACC CGTTCGGGTC ATAACAACCC TGGTCAAGCC AAATGGGCAG
ATGGCAGGGA GAGATGAGCA GGTATTGACG GTAGCGGGCA ACGGAAACGC CGAACGTTCG
CAAACCCTGA TAGTAGCTAG TCCGGCAGTA TGGTCGCCCG AATCGCCATC GCTATACCGG
GCCAACGTGG TGCTGGTGTC CGGTACGAAA CGGTTGGATA GTGTAACCAC CTCCTTTGGG
ATTCGCTCCG TTGAGTTCGA TGCCAGGCGC GGCTTTGTTC TGAACGGCAA ACGGCTTTTG
CTCAAAGGTG GCAGTGTTCA CCACGACAAT GGCCCGTTGG GGGCCAGCGC CTTTGACCGG
GCCGAAGAAC GAAAAGTCGA ACTGCTGAAA GCCAATGGTT TCAACGCCGT TCGCACCAGC
CATAACCCAC CGTCGCCCGC TTTTCTGGAC GCCTGCGACC GGCTGGGTTT GCTGGTCGTT
GAAGAAGCTT TCGATATGTG GCAGCGTCCC AAAAAACCAC AGGACTACCA CCTTTTCTTC
GACCAGTGGT GGCGTACCGA CTTACGGGCC ATGATCGAGC GCGACCGCAA CCACCCGTCC
GTATTCTTGT GGAGCATCGG CAACGAAATC AACGAACGAG CCGACCCGTC GGGACTGGTG
TTGACGAAAC AGTTGGCCGA TGAAGCACAC CGACTCGACC CCAGCCGCCC GATTATGGAG
GCTATGTGTG TGTTCTGGGA GCATCCCGGC AAGGTGTGGG AAGACGGCGA CAAAGCCTTT
GCGCTGTTGG ATGTAGGTGG CTACAACTAC GAATGGAAGC ACTACGAGTC AGACCACCAG
CGCCATCCGG ACCGGGTCAT GCTCGGCACG GAGTCTTTTG CCCGCGAAGC CTACCAAAAC
TGGCAGCAGG TAGAAAAGCA CCCGTATGTG CTTGGCGATT TCGTCTGGAC GGTTATGGAT
TACATGGGCG AAACGGCCAT CGGCCATGCG TTGATCCAGC CCAAAGCCGA GAAAGACAGT
GTGAAGGCGG TGTTGCCGTG GCCGTGGTTC AATGCTTGGT GTGGCGATCT GGATTTGATC
GGTACGAAAA AGCCGCAGTC GTACTACCGC GATGTTGTCT GGCGCAACAG CCCCGTCGAG
ATGGCCGTTC ATAGCCCTAT CCCCGACGGT ATGAAAGAAA CCGTTACTAA CTGGGGCTGG
CCCGATGAGC ATCAGAGCTG GTCGTGGCCG GGGCAGGAGG GGAGACTCTT TCAGGTGCGT
GTCTTTTCGC GCAGTCCGCT GGTTCGGCTG GAACTCAATG GAAAACTGGT GGGGGAACAG
CAACTGGCAG ATACCACCAT TACGGCTTCG TTTACCGTGC CTTACCAGCC GGGCGTGCTC
AAGGCAACGA GCTTTGCCAA CGGTAAATTA ACGGGATCGG TAACCTTCCG TACAACGGGG
AAACCCCACC ACCTTCGGCT CAGAACCGAC CGCCCGGCCA TACACCCCGC CCGCCATGAC
CTGGCCTATG TGACGGTTGA GGTGGTAGAC GATCAGGGGC AGGTAGTACC CTGGCAGGAT
GTGCCCATCG CCTTTCAGCT CACCGGTGCT GGTGTACTGG CGGGGGTAGG CAACGGAAAC
CCAACGGATG TAAGCAGCTT TCAACAGCCC CGGAAGTCAA CGTTTCGGGG GCGCTGTCTG
GCCATTGTGA GGTCAGGAGG AAAGCCCGGC ACGATCACCT TCGAAGCGAG CAGCCCAACG
CTTGGAACGG CTCAGCTGAC CTTACAGGTG AACTAA
 
Protein sequence
MKIKLLALSV LIASQSLAQP GPTRTRLLDA DWYFFRDSTA AGQQPAFKES TWRKLSLPHD 
WSIEDLSHQS PDQVMGPFSR ASVGTTSTGY TVGGTGWYRK TFVLNRYDAG KIARIQFDGV
YMESDVWLNG HHLGYHPNGY TSFSYELTPW LLPTGQANTL VVRVKNLGQN TRWYTGSGIY
RHVWLILTEA VHIAPMGVTI TTPQVLARSA QIVVRTQIDN TQNAALPVRV ITTLVKPNGQ
MAGRDEQVLT VAGNGNAERS QTLIVASPAV WSPESPSLYR ANVVLVSGTK RLDSVTTSFG
IRSVEFDARR GFVLNGKRLL LKGGSVHHDN GPLGASAFDR AEERKVELLK ANGFNAVRTS
HNPPSPAFLD ACDRLGLLVV EEAFDMWQRP KKPQDYHLFF DQWWRTDLRA MIERDRNHPS
VFLWSIGNEI NERADPSGLV LTKQLADEAH RLDPSRPIME AMCVFWEHPG KVWEDGDKAF
ALLDVGGYNY EWKHYESDHQ RHPDRVMLGT ESFAREAYQN WQQVEKHPYV LGDFVWTVMD
YMGETAIGHA LIQPKAEKDS VKAVLPWPWF NAWCGDLDLI GTKKPQSYYR DVVWRNSPVE
MAVHSPIPDG MKETVTNWGW PDEHQSWSWP GQEGRLFQVR VFSRSPLVRL ELNGKLVGEQ
QLADTTITAS FTVPYQPGVL KATSFANGKL TGSVTFRTTG KPHHLRLRTD RPAIHPARHD
LAYVTVEVVD DQGQVVPWQD VPIAFQLTGA GVLAGVGNGN PTDVSSFQQP RKSTFRGRCL
AIVRSGGKPG TITFEASSPT LGTAQLTLQV N