Gene Slin_6190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6190 
Symbol 
ID8729973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7497696 
End bp7499618 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content57% 
IMG OID 
Productglycoside hydrolase family 2 sugar binding protein 
Protein accessionYP_003390948 
Protein GI284041018 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTTA ATCAAACTCT ATTCACGCCA TTAGGGCTCC TGCTGCTGAC TACGGGCCTT 
GCCGTTGCGC AATCGCCCGC CCAGAATACG CCCCGAACGG CCCCCATCAT GAGCCGGTGG
GAGAAGCAGC TAACGCCGGA GAACGCCTGG CGTGAATACC CGCGTCCGCA GATGGTCCGC
AAACAATGGC AGAACCTGAA CGGTATGTGG GACTATGCCA TTACGGCCAA AACCGCTCCC
CAGCCAACCG ATTTTTCGGG GCAGATTCTC GTACCCTTCA GCGTAGAGTC GACGGTGTCG
AAAGTCAATA AATCACTCAC TGCCGACCAG CGTTTGTGGT ACCGTCGTAC CGTTAGCGTA
CCCGCCGACT GGGCCGGACA ACGGGTGCTG CTGCACTTCG GCGCGGTAGA TTATGAATGC
AGCCTGTGGG TCAATGGCGG GCTGGTGGGT TCGCATACCG GCGGTTTGGA TGCGTTTAGC
TTCGACATTA CCGATTACCT GAAAGACGGC CAGAATCAAC TCGTACTGGG CGTACTCGAC
CCTACCTCAA CGGGCGAACA GCCACGCGGC AAACAGTTGA TGAACCCGAA TGGCATCTGG
TACACGCCCG TGTCGGGCAT CTGGCAAACG GTCTGGATGG AGCCGGTGCC GAAGCAGACC
TACATCGAAG AGGTGAAACT CACCCCCGAA CTGGATTCGG GCCGGGTTCG GGTCGATGTG
CTGCTCGACA AACCCGCTAA CAACTACACC ACTGCCATTC GATTGACGGC GATGGACGGC
AACCGAACCG TTGCCACAAC GCTCGTTCGT GCCGGGCGGA CGGGGTACCT TTCGGTGAAA
AACCCCAAAC TCTGGTCGCC CGATAGCCCG TTCCTGTATG ACTTTAAAGC CGAACTGGTT
ACCGTAACGG ACCCTTTTGG CGATACGCCC CGCAATAAAC GCCCGCGTCA GGATGATGCC
CTGAACAAGG CCTTCGCCGG TGCTACCGTA ACGGGCAATC CGCTCGATGT GGTAACCGGC
TACTTTGCCA TGCGGAAAAT TGCGACGGGG AAAGGTCCGG TAGCCAACCA GCCGGTATTG
CTGCTGAACA ACAAATTTGT TTTCCAGAAT GGCCCGCTCG ATCAGGGCTG GTGGCCCGGC
AGTTTACTGA CGCCCCCTTC GGATGACGCG ATGGCCTTTG AAATTGACTT CCTGAAAAAG
TCGGGCTTCA ACATGCTCCG TAAGCACATC AAAGTAGAGC CTGACCGCTA TTATTACCTG
TGTGATAAGA TGGGCATGCT GGTTTGGCAG GATATGCCGT CGGGCTTTCT GGAAGGCCAG
AACGAAGCCC CCGGCGATCA GACGGAGCCC ATTCGCCGGT CGAAGGCGAA AGAACAGTTT
GAACTGGAGC TACGCCGGAT GATGAACCGC CTGCATAACC ACCCCAGCAT TGTTACCTGG
GTGGTGCATA ACGAAGGCTG GGGCCAGTAC GACAACAAAC GCCTGGCCGA TTGGGTGAAA
GCCCTCGACC CCAGCCGGAC CGTTAACGCC AGCAGCGGCT GGAACGACCT CGGCGCGGGC
GATTTCTACG ATATTCATAC CTACGAGCCG GAACCCAACG CACCAGCACC CAAAACCGAC
CGCGTGGTAG TTATTGGCGA ATTTGGCGGC ATCGGCTGGC CGGTACAGGG TCATCTCTGG
AACCCAGAGA TGCGGAACTG GGGCTACCAG ACGTATCAAT CGGCCGACGA GGTGCTGAAA
GCCTACCAGA AGAAATACGC CAAAATCGTG GAGTATTATC AGAAACAGGC GCTGTCGGCG
GCAGTGTATA CCCAAACGAC GGACGTGGAA GGCGAAGTCA ACGGCCTGCT TACGTACGAC
CGAGAGGTAA TCAAAATACC TATCGAAACG CTGAAAAAGA TTCACGCGCC TTTGTTTAAG
TAA
 
Protein sequence
MRFNQTLFTP LGLLLLTTGL AVAQSPAQNT PRTAPIMSRW EKQLTPENAW REYPRPQMVR 
KQWQNLNGMW DYAITAKTAP QPTDFSGQIL VPFSVESTVS KVNKSLTADQ RLWYRRTVSV
PADWAGQRVL LHFGAVDYEC SLWVNGGLVG SHTGGLDAFS FDITDYLKDG QNQLVLGVLD
PTSTGEQPRG KQLMNPNGIW YTPVSGIWQT VWMEPVPKQT YIEEVKLTPE LDSGRVRVDV
LLDKPANNYT TAIRLTAMDG NRTVATTLVR AGRTGYLSVK NPKLWSPDSP FLYDFKAELV
TVTDPFGDTP RNKRPRQDDA LNKAFAGATV TGNPLDVVTG YFAMRKIATG KGPVANQPVL
LLNNKFVFQN GPLDQGWWPG SLLTPPSDDA MAFEIDFLKK SGFNMLRKHI KVEPDRYYYL
CDKMGMLVWQ DMPSGFLEGQ NEAPGDQTEP IRRSKAKEQF ELELRRMMNR LHNHPSIVTW
VVHNEGWGQY DNKRLADWVK ALDPSRTVNA SSGWNDLGAG DFYDIHTYEP EPNAPAPKTD
RVVVIGEFGG IGWPVQGHLW NPEMRNWGYQ TYQSADEVLK AYQKKYAKIV EYYQKQALSA
AVYTQTTDVE GEVNGLLTYD REVIKIPIET LKKIHAPLFK