Gene Slin_1409 details

Gene Information

Locus tag: Slin_1409
Symbol:
ID: 8725143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1712538
End bp: 1713836
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: glycoside hydrolase family 8
Protein accession: YP_003386258
Protein GI: 284036328
COG category:
COG ID:
TIGRFAM ID:
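
The gene and protein lengths listed above can be checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1712538
end_bp = 1713836

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1299 432
```

Under these assumptions the computed values agree with the listed Gene Length (1299 bp) and Protein Length (432 aa).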


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0686478
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTGA TAAACAATAA CTTGCCTGAA AATCGCTGGT TGCTCCCAGT GCTGATCGGA 
ATCTTTTTGA TAGTAACGCC CATGGCGGTG GCTCAAAAAA AGATGAAAGA CAAGCCTAAA
GCCGAGGCTC AGGTAGGCAA CTACCGGAAC CTGTTTCGCG AAGCAGGCTA CAAACAGGCC
GACATCGACG CGAAATTGGC GAAGGCGTAT CACGACGTGT TCGAAGGTCC CAATAAGGTC
TATTTTGAAG TCGGTGATAC GATGGCGTAT GTGTCAGATG TGAAGAATAA GGACGCCCGA
ACCGAAGGTT TGTCGTATGG GATGATGATC GCCGTGCAGC TGGACAAAAA AGACGTGTTC
GACCGCATCT GGCGGTGGTC GAAGAAGTAC CTGCAACACC AGAGCGGCCC ACGAGAGGGC
TATTTTGCCT GGAGCATCAA CCCGCAGACG ATGAAGAAAA ACTCGGAAGG GTCAGCGTCG
GATGGGGAGT TGTATTACAT CACCAGTCTG CTACTTGCGG CCAACAAGTG GGGCAACAAC
ACAGGTATCA ATTACTACGG CGAAGCCCGG CGAATTCTGG ATGCGATTTG GAAGAAAGAT
GGCACCGGCA ACATCTACAA CCTCATCAAC ACCGACACGA AGCAGATCAG CTTTGTGCCC
GAAGGAGGCA TGTACAACTG GACCGATCCG TCGTATCACC TGCCCGCCTT TTACGAGGTC
TGGGGCACGT ACGCCAAAGA CGGGCACGAG CAGTTTTACA GGGAGTGCGC CGATACATCG
CGCGCGTTTT TGCACCGGGC CTGCCACCCC GTAACGGGCC TCAACGCCGA TTACACGGAG
TTTAGCGGCA AGCCGCACGA CACCCGCTGG TCACCTTCGG CGTTTCGCTA CGATTCGTGG
CGGGTGCCCA TGAACATCGC CATGGACTAC ACCTGGTCGG GAAAAGACAA AGCCTGGCAG
GAAGATTACG CCAAACGGTT TCAGGGTTTC CTGCGCTCCA AAGGCATGGA CACGTATGAT
GATCAGTTTA ACCTGGACGG CTCACGCCCC GAATTTATTT TGCAGGCTGG GCCGGTAAAA
AAACTCCGGC ACTCGCTGGG ACTGGTGTCT ACATCGGCTA CGCTGTCATT GGTCAATAAG
GAGCCAAACA GCAAGGACTT TGTACGAGCC GTCTGGAACG CAAAACTGGA ACCGTTCGAC
GATGGCTATT TTGATCCGTA TTACGATGGG CTGTTGTATG TGTTCAGCCT CATGCACCTG
AGTGGAAAGT ACCGGATTAT AACGCCACAG GCCCGTTAA
 
Protein sequence
MNLINNNLPE NRWLLPVLIG IFLIVTPMAV AQKKMKDKPK AEAQVGNYRN LFREAGYKQA 
DIDAKLAKAY HDVFEGPNKV YFEVGDTMAY VSDVKNKDAR TEGLSYGMMI AVQLDKKDVF
DRIWRWSKKY LQHQSGPREG YFAWSINPQT MKKNSEGSAS DGELYYITSL LLAANKWGNN
TGINYYGEAR RILDAIWKKD GTGNIYNLIN TDTKQISFVP EGGMYNWTDP SYHLPAFYEV
WGTYAKDGHE QFYRECADTS RAFLHRACHP VTGLNADYTE FSGKPHDTRW SPSAFRYDSW
RVPMNIAMDY TWSGKDKAWQ EDYAKRFQGF LRSKGMDTYD DQFNLDGSRP EFILQAGPVK
KLRHSLGLVS TSATLSLVNK EPNSKDFVRA VWNAKLEPFD DGYFDPYYDG LLYVFSLMHL
SGKYRIITPQ AR
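
The relationship between the gene sequence and the protein sequence above can be sketched in Python. This is a minimal illustration, not the method used to produce this record: `translate` and `CODON_TABLE` are hypothetical names, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted, which this sketch does not model).

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAACCTGA TAAACAATAA CTTGCCTGAA"))  # MNLINNNLPE
# Last 9 bases of the gene sequence above, ending in the TAA stop codon.
print(translate("GCCCGTTAA"))  # AR
```

Both fragments reproduce the corresponding ends of the listed protein sequence; passing the full gene sequence to `translate` should work the same way.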