Gene Slin_1658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1658 
Symbol 
ID8725393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1990631 
End bp1993750 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content53% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003386504 
Protein GI284036574 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.450453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTTC TTCAACAAAC GGCCCAACGC ATTTTTGACA CCCATGGTCC CAGCTTAAAC 
GATGTTTGGG TAATCTTGCC CACCCGCCGG GCTGTGTCTA CCTTCCTGGA TGAGCTGGCC
GCCCTTTCTG ACAGGCCCTT TCTGGCCCCC CACGCACTTG CGGTCGATGA TTTTATCACC
CAGGCAGCGG GCGTTCAACT GATTGATTCG GTTAGTTTAC TTTTCGAGTT ATACGACGTT
TTTAAAGAAA TTGACCCGCT GGTCGAGTTT GAGCAGTTTA TCGGCTGGGC ATCCATACTG
CTCTCCGATT TCGACCGTAT TGATCAATAC CTCGTCAATC CCCACGAGTT ATTCAGCTAC
CTGACAGCCG CCAAAGCACT CGAACGCTGG CAGGTCGACA TGCCTTCGTC CGCCAAACCG
ATTGTCGAGA CGCCCGGCAC GACCCGATAC TTCAAACTGT TCGAGAATAT ACATACAGCC
TACCACGCCC TTCACCAGCG CCTGAACGAG CAGCGACTGG CCTATCGGGG AATGGCCTAC
CGGCTACTGG CGCAGCAGGT TGAACCCTTG ATTCGGGATA ATCTGGCCTA TGAGCGGGTG
TATTTTGTGG GATTCAATGC CCTGAGTAAA GCAGAAGAGC ACATTATCCG GGTGCTGGTC
GATGCCAAAA AAGCTGAACT GATCTGGGAT GCCGACCTGT ATTACATGAA TGACCGGCGG
CAGGAAGCCG GCGAATTTCT GCGACGATAT AAGGACAATG GCTGGCTTTT TTCGAGACAA
AACCATGCCG ATCTGGCGCA ACTGTCTAAT AACCTGCTAG GTTCTGAAAA AAATATCCGC
GTCGTTGGCG TACCCAATGC CAGCATGCAA ACCAAGGTAG CGGGTAAGAT TTACAGCGAA
TGGCAACGGG CCGATAGTGC AGTGCCACGC GATGCTGCAA CGCCCTCGCC AAAGACCGCT
ATCGTACTGG CTGACGAAAC ACTTCTGGTG CCCGTGTTAT ACGCGCTGGA TGAAAACGTG
ACCGACCTGA ACGTTACTAT GGGTTTGTCG TTGCGTTCGT CGCTGTTGTT TACCCTGGTC
GACACGTTGT TTGAGATGCA GCGGACGGTC CATGAGTTCC GTACCAAAGA CGGGCGCGAC
CTCAAGATCC CCAAGTTTCA CCATCGCCAC GTCGTAAAAC TCCTTAACCA CCCCTTCCTG
AAGCAGTACG AGCGTATACG GGGGCTGATG TGGCCGGGCG ATGTACTGTC GACAGGGGAA
ATTTTACCGC CCGAACCGCT CTTCCAGTGG ATCGCCAAGG AGATTGTGAA AAATCAGCGG
GTATACCTGA CCGAGCGGGA CATGCTGGAA CTGGGGCAGG ATGACCCACT CGTGCGGGTG
CTGTTCAGGC GTTGGCCCAA TGAAGAGCCC ATGAAGGCCA TTCGTACGTT TTATGACCTG
ATCGAGTTAC TGCGCGACGT GTATCGTACC AGTCAGGATG CCATCGAGAT CGAGTACCTA
TACCTTTTCT TTACCCTGCT CAAACAGCTG GAAGCAACGC TGGACAGGCA GGGCGAAGGA
GCACAAGGGG CCGGGCGCGG AGTACCGGTG CCCAAGGCGC GAAATAGAGG GCAGGGGGGA
ACCGCCATCC TGGATACGGG CGCCCCGGTT CCCATGCTAC ACGTCCACGA ACCGTCGGCC
GCCGTTACCG TGCGCAGCCT GAAACAGTTT TTGTACGAAC TGATTCGGCA AACGAGTATT
CCCTTTACCA GCGAAGGGAA GAGCCAGTTG CAGATCATGG GTATGCTCGA AACACGGGCT
CTGGATTTTG ACCGGGTTAT TATTCTGTCA GTGAACGAAG GGATTCTGCC GCAGTCCAGA
AAGTTAAACT CGCTTATTCC GTTCGATATA GCCGCCGATG AAAACATAAA GCTGCCTACT
TATAGCGAAC AGGAGGCCGT GATGGCGTAC CACTTTTACC GGTTGCTGCA ACGGGCCAGC
GAAGTGGTGC TCTTGTACAC AACCTCGACA GATGCGTACG GGAATAGCAA AGGTGAGCCA
AGCCGCTTTA TCCGCCAGCT GGAACACGAG CTGGTGCCCC GCTCTAACGG ACTCGTTCGG
ATAAGCTACC CAACGGTTCG TTTCGGTCGG ACAAGCGAGA AAAAGGAAAC CAGTCTGACC
GAGCTGAGTG TGCCCAAAAC GGAATCGGTG CGGGACGGTC TGATCAATCT GCTCATAACG
AAAGGGTTGT ATCCATCTTA CCTGAATCAG TTCGTGAGCT GTTCCATGCG GTTTTACTTC
AGCCGGATTG TAAATATTAG TGAGGAAGAA GACATCGAAG AGAAAATGGG AGCGGCTGAG
TTCGGAAGCT GGCTGCACAA AGTGATGGAG CGGCTAGACC TTGAGTACCG CCTGAAGGCG
CTGCCCATCG ACGAGTCGAT CATTAAAATG CTGCTCGAAG AAGAGTTTGC CAGTACCAAC
AAAGGCCGGG TTATCGAGTC GGGCATGAAC CTGCTGCTCT ACGACCTGGC ACAGAAACTC
ATGCTCGACT TTCAGCGTCA GCAGAATGCG CTTCCTGGTT TGACCGTTAT CGGAACGGAG
CAAACCCTCG AAACGTATTT GACTGTATCC ATTGAAGGGC GTGGGGCCGT TCGGGTGCGG
ATAGCGGGTA AAGTAGACCG TATCGAACGT CTGGGTGATC AAATTCGAAT TGTCGATTAT
AAAACGGGCA AAGTCGACCT GTCCGAAAAA ACGCCCAAAG ACCTGAGTGA TCGATTACTG
AACGATGGGG GCGACGATGC GGGTAAGATG CGGCAGTTGT GGCTGTACCG GTATCTGGCC
CTTAAAAACA TTAGCGAGTA TGGTGGTTTG CCCCGCGACC GGGCTAAACG GGATATTTTT
AATGCGGAGG GTATGCCTGT CGAAGCTGGC TTTTATTCGT TCCGGGATGT GAATGGGGGC
TTTAAAACAA ACCCTGTTCG CTTCGGAGAC AATGATAGCC CTGGTCAGTA CATCGAAGAT
TCGGAGGATT TACTTCGCCA ATTGATACAA CAACTGCTCG ACCCTGAACA ACCGTTCAGG
AAAACGGACC AGATTGAGAC CTGCCAGTTT TGTGATTATA AGGGTATTTG CGGGCGATAA
 
Protein sequence
MTFLQQTAQR IFDTHGPSLN DVWVILPTRR AVSTFLDELA ALSDRPFLAP HALAVDDFIT 
QAAGVQLIDS VSLLFELYDV FKEIDPLVEF EQFIGWASIL LSDFDRIDQY LVNPHELFSY
LTAAKALERW QVDMPSSAKP IVETPGTTRY FKLFENIHTA YHALHQRLNE QRLAYRGMAY
RLLAQQVEPL IRDNLAYERV YFVGFNALSK AEEHIIRVLV DAKKAELIWD ADLYYMNDRR
QEAGEFLRRY KDNGWLFSRQ NHADLAQLSN NLLGSEKNIR VVGVPNASMQ TKVAGKIYSE
WQRADSAVPR DAATPSPKTA IVLADETLLV PVLYALDENV TDLNVTMGLS LRSSLLFTLV
DTLFEMQRTV HEFRTKDGRD LKIPKFHHRH VVKLLNHPFL KQYERIRGLM WPGDVLSTGE
ILPPEPLFQW IAKEIVKNQR VYLTERDMLE LGQDDPLVRV LFRRWPNEEP MKAIRTFYDL
IELLRDVYRT SQDAIEIEYL YLFFTLLKQL EATLDRQGEG AQGAGRGVPV PKARNRGQGG
TAILDTGAPV PMLHVHEPSA AVTVRSLKQF LYELIRQTSI PFTSEGKSQL QIMGMLETRA
LDFDRVIILS VNEGILPQSR KLNSLIPFDI AADENIKLPT YSEQEAVMAY HFYRLLQRAS
EVVLLYTTST DAYGNSKGEP SRFIRQLEHE LVPRSNGLVR ISYPTVRFGR TSEKKETSLT
ELSVPKTESV RDGLINLLIT KGLYPSYLNQ FVSCSMRFYF SRIVNISEEE DIEEKMGAAE
FGSWLHKVME RLDLEYRLKA LPIDESIIKM LLEEEFASTN KGRVIESGMN LLLYDLAQKL
MLDFQRQQNA LPGLTVIGTE QTLETYLTVS IEGRGAVRVR IAGKVDRIER LGDQIRIVDY
KTGKVDLSEK TPKDLSDRLL NDGGDDAGKM RQLWLYRYLA LKNISEYGGL PRDRAKRDIF
NAEGMPVEAG FYSFRDVNGG FKTNPVRFGD NDSPGQYIED SEDLLRQLIQ QLLDPEQPFR
KTDQIETCQF CDYKGICGR