Gene Slin_0566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0566 
Symbol 
ID8724294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp694175 
End bp697435 
Gene Length3261 bp 
Protein Length1086 aa 
Translation table11 
GC content55% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003385429 
Protein GI284035499 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG GGTTACTTTT TATCTTCTTC CTGGTCAATG CGCTGGTCCT GAACGGGCAA 
AGCCGATTTG GTAACGAATG GATACGACCG GGCCAGAAAT ACCTCAAATT TACCGTCAAT
CAGGCAGGGA TTTACCGGGT TGGCTACGAG GATATTAAAG CCGCTGATGC GTCCTTTCTG
AAAACCAATC CGGCCCGCTG GCAACTGTTT TTCAGGGGAG AGGAGGTCGC TATTCGGGTG
GTCGGGCAGC AGGATGGGAC TTTCGATGCC CAGGATTACA TTGAGTTTTA TGGCATCGGG
AACGACGGCG CGCAGGATTC GGTCGTTTAC CGACCCCAAC AGCGACTACA TTCCTACCAA
ACCCTTTTCT CCGATAAAAC AGCGTTCTTC CTGACCAGTA ACCCGACGTT GACCGGCAAA
CGAATGCCCG AACTGAACCT GTCGGCGCAG GGATTGACGC CCGAGCCGTT TCATGTGGAA
GAGACCGTTC AGGCCTTCAC GAGCGAGTAC ACGTTCAATA ACCTGAAGGG GCTGGAGCCG
TACCTGCAAC AAAGCTATTT TGAGCCGGGT GAGGGCTGGT CCGGACGGCA GCTTACCGCC
GATTCTGTGG GCGTTGTTCA ACTGAAACTG CCGGGCCGGA CATCAGCAAA CTGGCCCGTT
ACGCTGGAAG GCATGGTCAA CGGGCGCGAT AATTCCTACC ACCAACTCCA GCTTCAACTC
GACGCCAGTA CCACCTCGCC CCTGTCTGTG GAGCCGTTTA GCGGATTTAC GAGTCAGACA
TTTAAGGCAA CGGTTACGCC GACAACGATT CAGAATGATC AACTTACTTT GCGGTTCCGG
GGCGTGAAAA GCGGCTTTAC CACCAACTAT TCCATCACCT ACCTCAAGGT GGCCTATCCG
CAGGCGCTGG ATATGACCGG GCAGGTCGGC AAGGTGTTTA CCATACCGGC CAACCCGCGC
CTAAGTGCTT TACTGGCCGT CAGAAATGTG CCAGCCACGT CGCTCGCGTA CGATATTACC
GACCCCGCCA ATGTCCGGTT GCTGGCTACC CAACCTTCCG GCGACCAAAC GCAGGTGGTT
GTGAATGAGA TGGCTCGGAG CCGGGTCGTG TTGATTACCA ATAAAATCAA TAAACCGCTG
GCTATCCAGC CGATCCAATT TCGAACGGCG GTTCCCGAAA CGGCGGATTA CGTTATCATC
ACGCACGCAT CGCTCCGGCA GTCGGCCGCT ACGTATGCCG GTTACCGGTC TTCGGCAGCA
GGGGGGAGCT ACAAGCCGTT CATTGTCGAA TCCGACTCCC TCTACGACCA GTTCAATTAC
GGTGAGAAGA GCCCGCTGGC ACTTCGGCGT TTTGCCGATT ATCTGGTGGC TAACACGGCT
GTTAAGCACT TGTTGCTCGT AGGCCGGGCG AATAGTTACC CGTATACGGT CAAAACCGCT
ACCGACGATT TAGTCCCGAC CATGGGCTAT CCCGGTTCCG ATATTCTGCT GTCGGCCGGG
CTGGGTGGTT TCCCCCCGAA CACGCCCGCC ATCCCGACCG GGCGCATCAA TGCAACGACC
AACGATCAGG TACTTACGTA TCTGGACAAG GTGAAACAGA TGGAAGGGGC CAGTTACAAC
GGGCTTTGGC GAAAGCACAT TGTTCACATC AGCGGGGGTA AATCGGCGGG TGAAATATCC
AGCCTGCGCG AAGCCCTCAA CGACATTGGC CGTATCTATA CCGATGGTCT GTTGGGCGGT
CAGGTAACGG CGTTCAGCAA AAACACCAAT GCCGAAGTTG AGCAGATCAA TATTGCCCCG
CAGGTGAATG ACGGGGTTAG TCTGGTAACG TTTTTTGGTC ATGCCGGACC GGCCATCACC
GATATGAATT TCGGGTTTGC TTCTCCGCCC GAAAACGGGT TCCGTAATCA ATATTATCCG
TTTATGATCT TTAACGGCTG TGGGGTCGGA GAGATCTTCT CCAGCTTCAA AACCCTATCG
ACGGATTGGC TGCTGGCTCC GCAAAAAGGC GCAGGCCTGG TGCTGGCACA CTCGTATTAC
AGCTACGAAC TGCCCACAAC GCGGTACCTG ACCAAACTGT ATTCCCGCTT ATACACCGAT
GCCAGTACGC TGGGTATGCC TTTCGGCAAG GTGCAGCAGC AGGTTAACCT GGCGTTGGAA
AAAGAAGGTG TGGACGGATA CGACATCTCC GTGATTCTGC AAATGCTTTT GCAGGGCGAC
CCCGCCCTGA GTATGTATCC GCTACCAAAT CCGGACTTTT CGGTCGAGTC GAAGGGGCTG
TATATTCAGG GTAAAGTAGT GGGCAGCTCG CTACAAAACA GCGATTCACT GCGGGTTGTT
GTTCCGGTAG CCAACCTCGG TAAGTTCGTG GCTGGTCAGT TGGTGGCATT GGCGTTGACA
AAAACGAGCA ACGGCAGTGC GGCCACCACC ACGCTCCGGT TTCCGGCCTT TCGGTACCGC
GATACGCTGG TATATACCAT CGCCAAAGAC GAAAAGCTAC AGAAACTGGA GGTGACGATC
GATCCCGGGA ATCAGCTTGT TGAGTTGAGC AAGTCAAATA ACAAAGCCAG CCTGGACATC
GACTGGGCGC AGGCGAAGAC GAGTACCAGT TACCCGCCCA ATCGATTTCC TGACCGGGTA
AGCCCGGAAA TCAGCGTTTT CCTCGACGGA AAAGTCCGGG AAAATCAGGC AATCGTTGGG
GTGAACCCCC GTATAGAGGT GTTTATTCTG GACGAGAACC CGCTCTCGCC AACCGATACG
AGTGCCGTTG AGGTCTATCT GAAAAGTTGC GCCAGTTGCT CCCCCAACAA ACTGCCGTCG
GCCCGGTTCA CGGTGTCGGC GGTTTCGGCA AACCAGCTTC GGGTAGCTAC AAACGCTGTG
CTTCAGCCTG GGGGGAGCTA TCAACTTATT GTGTTCGGGA AAGATGCCGC CGGAAATCGT
ACGCAACCGC CCTACGTACT CGATATTGGC GTTGTAGCTG AGGATAAGCC CGTTACGGTA
ACCGCCTACC CAAATCCGGC GACGACCTAC GTGAAATTTG ACCTGAATTT GAATCTAACC
GAACTGCCCA CCGAATCCCG GCTGCTGATT TATAATCCGT CGGGCGTGCT GGTCTATACC
GATACGGTGT CGGTAACTAC CGGCAAAAAC ACATGGCTTT GGCAGGCAAC CGCAGCGGGC
GTTTACCCGT ATTCGTTCCG GTTAACCTAT AAAGACGGCC GCACCGAAAT GCATACCGGC
AAGGTGGTCT GGCAACATTA G
 
Protein sequence
MKKGLLFIFF LVNALVLNGQ SRFGNEWIRP GQKYLKFTVN QAGIYRVGYE DIKAADASFL 
KTNPARWQLF FRGEEVAIRV VGQQDGTFDA QDYIEFYGIG NDGAQDSVVY RPQQRLHSYQ
TLFSDKTAFF LTSNPTLTGK RMPELNLSAQ GLTPEPFHVE ETVQAFTSEY TFNNLKGLEP
YLQQSYFEPG EGWSGRQLTA DSVGVVQLKL PGRTSANWPV TLEGMVNGRD NSYHQLQLQL
DASTTSPLSV EPFSGFTSQT FKATVTPTTI QNDQLTLRFR GVKSGFTTNY SITYLKVAYP
QALDMTGQVG KVFTIPANPR LSALLAVRNV PATSLAYDIT DPANVRLLAT QPSGDQTQVV
VNEMARSRVV LITNKINKPL AIQPIQFRTA VPETADYVII THASLRQSAA TYAGYRSSAA
GGSYKPFIVE SDSLYDQFNY GEKSPLALRR FADYLVANTA VKHLLLVGRA NSYPYTVKTA
TDDLVPTMGY PGSDILLSAG LGGFPPNTPA IPTGRINATT NDQVLTYLDK VKQMEGASYN
GLWRKHIVHI SGGKSAGEIS SLREALNDIG RIYTDGLLGG QVTAFSKNTN AEVEQINIAP
QVNDGVSLVT FFGHAGPAIT DMNFGFASPP ENGFRNQYYP FMIFNGCGVG EIFSSFKTLS
TDWLLAPQKG AGLVLAHSYY SYELPTTRYL TKLYSRLYTD ASTLGMPFGK VQQQVNLALE
KEGVDGYDIS VILQMLLQGD PALSMYPLPN PDFSVESKGL YIQGKVVGSS LQNSDSLRVV
VPVANLGKFV AGQLVALALT KTSNGSAATT TLRFPAFRYR DTLVYTIAKD EKLQKLEVTI
DPGNQLVELS KSNNKASLDI DWAQAKTSTS YPPNRFPDRV SPEISVFLDG KVRENQAIVG
VNPRIEVFIL DENPLSPTDT SAVEVYLKSC ASCSPNKLPS ARFTVSAVSA NQLRVATNAV
LQPGGSYQLI VFGKDAAGNR TQPPYVLDIG VVAEDKPVTV TAYPNPATTY VKFDLNLNLT
ELPTESRLLI YNPSGVLVYT DTVSVTTGKN TWLWQATAAG VYPYSFRLTY KDGRTEMHTG
KVVWQH