Gene Slin_5056 details


Gene Information

Locus tag: Slin_5056
Symbol:
ID: 8728821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6160179
End bp: 6163520
Gene Length: 3342 bp
Protein Length: 1113 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: ASPIC/UnbV domain protein
Protein accession: YP_003389830
Protein GI: 284039900
COG category:
COG ID:
TIGRFAM ID:
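
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is an illustration only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(6160179, 6163520)
print(length)                     # 3342
print(protein_length_aa(length))  # 1113
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields listed for Slin_5056.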


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.841506
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAACCGCA TTCTTCTTTT TATCGGCGCA TTGGCGTTGA CGGGCTGTAA CTCCTTCAAA 
TCGGAAAAAC CCCTTTTCGA GTCGCTCGAT TCGACCCAAA CGGGCGTTGG GTTTGTTAAT
AAGCTGGTGC CGACCGAAAA GCTCAACATT TTAGATTACC TCTATTTCTA CAATGGTGGG
GGCGTTTCGG CGGGCGATAT AAACAACGAT GGCCTCACCG ATCTGTACTT CGTATCCAAT
CAGGGAAAGA ATAAGCTGTA CCTCAACAAG GGAGATTTTA AGTTTGAAGA CATCACTGCC
AAAGCGGGCG TCGAAGGATT TGCAGACTGG CAAACAGGCT CCACAATGGC CGATGTCAAC
GGCGACGGTC TACTCGATAT TTACGTCTGC GCGGTGGGCA ATTTCCGGGG GCAGGAGGGC
TCCAATGAAT TGTACATTAA TAACGGCGAC AATACCTTCA CCGAAAAAGC CGCTGATTAC
GGACTGGACT TTACGGGTTT TTCGACCCAG GCCGCCTTCT TCGATTACGA CCATGATGGC
GATCTGGATT GTTACCTGCT CAATCACGCT ATCCATACCT CCCGCAGTTA CGACCGCGTT
TCTGCCCGAA ACTTACGCAA TAACGAATCG GGCGATTACC TGTACGAGAA CCAAAGCATA
ACCAAATCCG GGCAAGCCCC TGCGGGTAAA ATCGCCCGGT TTCAGAACGT AAGTGAGCGG
GCGGGTATCT TCGGCGCGGC AATGGGCTAC GGGCTGGGAG TTTCGGTCGC CGACCTGAAT
AACGATGGGT GGGAAGATAT TTACGTCTCC AACGATTTCC ACGAAGACGA TTATTACTAC
GTCAACAACC ACGACGGGAC CTTTACCGAA AGCGTAAAAA AAGCCTTTCA GCACACCAGC
CGATTCTCTA TGGGAAACGA CATTGGCGAT GTGAATAATG ACGGTTTTGC CGATGTAGTC
ACGCTCGACA TGTATCCCGA AGATGAAACA GTTGAGAAAT CGTCGCTGGG CGAAGATCCG
CTGGATATTT ACACCTATAA ACTGGCGTAC GGCTACATGA ACCAGTACAG CCGGAACTGT
CTGCAACTGA GTCTTTCGGG TAAAAAATTC ATGGACATCG GCGCGATGTC GGGGGTAGCC
GCAACCGACT GGAGTTGGTC GCCGTTGCTG GCCGACTACG ACAACGATGG GATCAAAGAC
CTGTTCATCA CGAACGGCAT TGTCCGGCGG CCTAATAACC TGGAGTACGT GAAGTTTGCC
GCTGACGACT CATTGCGGTA TGCAATGGAA ACATCGAAAT CGCTCGATGA GCGGGCTACT
AAACTCATGC CCGAAGGCAA AGTGCATAAT TACCTGTACC GGGGAACACC CTCGTTGCGG
TTTGAGGATA AGTCGTCGGC CTGGGGTTTC GAACTGCCGA ATTACTCGAA CGGTTCCGCC
TACGCCGACC TCGACAATGA CGGCGATCTG GATCTCATTA CCAACAACAT CAACGACCCG
GCCAGTTTGT ACCGCAATGA AGCCACGACG TTGTTTCCCG AAAATACGCA CCTGACCGTT
AAGCTCAAAG GCGATTCGCC CAACACATTT GGGGTAGGGG CAAAGGTGAT TTTGAAATAC
GGCAAAGACA GTCTGCTGGT GCAGCAACTG ATGCCAACGC GGGGCTTCGA GTCGTCAGTG
GCACCGGAGT TACTGTTTGG GTTGGGCAAG CGGGCCAGCA TTGACTCGCT GATCGTTATC
TGGCCGAATC AGCAGATGGA GGTGCGTACC AACGTCAAAA CCAATCAGTC GATAATGCTG
AAACAGGCGG ATGCCAAACT CGATGGTGCC GGGTTCGTTT ATACAAATCC GCCCATGCAG
CCTTTGTTCG CCGATGTATC GACCGCCGAC TCAATTCCGT ACCAACACAA GGAAAATAAA
GAGTATTTCG ATTTCGTGCG GGAGTCGCTG ATGCCGTTCA AAGTGTCGAC CGAGGGGCCG
CATCTGGCCG TTGGGGATGT AAACGGCGAT GGACTGGACG ATATGTATGC CGGTGGGGCC
AAGTGGCAGG CTGGTAGCTT ACTCGTGCAG CAAGCGAACG GAACGTTCAA ACCATCGGTA
CAGGCCGACT TTAAGAAAGA CTCTGTTTAT GAAGACGTCG ACGCGGTTTT CTTCGATGCC
GATGGCGACA AGGACCTGGA TTTATACGTC GTTTCGGGCG GTAACGAGTT TTATAACAAG
ATGCCGGAGC AGTTCGACCG GCTCTACCTG AACGACGGCA AAGGAAGCTT TACGCGTTCG
CCGAATGCGC TGCCTGCCAT GTACGATAAC AAAAGCTGCG TTCGTCCTTT CGACGTCGAT
AACGACGGCG ACCTCGACTT GTTTGTGGGT GGGCGTGTAG TGGGCTTCGG CTACGGTAAA
TCGCCTAATT CGTATTTGCT GGTCAACAAT GGCAAGGGTA CGTTTACAGA TCAGATTGAT
AAGCTGGCAC CGGGCCTCCG AACGGTTGGT ATGGTTACCG ACGCCGTATG GGCCGATTAT
GACGGTGATA AAGACCAGGA CCTGATCGTG TCGGGCGACT GGATGCCGAT TCGCATTTTC
GCCAATGATA AAGGGAAATT CCAGGAAGTG AAATCCATTA CCGATACCCC GCTCAACGGC
TTTTTTCAGC GCCTTATTGC CGCCGATTTC GACCGGGATG GTGATATTGA TCTGGTAGCT
GGAAACATCG GCACAAACAC CAAGTTCCGC AAAACACCCG ATTCGCAGCT GCGGATGCTG
GTAAAGGATG TAGACAATAA CCAGAGTATT GAGCAGATCA TTTCGTACAA TCGCGGAGAT
GACTGGTACC CACTCGCCTT TAAAGACGAA CTGGGCAAGC AAATGCCGAG CATCATTAAC
AAACGCTTTA CCGACTATGT CTCCTTCGCA GGGAAGCCGC TCGATGATGT TTTGACGGAT
GAAGAACTGA AAGGAGCGGA TGAACTTACC GTTATCGAGT TTGCTTCGGT TTACCTGGAG
AATAACGACG GTAAGTTTGT CGTGCATGAA CTGCCCATGA TGGCGCAGGT ATCGAAGTTA
TTTGCCCTGC AGGCCATTGA CTTCGACCGG GATGGGGATC TGGATATACT GGGGGGAGGT
AACTTTTACG GGGTCAGCAC CTATCAGGGT CGATACGATG CCAGTTACGG GCTGGTACTG
CGGAATGATG GCAAAGGAAA CTTTAGTGCG CTGTCGCCCG TAGATTCCGG TTTCCTGCTC
AATGGCGAGA TACGGGATAT TCGCCCCGTA CGTTCGGCTC AGGGCGTTCG CATCGTTGTA
ACGCGGAACG ACGCTGGTAT GCAGGTATTC AAATCGCTGT AG
 
Protein sequence
MNRILLFIGA LALTGCNSFK SEKPLFESLD STQTGVGFVN KLVPTEKLNI LDYLYFYNGG 
GVSAGDINND GLTDLYFVSN QGKNKLYLNK GDFKFEDITA KAGVEGFADW QTGSTMADVN
GDGLLDIYVC AVGNFRGQEG SNELYINNGD NTFTEKAADY GLDFTGFSTQ AAFFDYDHDG
DLDCYLLNHA IHTSRSYDRV SARNLRNNES GDYLYENQSI TKSGQAPAGK IARFQNVSER
AGIFGAAMGY GLGVSVADLN NDGWEDIYVS NDFHEDDYYY VNNHDGTFTE SVKKAFQHTS
RFSMGNDIGD VNNDGFADVV TLDMYPEDET VEKSSLGEDP LDIYTYKLAY GYMNQYSRNC
LQLSLSGKKF MDIGAMSGVA ATDWSWSPLL ADYDNDGIKD LFITNGIVRR PNNLEYVKFA
ADDSLRYAME TSKSLDERAT KLMPEGKVHN YLYRGTPSLR FEDKSSAWGF ELPNYSNGSA
YADLDNDGDL DLITNNINDP ASLYRNEATT LFPENTHLTV KLKGDSPNTF GVGAKVILKY
GKDSLLVQQL MPTRGFESSV APELLFGLGK RASIDSLIVI WPNQQMEVRT NVKTNQSIML
KQADAKLDGA GFVYTNPPMQ PLFADVSTAD SIPYQHKENK EYFDFVRESL MPFKVSTEGP
HLAVGDVNGD GLDDMYAGGA KWQAGSLLVQ QANGTFKPSV QADFKKDSVY EDVDAVFFDA
DGDKDLDLYV VSGGNEFYNK MPEQFDRLYL NDGKGSFTRS PNALPAMYDN KSCVRPFDVD
NDGDLDLFVG GRVVGFGYGK SPNSYLLVNN GKGTFTDQID KLAPGLRTVG MVTDAVWADY
DGDKDQDLIV SGDWMPIRIF ANDKGKFQEV KSITDTPLNG FFQRLIAADF DRDGDIDLVA
GNIGTNTKFR KTPDSQLRML VKDVDNNQSI EQIISYNRGD DWYPLAFKDE LGKQMPSIIN
KRFTDYVSFA GKPLDDVLTD EELKGADELT VIEFASVYLE NNDGKFVVHE LPMMAQVSKL
FALQAIDFDR DGDLDILGGG NFYGVSTYQG RYDASYGLVL RNDGKGNFSA LSPVDSGFLL
NGEIRDIRPV RSAQGVRIVV TRNDAGMQVF KSL
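
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline. It assumes the sequences are copied with the space-separated 10-character blocks shown above, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs only in its permitted start codons, which does not matter here because this gene starts with ATG). The names `translate` and `gc_content` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, and the first 10 residues they encode.
print(translate("ATGAACCGCA TTCTTCTTTT TATCGGCGCA"))  # MNRILLFIGA
```

Passing the full gene sequence to `translate` should yield the protein sequence above with its block spacing removed, and `gc_content` on the same input can be compared against the reported GC content field.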