Gene Slin_2123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2123 
Symbol 
ID8725861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2573359 
End bp2576580 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content44% 
IMG OID 
ProductYD repeat protein 
Protein accessionYP_003386956 
Protein GI284037026 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAC TCTACCAGTG GCTGGCCATA GGCTGTCTGC TCACTACTTT CTTGCCTAGT 
CTGGCACAAA CGCCCGGCAC TAACTACACA ATCAGCCGAA CCTATAAACA GGCTAACATT
AGCGAAAACT TGACTGGATC GGGCTTTACA GCAACAGCTC AACAGGCCAC TAGTCAAGTC
TCCTACTTCG ATGGATTGGG GAAGCCTATT CAGCAGGTGA TGGCTTTCGG GGCGGGGTCC
AAAGCTGACA TCGTTACGTT AATTGAATAT GATGCTCTAC AGCGACCCGT TCAGACATTT
TTACCCACTC CAATAAACGC AAATGGTGGA CTGCTTCAAG CGAATGGTGA TCTCAAAATA
AAAGCAAAGA CATATTATAC GGACGTCGCA AAAGTTTCAG CTCCTATTAA TACTACTGCT
CCTATAGAGG TCTTAGCAAC TCAAACTTTC TACGAAAACA CACCGCTTAA CCGGGTCACG
AGCCAGAAAG CCCCGGGAGC CACTGGCAAT TCCACTCAAT TCCATGGGGT GAACGCCATA
AATGATGTTA AATACTTTCG AAGCAGTGCT GCAGGGCTAG CAAGTATTCA ATTGGTTGGT
ACTTATGAGG CAGGTGAGTT GACTTATGTG CGTACCACTG ATGAAGCCGG TGGTGCCGTG
ACTCAGTATC TGGACCGTCT CAATCGGATA GTCCTTAAGC GCGTACTCAC TAGCAAAAAT
GGCCTGGATG TTAATTTAGA CACATACTAT GTTTATGATG AGAAAAGTCA ATTGCGGGCG
GTCCTTCAAC CAAATTATCA GAACGAAACA GACTTAAACC GGAATGCCTT TCTCTATCGG
TATGATGAAT ATGGACGTTT AGTGGAAAAG AAGTTGCCTG GCAGCAACGC GTCGCAAATG
GAGTATTACA TTACTACAGA TCTGCCGAAA AGTAGTACAG ATGGACGTGG CCAGAAGTTT
TATTACCTCT ACGATAATCT TAATCGCCAG ACCGAGATGG GTCTGTGCAA AAATGGCAAC
TGTGATACCC CAGAGCCCCT GTTGAAGACC TATTACGATA ATTATGGGTT TACCCCTTTC
CGAAATTACG AAGCTGAGCC AGGCTTAACC GGAGTTGCCT TTGCGAATAC TCCTATGGTC
AACCGTACTA ACTTGAAGAC CGGTCAGGCT GCCAGGGTGC TGCTACCTAA TGGCGATTAT
GGACAGTGGT TGCAAACGGT GATCTATTAC GATGACAAGC AAAGAGTAAT TCAGACGTTG
CGTCAGTTGT ACGGATTCAG CAGTAATGCA TTTGAGCGCG TGAGCCTGCA GTTAGCATTC
GATGGCAAGC CCGAGCAGGA GTGGATTACT CAGGAGACTG GCAGTGTGAG CTACAAGCTA
ACCAAGACTT TCACGTACGA CCATGCCAAC CGGCTGAGTA AAATTAACCA TATACTCTAT
GAAGGAGGTG TTCAGAAAAA ATCATACACC CACATGGAGC AACTTTACAA TGAAGTAGGT
CAACTGGCGA CCAAATCCCT GCATACAGGT GTGCAAATTC TAGGCTATAA GTACACTCCA
CGGGGCTGGC TAGGCAATAA TCAAACAAGT ACAGGTCAGC CTTTCACGTT AGGTCTAAGT
TACAAAGCTA ATGGTAACAT TGATAGCCTG TCGTGGATAA CCAAAAGTTA CAGTGGAGGA
ATGGGGTTAA CCTACGACAA GTCGAGCAGA TTAATTGGAG CAGTAGGTAG TGGTAATTTT
GGAGGCTATA ATGAGTCACC AATCAATTAC GATAGCAATG GTAACTTAGA AAGCCTGACT
CGTAAGTACA ACAATACAGT CATTGACCAG TTGAGCTACC AGTATCACGG CAACCAACTT
CATAGGGTAA ACGACGACGC GCAAGATAAT CAGAGCCAAG CGGTTAAAGG ATTTATTAAC
GGAACTAACA TCGATGATGA GCTAATCTAC GACGGCAACG GAAATTTGGT AAGGGATTTC
AATCGGGGCG TTGGAAGTGC CACTACAGAT GGCATTTATT ACAACGTACA GAACCTGCCT
CGCACCGTGA TCCGTAATGG GCGTACGGTA CTTTATACGT ACGATGCTAG CGGAATAAAA
CTTAAAAGTG AAGCGCCAGA TAATGTCAAT ACGTACTACG CAGGGATGTT CGAGTACAAA
GCAGAAAACA GCTTACTTCG TATCGGTTTA GAGGAAGGGC AACTCGTAAA GAAAGATACT
AATTACTTAG CGCATTACTA TTTAAGGGAT CATCTAGGAA ATGTCCGCTC AGTACTGGAT
GAGGTCGGCA CTGTAATACA GGAGACAGAA TACTATGCTT TTGGTTTACC AATTCAGCGT
AGTGGAAGTG ATAAAAATAA ATATCTTTAC AACGGCAAGG AAAAACAGCC TGAGACTGAG
TGGCTAGATT ATGGAGCCCG TATGTATGAT CCGAGTATTG GACGATGGAT GGTGATTGAT
CCGTTGACGG AGATTTTTCC AAGTACTTCA TATTATAGTT ATGCGGTAAA CAATCCTACT
TTATTTACAG ATAAGTATGG TCTTTATGCC GAGTCTTCAG AAAACATAGC AGTTTGTCCA
ACTTGCCCAA GTGGAGAAGA ATATAGCAAA TACAGGGACA GTAAATCACT TTATACCTAT
GACAAAGGAA CAGGTGTTAT CCTAAATGGG GATGGAAAAG GTGCAACAGT AACTGCAAAA
AGAATACAAC CTACAGAAGC TCCTACTTTT GGTTGGCCTT GGCAAGCAGA TTTGCCTATT
GGCCTATCAG AATTGCAATT AGGTAATAAA ATTGAACAAG TAGCAAAATG GGACGGTACT
TTCCGTTATC CAAATTCTGT ACTAAGTCAA AAAGAAGTAG ATGCCAAAGC TATTTTTAGA
CAACCACTTA TTCAAAAACG ACCTATAAAT ATTCTTAGAA ATGTTGCCTT ACCAAGGGAT
CTGGCACTCA AAGTAGCAAA AGGGTTGAAG GTAGCTGGAG GAATAACGAT GGCGATGGGA
GTTGTAGACA ATATTAGCAA AGGCTATGCA GGTAATATTA CGTGGGAACA TTCAGCGACT
TCAATAGGTA TAGGTGCGTT TGGGTTAGTC GCAGGTACAT TTGGAGCACC TGTTGTTCTT
GGCGCAATTG CGGTAGGAGT TGTTTATTCT GTTTACGAAG ATGACCTCTG GCATGATTAT
GACAAAACCA ATGCAACTAA TTTTGTGGAA AAGAAAGAAT GA
 
Protein sequence
MNQLYQWLAI GCLLTTFLPS LAQTPGTNYT ISRTYKQANI SENLTGSGFT ATAQQATSQV 
SYFDGLGKPI QQVMAFGAGS KADIVTLIEY DALQRPVQTF LPTPINANGG LLQANGDLKI
KAKTYYTDVA KVSAPINTTA PIEVLATQTF YENTPLNRVT SQKAPGATGN STQFHGVNAI
NDVKYFRSSA AGLASIQLVG TYEAGELTYV RTTDEAGGAV TQYLDRLNRI VLKRVLTSKN
GLDVNLDTYY VYDEKSQLRA VLQPNYQNET DLNRNAFLYR YDEYGRLVEK KLPGSNASQM
EYYITTDLPK SSTDGRGQKF YYLYDNLNRQ TEMGLCKNGN CDTPEPLLKT YYDNYGFTPF
RNYEAEPGLT GVAFANTPMV NRTNLKTGQA ARVLLPNGDY GQWLQTVIYY DDKQRVIQTL
RQLYGFSSNA FERVSLQLAF DGKPEQEWIT QETGSVSYKL TKTFTYDHAN RLSKINHILY
EGGVQKKSYT HMEQLYNEVG QLATKSLHTG VQILGYKYTP RGWLGNNQTS TGQPFTLGLS
YKANGNIDSL SWITKSYSGG MGLTYDKSSR LIGAVGSGNF GGYNESPINY DSNGNLESLT
RKYNNTVIDQ LSYQYHGNQL HRVNDDAQDN QSQAVKGFIN GTNIDDELIY DGNGNLVRDF
NRGVGSATTD GIYYNVQNLP RTVIRNGRTV LYTYDASGIK LKSEAPDNVN TYYAGMFEYK
AENSLLRIGL EEGQLVKKDT NYLAHYYLRD HLGNVRSVLD EVGTVIQETE YYAFGLPIQR
SGSDKNKYLY NGKEKQPETE WLDYGARMYD PSIGRWMVID PLTEIFPSTS YYSYAVNNPT
LFTDKYGLYA ESSENIAVCP TCPSGEEYSK YRDSKSLYTY DKGTGVILNG DGKGATVTAK
RIQPTEAPTF GWPWQADLPI GLSELQLGNK IEQVAKWDGT FRYPNSVLSQ KEVDAKAIFR
QPLIQKRPIN ILRNVALPRD LALKVAKGLK VAGGITMAMG VVDNISKGYA GNITWEHSAT
SIGIGAFGLV AGTFGAPVVL GAIAVGVVYS VYEDDLWHDY DKTNATNFVE KKE