Gene Slin_2038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2038 
Symbol 
ID8725776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2463451 
End bp2466525 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content52% 
IMG OID 
ProductTPR repeat-containing protein 
Protein accessionYP_003386882 
Protein GI284036952 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.144797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTATT CAACCACCCA AACGGGTCAC TACCAATTTT CCAGCATGGT CTTTAATCGG 
TTGCTGTTCA CGCTGACCCT TGGGTTACTG GCTACCTTAT CGGCTTCGGC CCAGCGCACC
CAAAGTAATG TTGAACCCGA TTACCACTAC CGGAATGGGC TCGAACTCTT CGAGAAAGCC
AATTATGCCG CTGCCCGCTA CGAATTTCGG CAATACATGG AACCCCGGCG GGGCGATGGT
GCCAAATCAC TGCTGAATAC CAGTGATCAG AATGCAGTGG AAGCCGAATA CTACATCGCC
CTGACGAGTC TTTATATCGA CGAACCGGGG GCCGAGCTTT TAGTTGACCG CTTCGTCAAG
AACAACAGCC AGCATCCCAA AGCCGGACAA CTTTACGGCG ATCTGGGGAC TTACTACTAT
AACCGCCAGG ATTACACCAA AGCCATCGAC TTCCTGGAAA AAGCCGTCCG GCAGGGAGGC
AGTTCAACGC AGCAATCGGG TTATAAATAC CAGTTGGCGC TCTCCTATTA CAACACGCAG
AATCTGCAAA AAGCCCTGCC TCTGCTGAAC GAAATTAAAG TCGATCCCAA CTCAACCGAT
GCACCGGCCG CATCGTACTA TGCCGGGACC ATTAACTTCA GAAACAAGAA CTTCAACGAA
GCCGTCGCCG ATTTTCGTCG GATCGAAAAC AACCCAACCT ACCAAAATCA GGTACCGAAC
TGGATTGCCC AGTCGCTTTA CCGGCAGCGT CGGTACGACG ATTTGCTGGC CTATACCGAG
CCGCTGCTTA AGCGAAACAA TGGAGCAGGT ATGAACGAAG TTGCCCTGTT TACGGCGGAA
GTGTTTTACC AGCAAAACCA GTTTGCGAGG GCCATACCTT ATTATAAATC GTACGTGAAC
ACCGCTGGCG CCAAAGCACC CGGAGCGGTT AAGTTCCGGT ATGGGCAATC GCTCTTTCGC
ACCGGTGCCT ACAACGACGC CATCGCTCAA CTCAAAACAC TGGCGGGCGG AAAAGACACG
ACCGCTCAAT ACGCTGCGTA TACGCTAGGT GTCAGCTACT TACAAACGCA AAACCCGACG
TATGCTCTGA ACGCTTTTGA TCAGGCAGGA CGGCTATCGT TTAACCGGGA GATTCAGGAA
GAAGCGCGTT TTAACCACGC TAAACTTCAG CTTGACCAGA ACAACGGGGC GGATGCGGTA
AAAGAGTTGA CGGCTTTTTT GAAGCAGTAC CCCGATAGTA AGTTCGAAAA TGAGGCCAAC
GAACTGGTCG GTGAAGCCTA TTTTGCGTCC AACAATTACC CGGCGGCTAT CGCTTATATT
GAAGGGCTGA AACGCCGGAC ACCGAAAATT AACGCGACTT ACCAGCGGCT CACCTACAAT
CAGGGGATTA ACGATTTCAA CGCCGAACGG TACCAGCAGG CAGTTGCCAA CTTCGATAAA
TCGCTGAAAT ATCCGGTCGA AAATAGTTTA CAACAGGCAG CCCAGTTCTG GAAAGCGGAA
TCCTATTCGG CGGGCAAGCA ATATGATACG GCCATCCCCC TCTACGCCAG CATTTCCAAA
GCGGGCGGGG CAGACTCGTA TGCGACCAAA AGCCTGTATG GCCTCGGATA CGCCTATTTC
AACAAAAAAG ACTACACCCG CGCCCTGCCT TACTTTCGGG ATTTTGTAAG TCGGGGTGGC
GATGCTGACG ACAGAGTCCA GGTTCAGGAT GCGACGATCC GGCTGGCCGA TACGTACTTC
GCAACGAAAC AGTACGAAAA TGCCCTTCGT TCGTACGATC AGGCCATCGC GCAGAATGCA
CCGGACAAAG ATTATGCGTC CTACCAGAAG GCGTTAATTC TGAGTTATGT TGGTCGGGAC
GCGGAAGCCA AAGCCCAGTT TGACCAGGTG CAACGGCAAT ACCCGAACTC CCGCTTCGTG
GATGAATCGC TGTTTCAGAA AGCGAATGTC GACTTCGAGA AAGGTTCATA TCAGGTAGCC
ATTCAGGGAT TTACCAAGTT GATTCAGGAC AAGCCAAACA GTGCACTTAT TCCGGCCGCT
TTGCTGAAAC GCGCGATTGC CTACGGAAAC CTGCAACAGT ACGATCCGGC GGTGGCCGAT
TACAAACGCA TTCTGGACAA CTACGGTGAA TCTGACCAGG CGCAGAGTGC CCTGCTCGGC
ATTCAGAATA CACTTAACGA TGCTGGTCGG CCGGAGGAGT TCTCGCAGGT GCTGGGCCAG
TACAAGAAAG GGAATCCGGG CAGTACGGAT GTTGAGCGGG TTCAATTCGA AAACGCCCGG
AATATTTACG CTAGTGGTAA ATACGAGCAG GCCATCCAGT CGTTCCTGAA CTTCATGCAG
GAGTACCCGG CCAGTCCGAA TACCAATCAG GCCCGGTATT ACGTAGCGGA ATCCTACCGT
CAAACCAACG ATGTGGCTAA TGCCCTCCGG TATTACAACC TGGTGATTGC CGATAACAAG
TCGGATTACC TAGTCCGGGC CGCTACACGG GCCGCGGAAC TGGAAGTAAA ACAGAAGAAT
TATGGCCGCG CGGTTCGTAA CTACCAACTC ATCCAGAGCC GGGCTGGTAG CAAGGCAGAG
CAGGTTACGG CTCAGTTGGG GCTGATGGAT ACATATTTCG TATACCCGAA ACTGGATTCG
GCAGCGATTG TCGCCCGCGA GATTGCGGCT GGTGGCAATG TGGTACCGGG CGCGCAAAAC
CGGGCTCAGT TAATGCTCGG AAAAGTAGCC TTGAGCCGGA ATGATTACAA AACCGCCCAG
GCTGACTTTG ACAAAACGAT TGCCCTGGCC AAAGATATTT ACGGAGCCGA AGCCCAGTAT
TACCTCGGCG AAATCCTGTA CCGCCAGAAA AAGTACAAAG AGTCGGTGTC TACGCTGTTG
AAATTTAACG AACAGTTCAG TGATTTTGAG TACTGGAAAG GCAAGGCGTT TATTCTGGTT
TCTGACAACA ACGTCGCCCT GGACGAGCCG GCCCAGGCCA AAGCCGTTTT GAACTCCATT
ATTGAAAATT CATCCGACGA AACCATCGTT ACCGAAGCTA AACAAAAGCT GGCAACGCTG
GAGTCTAAAA ATTAA
 
Protein sequence
MPYSTTQTGH YQFSSMVFNR LLFTLTLGLL ATLSASAQRT QSNVEPDYHY RNGLELFEKA 
NYAAARYEFR QYMEPRRGDG AKSLLNTSDQ NAVEAEYYIA LTSLYIDEPG AELLVDRFVK
NNSQHPKAGQ LYGDLGTYYY NRQDYTKAID FLEKAVRQGG SSTQQSGYKY QLALSYYNTQ
NLQKALPLLN EIKVDPNSTD APAASYYAGT INFRNKNFNE AVADFRRIEN NPTYQNQVPN
WIAQSLYRQR RYDDLLAYTE PLLKRNNGAG MNEVALFTAE VFYQQNQFAR AIPYYKSYVN
TAGAKAPGAV KFRYGQSLFR TGAYNDAIAQ LKTLAGGKDT TAQYAAYTLG VSYLQTQNPT
YALNAFDQAG RLSFNREIQE EARFNHAKLQ LDQNNGADAV KELTAFLKQY PDSKFENEAN
ELVGEAYFAS NNYPAAIAYI EGLKRRTPKI NATYQRLTYN QGINDFNAER YQQAVANFDK
SLKYPVENSL QQAAQFWKAE SYSAGKQYDT AIPLYASISK AGGADSYATK SLYGLGYAYF
NKKDYTRALP YFRDFVSRGG DADDRVQVQD ATIRLADTYF ATKQYENALR SYDQAIAQNA
PDKDYASYQK ALILSYVGRD AEAKAQFDQV QRQYPNSRFV DESLFQKANV DFEKGSYQVA
IQGFTKLIQD KPNSALIPAA LLKRAIAYGN LQQYDPAVAD YKRILDNYGE SDQAQSALLG
IQNTLNDAGR PEEFSQVLGQ YKKGNPGSTD VERVQFENAR NIYASGKYEQ AIQSFLNFMQ
EYPASPNTNQ ARYYVAESYR QTNDVANALR YYNLVIADNK SDYLVRAATR AAELEVKQKN
YGRAVRNYQL IQSRAGSKAE QVTAQLGLMD TYFVYPKLDS AAIVAREIAA GGNVVPGAQN
RAQLMLGKVA LSRNDYKTAQ ADFDKTIALA KDIYGAEAQY YLGEILYRQK KYKESVSTLL
KFNEQFSDFE YWKGKAFILV SDNNVALDEP AQAKAVLNSI IENSSDETIV TEAKQKLATL
ESKN