Gene Slin_5933 details


Gene Information

Locus tag: Slin_5933
Symbol:
ID: 8729714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 7190835
End bp: 7192304
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: protein of unknown function DUF1501
Protein accession: YP_003390694
Protein GI: 284040764
COG category:
COG ID:
TIGRFAM ID:
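
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(7190835, 7192304)
print(length)                  # 1470
print(protein_length(length))  # 489
```

Both results agree with the Gene Length and Protein Length fields of this record.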


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATTC AGGACGAAAT ACATGACCAA CTCAGCCGCC GGACCTTTCT AGGGCAATCT 
AGTGCTGGTC TGGGTGCCAT TGCGTTAGCA TCACTGCTAA ACCCGACAAA TCTGTTCGGT
GGCGCATCGT CACCCGGCAC CTCCATGCCG GGAGAAAATC CGGCGGTAGG CAAGCCGCAC
TTTCCACCGA AAGTGAAACG GGTAATTTAT TTATTTCAGA GTGGAGCGCC GTCGCAACTC
GAATTGTTCG ATTACAAGCC AAAGCTCGAA GCCATGTGGG GGCAGGATTT ACCGGCTTCG
GTACGCAACG GCCAACGCCT GACGGGCATG AGTGCCGGAC AAAGCCGGTT TCCATTGGCG
GCTTCTAAGT ATAAGTTCGC GCAGTACGGA CCCGGTCGCA TGTGGCTTAG TGAATTGTTG
CCGCATACGG CGAAAATTGC CGGGGATTTA ACCTTTGTGC GCTCCCTGCA TACCGAGGCC
ATCAACCACG ACCCGGCTGT TACCTTTTTT CAGACGGGAA GCCAACAAGC CGGGCGACCC
AGTTTCGGCT CCTGGATCAG TTACGGACTA GGCTCAGACA ATCAGAATCT TCCATCCTTT
GTAGTACTTC TGTCCAAAGG GCGCGATGGC GACCAGCCGT TATATGCCAA ACTCTGGAGT
AATGGATTTT TACCATCTGT GCATCAGGGC GTGGTGTTCC GGTCGGGCCC TGACCCGGTG
TATTACCTTA ACAACCCGCC GGGAGTCGAT AAAACCAGCC GTCGGCGGAT GCTCGATTAT
TTGGATAAAC TGCATCAGGA ACAATTCAAA CACGTACTGG ACCCGGAAAT AAACAACCGG
ATGGCACAGT ACGAAATGGC GTATCGGATG CAGACATCGG TTCCCGAAAC GCTCGACATT
TCGAAAGAGC CGGACTATAT CTTCGACATG TATGGTCCCG ACAGCCGCAA GCCGGGCACG
TTTGCTGCCA ACTGCCTGCT GGCCCGCAAA CTGGTCGAAA AAGATGTTAA GTTTATCCAG
TTGTATCATC AGGGATGGGA CCAGCACGGC AATCTGCCCA ACGATATTAA AATACAAACA
AAAAGCGTTG ACCAGCCCTC GGCCGCACTG ATCATGGACC TCAAACAGCG TGGTTTACTG
GACGATACGC TCGTGATCTG GGGCGGAGAA TTTGGCCGTG GGGCATACTC ACAGGGAAAA
CTCACCCGCG ATAATTACGG GCGAGACCAC CATCCACGAG CGTTTTCGGT CTGGATGGCG
GGGGCCGGGG TTAAAAAAGG TATGGTCTAC GGCGAAACCG ATGATTTCGG CTATAACGTT
GTCAAAGACC CTGTTCACGT GCATGATTTC CAGGCGACGG TCCTGCATCT GCTCGGAATC
GACCACGAAA AACTGACCTT CAAAAGCCAG GGACGACGGT ATCGACTAAC CGACGTGCAT
GGCAAAGTAG TGAAGCCGAT ATTAGCATAA
 
Protein sequence
MDIQDEIHDQ LSRRTFLGQS SAGLGAIALA SLLNPTNLFG GASSPGTSMP GENPAVGKPH 
FPPKVKRVIY LFQSGAPSQL ELFDYKPKLE AMWGQDLPAS VRNGQRLTGM SAGQSRFPLA
ASKYKFAQYG PGRMWLSELL PHTAKIAGDL TFVRSLHTEA INHDPAVTFF QTGSQQAGRP
SFGSWISYGL GSDNQNLPSF VVLLSKGRDG DQPLYAKLWS NGFLPSVHQG VVFRSGPDPV
YYLNNPPGVD KTSRRRMLDY LDKLHQEQFK HVLDPEINNR MAQYEMAYRM QTSVPETLDI
SKEPDYIFDM YGPDSRKPGT FAANCLLARK LVEKDVKFIQ LYHQGWDQHG NLPNDIKIQT
KSVDQPSAAL IMDLKQRGLL DDTLVIWGGE FGRGAYSQGK LTRDNYGRDH HPRAFSVWMA
GAGVKKGMVY GETDDFGYNV VKDPVHVHDF QATVLHLLGI DHEKLTFKSQ GRRYRLTDVH
GKVVKPILA
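
The sequences above are printed in whitespace-separated blocks of ten characters. The sketch below shows one way to strip that layout, estimate GC content, and translate the gene sequence. It is an illustration rather than the method used by the source database. It assumes that only the codon-to-amino-acid assignments matter, which are the same in translation table 11 as in the standard code; table 11's alternative start codons are not handled, since this gene begins with ATG. All names are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence above.
print(translate("ATGGATATTC AGGACGAA"))  # MDIQDE
```

Passing the full gene sequence to `gc_fraction` and `translate` allows the reported GC content and protein sequence to be compared with values computed directly from the nucleotides.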