Gene Slin_3266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3266 
Symbol 
ID8727019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3945429 
End bp3948380 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content53% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003388076 
Protein GI284038146 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.348309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.462264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCGTT CCGGTCTACA GCACCGGAAC GACCTATCTA TTTTCCTTAA CCAATCAGCT 
AACTATATGA AGATTTTTAC TGACAAATTA CCACTCAGGT CGGCACTCGT AACGGGACTT
GCCTTGGGTA GCCTGTCTGT ACAGGCGCAA AAAATAGACG AGCTGTATAA CCAGAAGATC
AGGGAATACA CGACAGATGC CCGGTTTCTG CCTGCTTCGG TGCTGAACCT GCCCGACGAC
CCTAAAATCC CATCTCCGCT CAAACATTTC GGTCAGATTG TGGGCACGCC CGGCGTCATT
CACCGCACGC CCGAGATTTA TGGCTATTAT CAGAAACTGG CTCAGACGTC ACCCAACATC
AGTGTACAGC AGGTTAGTAC TACGGAGGAG GGGCGCCCGA TTCAGTTGGT GGTTATCGGC
AGTGAAGACG CCATGAAGCG GCTCGATCAC TATAAAAAGC AACTCGCCCT GCTGGCCGAC
CCCCGTAAAG TAGGGAGTCA GGATGTTGAA AAGATTCTGG GTGATACCAA GCTGGTGTAC
TACCTCAACG GTGGCCTGCA CTCGCCCGAA ATGGGGTCAC CGGAGATGCT GATGGAGCTG
GCTTACCGGT TGGTTACCAG CCAGTCGCCC GAGATAAAGA CGATTCGGGA TAACATCATT
GTGCTAATCA ATCCGGTGTC GGAACCCGAC GGCTGGGATA AACAGGTCGA CTGGTACAAC
CGCTATACCA AAGGCCGGAA AGAGTACGAT GACGGGTTTC CGAAATCGCC ACCGTACTGG
GGAAAATACA CCTACCACGA TAACAACCGG GATGGTTTAC AGGCATCGCA GGAGTTGACG
AAAGCGCTCT ATAAAATCTT TTACGAATGG CATCCCACTG CCAGTCTCGA CCTGCACGAG
TCCGTCCCGT TGCTGTACAT ATCCACCGGA ACAGGGCCGT ATAACGAAAC CATCGACCCG
ATCACCATTG GCGAATGGCA GATCATGGCG CACCACGACA TCACGACACT GGCCTCGCAG
GGAGTTCCGG GCGTGTTTAC GTGGGCGTTT TACGATGGCT GGTATCCTGG TTATGCACTC
TGGATTTCCA ATAACCACAA TGCCGTTGGC CGTTTTTATG AAACCTTCGG GAACGCCGGG
GCGAACACCT ACCTGCGCGA TCTGGCCGAG CAGAAATACG CGGGCGACCC CGCCACTACG
AAAGAATGGT ATCGGCCCGT GCCGCCCACC GAAAAAGTTT ACTGGTCGTA CCGGAATGGC
ATCAATTACA TGCAGGCCGG GGTGCTGGCA TCGCTGTCGT ACGGCGCCAC GAATAGTCGG
CTGTTGTTAA AAAACTTCTA TCAGAAAGGG CTGAACAACA TCAAAAAAGG GACGGAAGAA
ACGCCACGTG CGTTCGTTAT TCCCAAAAAT CAGCGCGACC CGGCTATGGC GGCTTACCTG
GTCAATCAAC TGCGTACGCA GGCCATTGAA GTTCATCAGG CGGAGTCGGG CAAGAACAAA
GGCGATTATG TCGTGTTGCT GAACCAGCCC TACCGCAATC TGGCCGTTTC ACTGCTAACG
AAGCAGAACT ACCCGAAAGA AGCGAAATTT CCCCCCTACG ACGATATTGC GTGGACGCTG
GGTTACCTGT ATGGGGTGGA CGTAAAAGCC GAAGATAGCG TCAACTATGT ACCCAGTGAG
CTTAAACTCC TGAGCGAGAA TGTTAATTAT GCGGGGACGA TGGAGGGAGA GGGAACAAAC
TATGTTCTCA ACTACAAAGC CCAAACCAAT GTGCTGCCCG CCCTGCTTTG GCTGAAAGGG
CAGAGCAAGC AGGCAAAAGC CGTTGTGCTC GATACCAAAG CTACGTTCGG CGGACTAAAA
GACACGCTGT CGGCGGGTGC AGTGGTGTTC AAAGGACTCA CAGGCGATCA GGCCAAGAAA
CTGGCAGCGC AGTTTGGGCT GGATTTACAG GCAACAAAAG CGGAGCCCAT GGGTGTTGGA
TCGCCCTTGC GGCAGCACGA GGTTACTCTG CCTCGGGTGG CCATTTACCA CAGCTGGTAC
AACACGCAGG ATGAAGGCTG GGCGCGGTAT ACGTTCGAGC AGCGCGGTAT TCCATATATG
TCCATCAACA AAGACCACCT GAAAGCGGGC GATTTGCGGA AGAAGTTCGA TGTGATTCTC
ATTCCCCGGA TGCGCGGCAC ATCGACCAAC TTTATCCATG AAATCGACAA ACGATTCGGT
CCCCTGCCGT ACACCAGAAC GCCCGAGTTT CCATCGCATG GTTTCCCGGA CGCCAGCAGC
GATATTACCG GTGGGCCCGG ATTCGACGGC GTCGATAAAC TGAAACAGTT CGTCGAACAG
GGCGGTGTGC TTGTCACGCT TGACAATTCA TCGCTCATGG TTGCCGAAGC GGGCATCACC
CGCGATCTGG ACGAAGTGGC GGCTCCTACG CTGTTTCATC CGGGCTCCAT CGTAACGGTG
AAAAACCGGC GTCCCGATAG CCCCGTTATG TACGGCTTTC CCGAAATCTT TCCCATTTTT
CGGGGAATTG CTCCGTTGCT GCAAACAAAA AAGCATAACC GCGACATGAT GCTGATGCAG
TATGGCACCA AACCGCTCAA AGACGAAGAA GAATACAAGG GACTGATTAT GGGCATGCCC
GATAAAAAAC CGGCTAAAGA AGCGAAGGCG ACACCCAAAA AAGAAGACCC GTATGTGGTG
TCAGGGATGG TTCGCAATGA GCAGACGATC ATTGGGCATG GCGGGATTTT CAACGTGCCG
GTGGGTAGCG GCCGGGTCAT TGCTTTCACC TTCGATCCAC TGCATCGGTA CCTCAACCAC
CACGACGCCC CGCTACTCTG GAACGTGCTG ATCAACTGGA ATCATCTGGA TACACCGCCC
GTATCGGCCA CAGCAGACAC CGAAACACCG AACCGGGCCA ATTCACCAGT CATAAAGACA
GGAGATAATT AG
 
Protein sequence
MGRSGLQHRN DLSIFLNQSA NYMKIFTDKL PLRSALVTGL ALGSLSVQAQ KIDELYNQKI 
REYTTDARFL PASVLNLPDD PKIPSPLKHF GQIVGTPGVI HRTPEIYGYY QKLAQTSPNI
SVQQVSTTEE GRPIQLVVIG SEDAMKRLDH YKKQLALLAD PRKVGSQDVE KILGDTKLVY
YLNGGLHSPE MGSPEMLMEL AYRLVTSQSP EIKTIRDNII VLINPVSEPD GWDKQVDWYN
RYTKGRKEYD DGFPKSPPYW GKYTYHDNNR DGLQASQELT KALYKIFYEW HPTASLDLHE
SVPLLYISTG TGPYNETIDP ITIGEWQIMA HHDITTLASQ GVPGVFTWAF YDGWYPGYAL
WISNNHNAVG RFYETFGNAG ANTYLRDLAE QKYAGDPATT KEWYRPVPPT EKVYWSYRNG
INYMQAGVLA SLSYGATNSR LLLKNFYQKG LNNIKKGTEE TPRAFVIPKN QRDPAMAAYL
VNQLRTQAIE VHQAESGKNK GDYVVLLNQP YRNLAVSLLT KQNYPKEAKF PPYDDIAWTL
GYLYGVDVKA EDSVNYVPSE LKLLSENVNY AGTMEGEGTN YVLNYKAQTN VLPALLWLKG
QSKQAKAVVL DTKATFGGLK DTLSAGAVVF KGLTGDQAKK LAAQFGLDLQ ATKAEPMGVG
SPLRQHEVTL PRVAIYHSWY NTQDEGWARY TFEQRGIPYM SINKDHLKAG DLRKKFDVIL
IPRMRGTSTN FIHEIDKRFG PLPYTRTPEF PSHGFPDASS DITGGPGFDG VDKLKQFVEQ
GGVLVTLDNS SLMVAEAGIT RDLDEVAAPT LFHPGSIVTV KNRRPDSPVM YGFPEIFPIF
RGIAPLLQTK KHNRDMMLMQ YGTKPLKDEE EYKGLIMGMP DKKPAKEAKA TPKKEDPYVV
SGMVRNEQTI IGHGGIFNVP VGSGRVIAFT FDPLHRYLNH HDAPLLWNVL INWNHLDTPP
VSATADTETP NRANSPVIKT GDN