Gene Slin_4641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4641 
Symbol 
ID8728405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5649493 
End bp5652003 
Gene Length2511 bp 
Protein Length836 aa 
Translation table11 
GC content51% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003389418 
Protein GI284039488 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0388606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGAA GCTATATCAA AACATCGAGT CGCAACTTAA TGCGTAACAA GCTGTTCTCG 
TCCATCAATA TTGTTGGCCT TGCCATTAGT ATGTCTGTTG GGCTATTGCT GATCGCGTTC
ATGCTCGATC TGTATTCGTA CGACAGATTT CACCAGAATG GGGAGCGGAT TTACCGCATC
ACCAGCATAC AAACCTCCAA TCAGGAAGAA CGTCAGTCCG GTCAGTCAAA TCCGGACCGA
GCGCGGTCGG GAGCCAAGTT TGCCACCACT TCTTTAAAGA TCGGAAAGCT AATCCGGCAG
AAGGTGACCG GCGTTGACCG GTACGCTGGT GCGGACGTGA CTATTCTACA CAACGACTTT
TCGCAGGACG CGCAGGTTGG CTCTTCCGTT GTTCCCATCA AGGGTTTCTG GGCGGAGCCG
TCTGTATTCA GAATTTTCAC CTTTCCGATG CTGGAAGGCA ATCCCGAAAC AGCGCTGAAA
GATCCGTACT CGATCGTTCT TACAGAGACG GCAGCCAAAA AGCTGTTCGG CAATGAATCA
GCACTCGGCA AGGCGATCAA ATTCGATACG CTCTCGTATC AGGTGACCGG TGTCATGAAG
GACGTTCCCT TCTTTTCGCA TATTCATTTC GAAGCCCTGG TATCACTGTC GACGGCCGAG
CAGCTCAACC GGAACAACTT CGAGAAATGG GCAAGTATGC CGTCGAACTC CGTATACCTC
CTACTGCCGG AAACTGCCAA TATGGCCTCA ATCCAGTCGC AGCTCGACGC CGTTGCCAGG
GAGGAAAATC GCGCCGACGA AAACACGAAG ACCCAGCTTG AGCTAATGCC TTTATATAGT
GTCGTGGTCG GCGAAAGCCT CCGTCAAGCC GAAGGGGGGC CTGGCGTTGG GGGGCCACAC
ATGCCACCAA CGGTGCTTTG GATACTCGGC GGGCTTGCCC TCATTGTAAT CCTGTCGGCG
TGTTTTAACT ACACCAACCT GTCGATGGCC CGCGCCATGC GCCGATTCAA GGAAGTAGGG
CTTCGCAAAG CGATTGGTGC TGATAAACGT CAGGTATGGC AGCAGTTTCT GGTTGAAGCC
GTCATGATAT CCCTGGCGGC CCTTGTTCTA TCCTACTTTA TCTTTCTCCT GTTGCGACCA
CAGCTGATTA ACCTGGCTCC GGAGTTGCAG CGCACAGTGA AGCTCGAACT TAGTCCGGCT
ATGGTCATCG CCTTCGTCGT CTTCTCCATT ACCGTAGGAG TTATTGCCGG TATCATGCCC
GCTCTGTTCT TTTCGAAAGT CAGCGCGATC AATGCACTCA GGAACGTATC CACCCGGAGC
CTCGTCGGCG GAGTATCACC ATCGTTCCTG TCAATAAACG TGTTCAAACA CGCAACACTC
CGGCAGGCGC TGGTGGTCAT TCAATACACG CTTACGCTAA TTTTTATCAC AACAACCGCC
ATTGGCTATG TGCAGTATAA GAACATCCTG AAATTCGACC TGGGATTCAA TACCCAGAAC
ATTCTGAATA TCAACATGCA GGGTAATAAA CCCGATGCAT TTCTGAAAGA CCTTGGCGAG
ATGCCGGAGG TAACGGCGCT GTCGCGGTCG CTCATTATCA CCAGCGTCGG CAATGCATGG
GGCGGCTACA TGAAATATAC AGATTCGCGC GACTCGGCGC TGGTGCTGAC GAACAACGTC
GACGAAAACT ACCTGGCGCT GCACGAATAC AAACTTATTG CCGGGGGTAA TTTTAAAACA
AGGCCCACAA CAGCCGAAGC CGTCAGCGAA GTGATCGTTA ACCAGCAAGT TTTAAAACGA
TTTAACATTG CTGACAACGA CCCCCAAAAA GCGATTGGGC AGGAGATCAC ATTCAGCAAT
TTCAGCGGAA CACGCCGGAT GACCATTGTG GGGGTTATGA AAGACTTTCA CTATGGCAAG
GTTGACAATC TCGTCGGGCC GGTAGCTTTC ATGAGCTGGA CACCCGGCGA CAGGGCCATT
ATCAATGCCA AAATACAAAG TACTGACCTG CTGGCAACCA TGGCCAGGAT TGAGTCGGCC
TGGAAAAAGA TCGACCGTGT TCATCCTTTT CAGGCCAAGT TCTATGACCA GGAAATCCAG
GACGCTTACA GTGAGTTTTC TGCGATTATC AAGATCATTG GCTTCCTTTC CTTCCTGGCC
ATTTCGATTG CTTCGATGGG TCTGTTCGGC ATGGTGGCCT ACACAACCGA AACCAGACTG
AAAGAAATCA GCATCCGCAA GGTAATGGGA GCAAGCTCCG TCAACCTTAT TTTCTTGTTG
AGCCGTGGTT TTCTCCTGCT ACTGTCGATT TCGGCACTTA TCGCACTCCC CATCAGCTAT
CTATTCTTCA AAAACGCTGT GCTCACCCAC TTCCCGTATC ACACCCCCGT TCAGATCGCC
GAGCTATTCG TGGGCTTGCT GGTAGTATTG CTGATCGCCT TCATTATGAT CGGCTCGCAG
ACGGTAAAGG CCGCAAAGGC GAATCCGGTA GACGTCCTGA AGAGTCAGTA A
 
Protein sequence
MIGSYIKTSS RNLMRNKLFS SINIVGLAIS MSVGLLLIAF MLDLYSYDRF HQNGERIYRI 
TSIQTSNQEE RQSGQSNPDR ARSGAKFATT SLKIGKLIRQ KVTGVDRYAG ADVTILHNDF
SQDAQVGSSV VPIKGFWAEP SVFRIFTFPM LEGNPETALK DPYSIVLTET AAKKLFGNES
ALGKAIKFDT LSYQVTGVMK DVPFFSHIHF EALVSLSTAE QLNRNNFEKW ASMPSNSVYL
LLPETANMAS IQSQLDAVAR EENRADENTK TQLELMPLYS VVVGESLRQA EGGPGVGGPH
MPPTVLWILG GLALIVILSA CFNYTNLSMA RAMRRFKEVG LRKAIGADKR QVWQQFLVEA
VMISLAALVL SYFIFLLLRP QLINLAPELQ RTVKLELSPA MVIAFVVFSI TVGVIAGIMP
ALFFSKVSAI NALRNVSTRS LVGGVSPSFL SINVFKHATL RQALVVIQYT LTLIFITTTA
IGYVQYKNIL KFDLGFNTQN ILNINMQGNK PDAFLKDLGE MPEVTALSRS LIITSVGNAW
GGYMKYTDSR DSALVLTNNV DENYLALHEY KLIAGGNFKT RPTTAEAVSE VIVNQQVLKR
FNIADNDPQK AIGQEITFSN FSGTRRMTIV GVMKDFHYGK VDNLVGPVAF MSWTPGDRAI
INAKIQSTDL LATMARIESA WKKIDRVHPF QAKFYDQEIQ DAYSEFSAII KIIGFLSFLA
ISIASMGLFG MVAYTTETRL KEISIRKVMG ASSVNLIFLL SRGFLLLLSI SALIALPISY
LFFKNAVLTH FPYHTPVQIA ELFVGLLVVL LIAFIMIGSQ TVKAAKANPV DVLKSQ