Gene Slin_4561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4561 
Symbol 
ID8728325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5531002 
End bp5533476 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003389339 
Protein GI284039409 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0251417 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAT TTTTCCTACT GATTAGCCTG ACGCTAACCG GCGAACTGGC CTATACACAA 
CCCGCGCCAA GCCGAATTAT TCGTGGTTCG GTAACCGACG CTGACAGCCG CAAACCTATT
CCCTTCGGCA CCGTCAACCT CGTCGGTCAG AACAAAGGGG CTATTACTGA TGCTCGTGGG
CAATATTCGC TGTCCATCAA AGCCGATTCG GTGCTGCGCG ACCTGATTAT TTCCTGCGTC
GGCTTCAAAT CCGATACGGT ACGGATCAGC CCCGGCACTA ACGTATATGA CGTAACGCTA
CTGCCCATCG TTAACGCTTT AAATGAAGTG GTAGTGACGG GCGTAACGCG GGGAACCCTC
ATGAAGCAAA ACCCGGTCGC TATTCTGGCC ATTTCGCGTA AGTCGATTGA GAGTACAACC
AGTAGCAACA TCATCGACGT ACTGGTGAAA AATGCACCGG GACTGACAGC GGTTAAAACT
GGGCCTAATA TTTCTAAACC ATTTATCAGG GGGCTGGGCT ATAACCGGGT GCTGACGCTT
TACGACGGGG TTCGGCAGGA GGGCCAGCAG TGGGGCGATG AGCATGGCGT GGAAGTTGAC
AATTACAATA TCGACCGGGC CGAGGTTATC AAAGGCCCGG CCAGCCTGAT GTACGGCTCC
GACGCCCTGG CAGGTGTTGT GAGTATGATG CCGATCTATC CAAAAGACAC CGATGGCAAG
CTGAAAGTAG GTTTCGTGAC CGAATACCAG TCGAACAACC GGCTCCTGGG CGAGTCAATC
AGTCTGGCCT CCGGCGGGTC GCGCTGGGCC TGGAATATGC GCGGGTCCAT ACGGGCGGCC
ACTAACTACC AGAATAAAGT TGACGGCCGG GTATACAACA CTGGCTTTTC GGAACGAACG
CTCACCACTA TGCTGGGCTA TTCGGGCCAC CGGGGTTATT CGCGTTTCGG GGCTTCGCTC
TACGACAATT TGCAGGGTAT TCCCGACGGA AGCCGCGACT CGCTTACCCG CCAGTTTACC
CGTCAGGTGT TCGAATCGGA CCTGGACGAT ATCAAGAACC GACCCATCGT ACCCGCCAGC
GAGTTATCGT CATACCGACT TAGCCCGTTA ATTCAGCACA TTCAGCATTA CCGGCTACAC
ACGAACAACC ATTATCAGAT TGGTAATGGC GAACTGGACG TGTTGCTGGC GTTTCAGCAG
AATGTTCGGC GGGAGTTCAA TCACCCCACT CAACCCAACC AGCCGGGTTT GTACGTCCGG
CTGAACACCC TGAACTATGG CGTACGGTAT AACCTGCCGA ACGTGGGTCG ACTGGCTACA
ACCGTGGGCG TGAACGGCAT GGCCCAGTCC AACAAGAATA AGAACGGTAC GGCCTTCCCC
ATTCCGGACT ACAAACTGTT CGATGTCGGC ACGTTCGTTT TTCTGAAATA CCAGGCCGAT
AAACTCACCC TGAGCGGTGG CCTTCGCTAC GACAACCGGC ACCTGACCGG CGATGATTTT
TACGTAGGTG TCGACCCAAA AACCGGCTTT GACAAACGCG CTTTTTTACC GGATACTGCC
GGAGCCACAC TTCAGTTTCC CCGCCTGAAG CAAACATTTA CGGGTATTTC CATGAGTATG
GGCGTAACCT ATGAGTTCTC GGAGAGACTG GCCCTGAAAG CAAATATTGC CCGTGGATAC
CGTGCGCCGA GCATTACGGA AATTGCCTCC AACGGTCTCG ATCCCGGTGC GCGTATCGTG
TACATCGGCA ACCGAGACTT TAAGTCGGAG TTCAGTTTAC AGCAGGACAT TGGCCTCACC
GCCACCTTCC CAGACATCAA TTTCGGTGTC AGCGTATTCA ACAACTTCAT CCAGAATTAC
ATATCCCTGA CACAGCTGGT CGATGCACAG GGTGAGCCGG TGGTGATTGT TCCGGGCAAC
AAAACGTACC AGTACCAGCA GTCATCGGCT CAGTTGTACG GTCTTGAAAC GCAGCTCAGT
TTGCACCCCA CTACCTGGCG GGGATTCAGT TTCGACAACA GTCTGGCGGT GGTGTACGGC
TACAACCGGG GCAGCCGGTT TACGGATGCG GGCGTCAACG GCGAGTACCT GCCGTTTATT
CCACCCCTGC GCGTAACCAC CGGCATCAGT CAGGCCCTCC CGCTTAAACG CAGTTGGTTG
TCCGAATTGA CGCTAAAAGC AGATGTTGAA CACAACGCCC GGCAGGACCG CTTTCTGGGC
CTGAACGACA CGGAAACGGC CACGGCGGGT TTCACCCTCG TCAATGCCGG TGCTGATGCG
CAGCTACATG TCGGCAAGGA CAAACCCGCG TTGCACGTCA TATTTCAAGT CAATAACCTG
TTCGATGTGG CGTACCAGTC GAACCTTAGC CGGTTAAAGT ATTTCGAGTA TTTCACCCAA
TCGCCCAACG GACATCTGGG CATGTACGGC ATGGGTCGGA ATATTTGCTT AAAACTGGTA
GTGCCGTTCA ATTAA
 
Protein sequence
MKSFFLLISL TLTGELAYTQ PAPSRIIRGS VTDADSRKPI PFGTVNLVGQ NKGAITDARG 
QYSLSIKADS VLRDLIISCV GFKSDTVRIS PGTNVYDVTL LPIVNALNEV VVTGVTRGTL
MKQNPVAILA ISRKSIESTT SSNIIDVLVK NAPGLTAVKT GPNISKPFIR GLGYNRVLTL
YDGVRQEGQQ WGDEHGVEVD NYNIDRAEVI KGPASLMYGS DALAGVVSMM PIYPKDTDGK
LKVGFVTEYQ SNNRLLGESI SLASGGSRWA WNMRGSIRAA TNYQNKVDGR VYNTGFSERT
LTTMLGYSGH RGYSRFGASL YDNLQGIPDG SRDSLTRQFT RQVFESDLDD IKNRPIVPAS
ELSSYRLSPL IQHIQHYRLH TNNHYQIGNG ELDVLLAFQQ NVRREFNHPT QPNQPGLYVR
LNTLNYGVRY NLPNVGRLAT TVGVNGMAQS NKNKNGTAFP IPDYKLFDVG TFVFLKYQAD
KLTLSGGLRY DNRHLTGDDF YVGVDPKTGF DKRAFLPDTA GATLQFPRLK QTFTGISMSM
GVTYEFSERL ALKANIARGY RAPSITEIAS NGLDPGARIV YIGNRDFKSE FSLQQDIGLT
ATFPDINFGV SVFNNFIQNY ISLTQLVDAQ GEPVVIVPGN KTYQYQQSSA QLYGLETQLS
LHPTTWRGFS FDNSLAVVYG YNRGSRFTDA GVNGEYLPFI PPLRVTTGIS QALPLKRSWL
SELTLKADVE HNARQDRFLG LNDTETATAG FTLVNAGADA QLHVGKDKPA LHVIFQVNNL
FDVAYQSNLS RLKYFEYFTQ SPNGHLGMYG MGRNICLKLV VPFN