Gene Slin_5340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5340 
Symbol 
ID8729105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6503234 
End bp6505678 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003390108 
Protein GI284040178 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.640858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACATA TACTACCATT TTGGTTGTGC AGCCTGCTCA TTTACTTTAT GCCGTTACAG 
TCCGGCAGCG CACAGGCTTC ACTAAATACA GCGGGCAACA GTACGCAGGC AAAAGGAGTG
CGGGGTGCAG CGCTGACCAT CAGCGGCTAC ATAAAAGATG CCGGTAATGG CGAAGGGCTC
ATTGGTGTTT CTGTTTATGT GAAAGAAACA GGCACTGGCG CCGTCACGAA CAGCTATGGC
TTTTATGCCG TTACGCTCCC GGCGGGCAGT TACAACCTCG TTATCAGCTA CGTTGGCTAT
ACCCGACAAA CCCGCACGGT CGATCTTGTG GACCGGAACG TACGCCTTGA CCTCGAACTG
AGTCAGGAAG GCAAGCAGCT TCAGGAAGTG GTTGTCTCCA CCAAGCGGGA GGACGACAAT
GTGAAAAACA TTGAAATGAG CGTCAACCGG ATTGACGTGA AAACCCTGCA ACGGATTCCA
GCCCTGCTGG GCGAAGTCGA CGTGATCCGG AGCATTCAGT TGCTGCCGGG CGTATCGACG
GTGGGCGAAG GCGCTACCGG TTTCAACGTT CGGGGCGGTA GTATCGACCA GAATCTGGTG
CTGCTGGACG AAGCCCCGGT CTATAACTCC TCGCACCTGT TTGGCTTCTT CTCGGTCTTC
AACCCCGACG CGGTCAAAGA CGTGAAGCTC ATCAAAGGCG GTATCCCGGC TAATTACGGT
GGCCGAATTG CCTCTATTCT GGATGTGCGT CTGAAAGAAG GCAACGCTAA GAAACCTGAA
CTCAACGGGG GTATTGGACT TATTTTCAGC CGTTTGTCGT ACGAGCGACC GCTTTTCAAC
GGCAAAGGCT CGTTTATCGT AGCCGCCCGG CGTTCTTACG CCGACGTGCT GGCCCAGCCA
TTCCTGACGG GCGACCTGAA AGGGGCCAAG TTTTATTTTT ACGACCTGAC GGCGAAGGGT
AATTACCGCA TCAACGATAA AAACACGGTG TTTCTGTCGG GCTATCTGGG CCGCGATGTA
TTCGGGTCGG ATTTTGGGTT TAACTGGGGG AATACAACCC TGTCGGCCCG CTGGAACCAC
GTTTTCAGCG ACCGGCTTTT CCTGAACACG ACGGCCTATT ACAGTAACTA CGACTACTCG
CTCAACTCGG ACCTGAAAGG GAAACGACCC AACGATTTCT TCCGAACCGA TTCCCGGATT
GTCGACTACA GCCTGAAACC GGATTTTTCG CTGTTTTTGG GTAAAAACAC CATCACCTTT
GGCGGTCAGG TCATCGCGCA TGATTTTCAG CCCGGCACCG CAACAGCCGC CAGTTCGGGC
AGTGTCCGGA CGTTTGGCAT AGCCAGTAAA CGAGCTATGG AAGCGGCTCT TTATGTGGGC
AATGAGCAGC AGCTGACCCC GAAGCTTCAG TTGCAATACG GACTGCGCTA TTCCCTATTT
AATTATGTTG GCGAAGGCGA AGCCTATACC TTCCGCACCG ATGTGCCGCT GGGCAGTCGC
CGTGAGGTGA TTACCACCGT AGGGTATCGG GGCGGAGAGG TGATCAAGAC CTACGGCAAC
TGGGAACCCC GCTTTTCGAC CAAGCTGGAT TTGTCGGATA ACAGCTCCCT GAAATTCAGT
TATAACCGCA TGGCGCAGTA CATCCACCTG GTTTCCAATA CAACAGCCTC GACGCCACTG
GATATCTGGA CACCATCGAC CAATAATATC AAACCGCAGA TTGCCGATCA GGTTGCCGGG
GGATATTTTA AGAACTTTGG CCGCTCCAGC GGAACGGGCA GCGAATTCGA AGCCTCGGTA
GAAGTATATT ACAAATGGCT GCAGAACCAG ATTGACTACA TTGACGGGGC CAGCCTGATC
CTGAATAAAT ACCTTGAAGG TGATTTGCTG AGCGGCAAAG GCCGCGCTTA CGGTGCTGAG
TTCTATGTAA AACGAAACAC GGGCGTCGTA AACGGCTGGA TTAGCTACAC GCTGGCCAAA
ACCGAGCGGC AGGTGGACGG CATCAACAAC AATAACTGGT ACCCCACGCG CTTCGACAAG
CGACACACGC TGACATCGGT ACTGCTGTTC GACCCGCCCC ATGCCAAGCG CTGGAACTTC
TCGGCAACCT TTACGCTGGC CAGCGGAACG CCCGGCACGT TCCCCACCAA TCGCTTTGAG
TACCAGGGCT ACGTAGTGCC GCAGAATACC GACAATGCCC GGAACAATTA CCGGATTCCG
GCGTACCACC GACTCGATCT GGCGGCTACC TTGCAGGGGC GTAAACGTCC GGGTAAGCGC
AAAGACGATA ACTGGGTATT CTCGATTTAC AACGTCTATG CCCGCAAAAA CGCCTTTTCA
GTCTATTTCC AACCCAATGC AGATAGTCCC CGCGTAACGG AAGCCATTCG CTATTCAGTT
TTTGCGACCC TGATTCCGTC CGTAACGTAC AACTTTAAAT TATAG
 
Protein sequence
MKHILPFWLC SLLIYFMPLQ SGSAQASLNT AGNSTQAKGV RGAALTISGY IKDAGNGEGL 
IGVSVYVKET GTGAVTNSYG FYAVTLPAGS YNLVISYVGY TRQTRTVDLV DRNVRLDLEL
SQEGKQLQEV VVSTKREDDN VKNIEMSVNR IDVKTLQRIP ALLGEVDVIR SIQLLPGVST
VGEGATGFNV RGGSIDQNLV LLDEAPVYNS SHLFGFFSVF NPDAVKDVKL IKGGIPANYG
GRIASILDVR LKEGNAKKPE LNGGIGLIFS RLSYERPLFN GKGSFIVAAR RSYADVLAQP
FLTGDLKGAK FYFYDLTAKG NYRINDKNTV FLSGYLGRDV FGSDFGFNWG NTTLSARWNH
VFSDRLFLNT TAYYSNYDYS LNSDLKGKRP NDFFRTDSRI VDYSLKPDFS LFLGKNTITF
GGQVIAHDFQ PGTATAASSG SVRTFGIASK RAMEAALYVG NEQQLTPKLQ LQYGLRYSLF
NYVGEGEAYT FRTDVPLGSR REVITTVGYR GGEVIKTYGN WEPRFSTKLD LSDNSSLKFS
YNRMAQYIHL VSNTTASTPL DIWTPSTNNI KPQIADQVAG GYFKNFGRSS GTGSEFEASV
EVYYKWLQNQ IDYIDGASLI LNKYLEGDLL SGKGRAYGAE FYVKRNTGVV NGWISYTLAK
TERQVDGINN NNWYPTRFDK RHTLTSVLLF DPPHAKRWNF SATFTLASGT PGTFPTNRFE
YQGYVVPQNT DNARNNYRIP AYHRLDLAAT LQGRKRPGKR KDDNWVFSIY NVYARKNAFS
VYFQPNADSP RVTEAIRYSV FATLIPSVTY NFKL