Gene Slin_3390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3390 
Symbol 
ID8727143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4094240 
End bp4097692 
Gene Length3453 bp 
Protein Length1150 aa 
Translation table11 
GC content56% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003388197 
Protein GI284038267 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.532087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.341167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA AGTATCTTCT CCTTTCTATG CTTATGGGCG GAGTCTACAC CCTGCCGCAA 
CCCAGCGTTG CGCAGGTACT GACACGCGCC CAGCCAGCCT ACGAAAGCAA CGCCCGAGGG
CAACAAGCTG AGGCACCGGC TGGTCAGCAC AGCCGCCGGA GCCGTTCGCT CAAAAAAGCC
CTGTCTGAAC TCGAACGACG CTTCAACGTC AACTTCGCCT ATGATGAACA AGTCGTGGCC
GGTCGGCAGA CCGACACCGA CCTGGAAGGC CTCTCGCTCG AACAGGCGCT CAATCAACTG
ATCGAATCGC AGGGGCTGCG CTACCGAAAA GTGAACGCCA ACCTGTTTGC CATTCAGGCG
GCACCCGCCA AAAGAGTGGG CATGGCCCTG CCTGCCGAAA GCCCCGCTCC TGCGCTGGCA
ACCGTTAGTA ACATACCGAC CGGCGTAACG GAAGCCCCAC AGCTTAAGCG CATCACGGGG
CAAATCACCG ACGAAAACAA CCAGGGATTG CCCGGCGCCA ACGTCATTGA AAAAGGAACG
ACGACAGGTG CCACAACCGA TGCCAACGGC AATTTTGTGC TGAATGTCAG CGACAATGCC
ACGACGCTAA CGATTTCCAG CGTGGGCTAC GCCAGTCAGG ATGTAGCGAT CGGCAACAAC
ACGGTCGTGA ACGTTAAACT CGTTCCCGAC GACAGAACTC TGAATGAGGT AGTCGTGATC
GGTTACCAGA CTGTTCGTAA GCGGGACGTA ACCGGTGCCA ATGACGTGAT CAGCCCCGCG
CAGGCCAACA AAGTAACGGC CAACTCGCTG GCGGAATCCA TTCAGGGGCT ATCGCCGGGG
GTTACGGTGC GAAACACGGG TGTACCAGGA CAGCAGGCGT CGATTCAGAT TCGCGGGGTG
GCCAGCTTCC TCAATACCGA CCCGCTGTAC ATCATCGACG GGATGATTGC CGATGCCAAC
CCAACCATCA ACAACGACGA TATTGAGTCC ATTCAGGTGT TGAAGGATGC CTCGGCTGCG
GCCATTTACG GGTCGAGGGC GGCCAATGGT GTTATCATCA TCACCACCAA ACAAGGCAAA
GAAGGCCCCA TCCGGGTAGG TTTTTCGGCC AAATACGGGA TACAGCAGGT GTATAAGCGC
TGGGATGTGA TGGACGCGGC AAATTTTGCG GCTACCCAAC GGACGCAATA CCAGAACGCC
GGACAAACAC CCCCGTCGAG TGTGGCTACC GGTACGTTCA ATCCGAACAT CAACACCGAC
TGGCAGGATC AGGTACTGCA AACGGGTAAT CTTCAGGATT ACAACCTGAC ACTGTCGGGT
GGTACCAAAA CCGGTACGTA TCTGATTTCA GGCAGTTATT TCAGCAACAA AGGCTACGTA
ATTGGCAGCG GCTTCGACCG GGCCAGTTTG CGCATCAACA CCAAAAGTCA GCTTGGACGC
TTCACGTTTG GCGAAAACGC CCTGCTGACG AACTCAAATA TTCAGAATTT TCCGGCAGGC
AACCCTATTT ACGACATGGC CGGTATGCTG CCCGTTATTC CGGTGCAGGA CCCCAGTTAT
ATCGACGCCA GCAACCCGGG GGGCTATGGC CTTGGCCGCA ATCCGGATGC CGTTACCTAC
GCCTTCAACC CGGTAGCAGT TCGTAATCTG GCCAGCAACA AAAGCAATTT CGCCAAGCTG
GTCGGCAACG CGTACCTCGA TGTAAAACTG ACCGACTGGC TGACCTACCG TGCCAACGCG
GGACTGGAGG TGAGTTTCGA TTACACCCAG AACCTGCGTC GGCTGGGCAT CTACCAATAC
AGTGCCTCGC CGGTGCCCAG TTCGGTTGGC GAAGATCGGT CGCGTTACCT CAGTATGCTG
TTCGAGCACA CGCTGAACTT CAATAAAGTG TTCGGTATGC ACAACATCAA CGGTGTGGTG
GGTATCAGCC AGCAAACGAC CCGGCGCGAC ATCACGTCCG GCTCGCGCAC CAATCTGGGC
TTTGCGGGTG GTCAGTATTT CAATACCATC AATGCCGCCA CGGGGATTTC CAACTCGGCT
GGCGGTACGC CCGACGATTA CCGGATTCTG GGGTACATTG GCCGGGTCAA CTATACCTAT
AACGACCGTT ATCTGCTAAC GCTGACTGGC CGGGTCGATC AGGACTCACG CTTCGGGGCA
AACTACCGAA CGGGTTTCTT CCCGTCGGTG GCCGCAGCCT GGCGCATTAG CGAGGAAAAA
TTCTTTAACG TCGACTGGAT TACTGACCTG AAGCTGAATG CCTCTTATGG TAAGCTGGGT
ATCGTGGTGC CCACGCTGGG CTCGTTCCCC TATACGGCTT TCATCAACAA CAATCCGAGA
ACTATTTTCG GAGTCGACCA GACACCCTTC GTAGGAGCCT ATCAGGCGCA GCTGGCCAAT
CCCGACCTGC GTTGGGAAGA GCGAATCCAG CAGAACTACG GCGTGTCGGC AAGTTTTCTG
AGAAATCGGA TCACGACGGA GATCAACATC TATAACTCAC TCTCGAACGA TGCGATTCTG
AACCTGGCCG TGCCGGGTTA TCTGGGTAAC CTGCGCGGTA ATCCATACGT GAATACGGCT
TCTATCCGCA ACCGGGGCGT CGAGTTTTCG GCCACTTACC GCAACAATGA GCATGCCCTT
AAATGGGATG TATCCGGCAA CTTTACGACC ATCAAAAACC GGGTTGAAAA CGTAGGTAAC
CAGGGGCAGA ACATCAACTA CATTCAGAGC GGCAACACAC GCTCGCAGGT TGGTCAGGCC
GTTGGGCAGT GGTACGTGCT GAAAACGGCG GGCCTCTTCC AGAATCAGGA GGAGATCAAT
AATTACAAAG GAGCCAACGG AAACCTCATT CAGCCGAATG CCAAACCGGG CGATATTAAG
TATGTCGATA CCAACGGCGA CGGGCAGATC ACCCAGAACG ACGACCGTCA GTTTGTGGGT
TCGCCCTGGC CCAAGTTACA GGCGGGCGCA CAGGCCAATG CCTCCTACGG GCAGTTTTCG
CTCAATGTGC AGTTGACCGG CGTGTTTGGC TACACCGTCT ACAACGACGT ACGGCGTGGA
CTGGACAGCT ACCAGCTGAC CAACTTCCGC ACCAACATCA GTCCGTGGAC GACGACCAAC
ACCAGCACAA CCGACCCTCG TCTGGGCCTG GAAAGTGGCG ATCAGGGGAT TATCTCGAAT
AACTTCGGAT ATACCGACCG CTGGCTTGAA AATGCCTCCT ACGTTCGGGT ACGCAACGTC
GAGATTGGCT ACACGCTGCC GAAGAACCTG CTCAACACAC TTAAAATCCG CAATGCCCGT
GTGTATGTGA GTGGTCAGAA CCTGTTCACC ATCACCAAAT ACACCGGCCT CGACCCCGAC
GTTACGGGTG CAAACATTCA GGAGCGGGGC GTTGATCTTG GCCACTGGCC GTCACCCCGC
GTTGTATCCG TTGGTGTCAA CTGCGATTTC TAA
 
Protein sequence
MKNKYLLLSM LMGGVYTLPQ PSVAQVLTRA QPAYESNARG QQAEAPAGQH SRRSRSLKKA 
LSELERRFNV NFAYDEQVVA GRQTDTDLEG LSLEQALNQL IESQGLRYRK VNANLFAIQA
APAKRVGMAL PAESPAPALA TVSNIPTGVT EAPQLKRITG QITDENNQGL PGANVIEKGT
TTGATTDANG NFVLNVSDNA TTLTISSVGY ASQDVAIGNN TVVNVKLVPD DRTLNEVVVI
GYQTVRKRDV TGANDVISPA QANKVTANSL AESIQGLSPG VTVRNTGVPG QQASIQIRGV
ASFLNTDPLY IIDGMIADAN PTINNDDIES IQVLKDASAA AIYGSRAANG VIIITTKQGK
EGPIRVGFSA KYGIQQVYKR WDVMDAANFA ATQRTQYQNA GQTPPSSVAT GTFNPNINTD
WQDQVLQTGN LQDYNLTLSG GTKTGTYLIS GSYFSNKGYV IGSGFDRASL RINTKSQLGR
FTFGENALLT NSNIQNFPAG NPIYDMAGML PVIPVQDPSY IDASNPGGYG LGRNPDAVTY
AFNPVAVRNL ASNKSNFAKL VGNAYLDVKL TDWLTYRANA GLEVSFDYTQ NLRRLGIYQY
SASPVPSSVG EDRSRYLSML FEHTLNFNKV FGMHNINGVV GISQQTTRRD ITSGSRTNLG
FAGGQYFNTI NAATGISNSA GGTPDDYRIL GYIGRVNYTY NDRYLLTLTG RVDQDSRFGA
NYRTGFFPSV AAAWRISEEK FFNVDWITDL KLNASYGKLG IVVPTLGSFP YTAFINNNPR
TIFGVDQTPF VGAYQAQLAN PDLRWEERIQ QNYGVSASFL RNRITTEINI YNSLSNDAIL
NLAVPGYLGN LRGNPYVNTA SIRNRGVEFS ATYRNNEHAL KWDVSGNFTT IKNRVENVGN
QGQNINYIQS GNTRSQVGQA VGQWYVLKTA GLFQNQEEIN NYKGANGNLI QPNAKPGDIK
YVDTNGDGQI TQNDDRQFVG SPWPKLQAGA QANASYGQFS LNVQLTGVFG YTVYNDVRRG
LDSYQLTNFR TNISPWTTTN TSTTDPRLGL ESGDQGIISN NFGYTDRWLE NASYVRVRNV
EIGYTLPKNL LNTLKIRNAR VYVSGQNLFT ITKYTGLDPD VTGANIQERG VDLGHWPSPR
VVSVGVNCDF