Gene Slin_3978 details

Gene Information

Locus tag: Slin_3978
Symbol:
ID: 8727736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4778305
End bp: 4781484
Gene Length: 3180 bp
Protein Length: 1059 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003388767
Protein GI: 284038837
COG category:
COG ID:
TIGRFAM ID:
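
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4778305
end_bp = 4781484
reported_gene_length_bp = 3180
reported_protein_length_aa = 1059


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length_bp)
print(computed_protein_length == reported_protein_length_aa)
```

Both comparisons hold for this record: 4781484 - 4778305 + 1 = 3180 bp, and 3180 / 3 - 1 = 1059 aa.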


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000935862
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAAT ACTTACTTAG TTGTTTTATG CTGGTTATGA TCGCGACGGG ACCGCTCTGG 
GCGCAAACCC GACAATTGAC GGGTGTGCTA CGCGACGAGC AGGGCCAGAC CATATCCGGC
GCAAACGTGG TCGTTAAAGG GACTACGCGC GGTACTACAA CGGATGCCGC TGGTGAATTT
CGCCTGTCAA TCCCTGCCGA AAACACGATC CTGACAATTT CATCAGTCGG GTATACGGCA
AAGGACGTAC CTGTATCATC GAGTCAGACA CAACTCAGCG TAACGCTGGC GACCGATGAC
CGGCAACTGG GCGAAGTGGT TGTTACCGCT CTGGGTATCA AGCGCGAAGC GAAAGCTCTC
AGCTACGCTA CCCAGATGAT TAAACCCGCC CAGATCAACG AAGTACGCGA CGGTAACGTG
CTGAACACCT TACAGGGCAA AATTGCCGGT GCCTACATAA CCCAGGGTTC CGGCGGACCG
GGCACGGGAT CACGAATTGT GCTGCGGGGA AACCGCTCCA TTCAGGGAAC GAATAATGCC
CTGATGGTGG TCGACGGCGT TCCAATCAAC AACAGCACCT TTGGGCAGGC CACCAGCGAC
TTTGGCAGTG TGGCCAACTC AGATGGAGCC TCGAACATCA ACCCCGACGA CATCGAGAAT
GTGACCGTAT TACGGGGTGC GTCGGCGGCT GCCCTCTACG GCAGTCAGGC TGCCAACGGG
GTAATCCTGA TCACAACGAA ACGCGGAAAG TCGGGCCGGG TATCGGTCGA TATAAACTCG
GGCGTTTCAA TCGATAAACC CTTTGCGCTG CCAATGGTAC AAAACCAGTT TGGACAGGGC
GTTGGCGGAA AGCTGGACCC TGCCGTTGGG GCCAGCTGGG GCGCCCCCAT GACGGGACAG
TCGTACACAA ACTACCTCGG CAATCCTGAT ACGTACTCAG CACAGCCCAA CAACATTCGT
GATTTTTTCC GCACGGCGGT CAGTTTAAAT AACTCCATTG GCATTACGGG AGGCTCAGAG
CGGTCGCAGA CGTATCTGTC GTACACCAAT AACTCATTGC AGGGAACAGT GCCGGGCAAT
GACCTGACCC GTCACACCAT CAACCTGCGG TTGTCGAACC AGATCAGCTC GAAGCTATCG
ACCGATGCCA AGGTAACGTA CATCAATCAG TCTGTAGTGA ACAAGCCCCG GACGGGTGAG
GAAAACGCGC CGGTCATTGA CCTCTACCAG ATTCCCCGTA ACGTAAGCCT GACCACGGCA
CAAAACTACG CAGCGCCCAA CTCGTTCGGT CTGCCTACGC CAACGGCCTG GCCGTCGACG
CTGTCGTCGA TCTACCAGAA TCCCTACTGG ATGACCAATC AGACGGCCAT TAACCAGTAC
CGGGACCGCA TCATCGGCTT CGTGCTGGCG AAGTACCAGT TAACTGATTT CCTGAGCATT
CAGGGCCGGG CCAACCTCGA TAAGTATTTC GACAAAAATG AAGAAAACTA TAGCCAGGGC
ACAATTCTAT GGGCCAACCA GGCGGGTGGT AAATTCTCCC GAAACAACAT CGTAAATACC
CAAAGCTGGT ATGACCTGTT GATTGAAGGA CGGAATAAAA TCGGGACTGA CCTTACGCTC
GACTATCAGG CGGGGGCCAT TATTCAGGAT ACCCGCTACC AATCGACCAA CTCCCTGGCC
GACGGTCTCA ATGTACCGAA CCGGTTTAAC CTGAACTTTG GTACGAACCA GACACTGGGC
GATGATTTCT CGCGGATTCA GACCCAATCG CTGTTCGGGC AGGCATCGCT GGCATGGCGG
GACGCTATTT TTATCAATGC CAGTTTGCGT AATGACTGGT CATCGACCTT GCCAAAGCCT
TATTCGTTCC AGTATCCATC CGTCGGCGCA TCGGTGGTTT TGTCTGATCT GCTGAAACTT
TCGGGGCCGC TGTCATTCCT GAAAATTAAT GGATCGTTCG CGCAGGTGGG TAACGGAGCC
GATCCGTATC TGTTGCAAAC CAATTACTCG TACAGCCAGG GTGCCGGTTC CGGATTCATT
AGCCGGGATG GGACACAGGC CATTGGTAAC CTGAAGCCAG AGATCACCAA AAGTCTGGAA
CTTGGCGTCG ACGCCCGTTT TCTCAGCAAC CGTATTGGTG CAACGATTAC GGCCTACAAA
ACCAATTCGA TCAACCAGTT ATTGAAACTG GGACTGGCAC CCGCTTCGGG ATTCAGTGAC
CAGTACATCA ACGCGGGCGA TATCCGCAAC ATGGGTCTTG AGGTAGTAAT CAATGGAACA
GCGATCAAAA CCGACCGGTT GACCTGGGAT CTGACGCTGA ACATGGGCCT GAACCGGAAT
AAAATCGTTA GCCTGTCGCC CGATATTAAA ACGGCGTTCC TGTCGGGCGG TTATGGCCGG
TCAGCATCGC CGATTGTACA GGAAGGAGGC TCTTACGGTG ATATCGTATC GTACCGCTGG
GCGAAAAATG CCAACGGCCA ATACCTGATT GGTTCGCAAA CACCCAGCGG AACGGTGTCC
GAAGCATCGG TTGTATCGAC TGGCTTGCCC GTGGCTACCA AAGAGCAGGA ATACATCGGC
AACTTCAACC CCAAAATGCT CCTGGGATTC ACCAACACAT TTACGTTCAA AGGCTTTTCG
CTCCGCTTTC TGGTCGACGC CCGTTTAGGT GGCATAGCCG TATCGGGTAC TGAAATGAAC
CTGGCGTTCA GCGGCATTCC GGAAGTAACG GCGCTGAATC GGGGTGGTGG CTGGGTATTG
CCGGGCGTTA CGGCTGGCGT TGCCGGAGCC GATGGAACAA CCTTGATCGG AGCCGGCAAG
ACGAACGCAC AGGCCATTAC GGCCGAACAG TTCTGGCAAA CGGTATCGGG TAAACGCTAC
GGCTGGGGTG AGTTCTTCGC GTACGATGCG ACCAACGTGC GCCTTCGGGA AATTTCGATC
GGTTACGGCA TTCCGGTACC GTCGAATTTC TTTATCAAGT CGGCTCGCCT GTCGTTCGTA
GCCCGCAACC TGTTCTGGAT TTACCGGGGT AGTTCGCTGC TGGACATTCC CGGTATTGGC
AAGCGGAAGA TGTGGTTCGA CCCCGATGTA AATATCGGCA ACGGCAACTT CCAGGGCGTC
GAATACGGAA CCCTCCCATC AAACCGGAGC CTTGGCCTGA ACCTGAAACT TTCTTTTTAA
 
Protein sequence
MLKYLLSCFM LVMIATGPLW AQTRQLTGVL RDEQGQTISG ANVVVKGTTR GTTTDAAGEF 
RLSIPAENTI LTISSVGYTA KDVPVSSSQT QLSVTLATDD RQLGEVVVTA LGIKREAKAL
SYATQMIKPA QINEVRDGNV LNTLQGKIAG AYITQGSGGP GTGSRIVLRG NRSIQGTNNA
LMVVDGVPIN NSTFGQATSD FGSVANSDGA SNINPDDIEN VTVLRGASAA ALYGSQAANG
VILITTKRGK SGRVSVDINS GVSIDKPFAL PMVQNQFGQG VGGKLDPAVG ASWGAPMTGQ
SYTNYLGNPD TYSAQPNNIR DFFRTAVSLN NSIGITGGSE RSQTYLSYTN NSLQGTVPGN
DLTRHTINLR LSNQISSKLS TDAKVTYINQ SVVNKPRTGE ENAPVIDLYQ IPRNVSLTTA
QNYAAPNSFG LPTPTAWPST LSSIYQNPYW MTNQTAINQY RDRIIGFVLA KYQLTDFLSI
QGRANLDKYF DKNEENYSQG TILWANQAGG KFSRNNIVNT QSWYDLLIEG RNKIGTDLTL
DYQAGAIIQD TRYQSTNSLA DGLNVPNRFN LNFGTNQTLG DDFSRIQTQS LFGQASLAWR
DAIFINASLR NDWSSTLPKP YSFQYPSVGA SVVLSDLLKL SGPLSFLKIN GSFAQVGNGA
DPYLLQTNYS YSQGAGSGFI SRDGTQAIGN LKPEITKSLE LGVDARFLSN RIGATITAYK
TNSINQLLKL GLAPASGFSD QYINAGDIRN MGLEVVINGT AIKTDRLTWD LTLNMGLNRN
KIVSLSPDIK TAFLSGGYGR SASPIVQEGG SYGDIVSYRW AKNANGQYLI GSQTPSGTVS
EASVVSTGLP VATKEQEYIG NFNPKMLLGF TNTFTFKGFS LRFLVDARLG GIAVSGTEMN
LAFSGIPEVT ALNRGGGWVL PGVTAGVAGA DGTTLIGAGK TNAQAITAEQ FWQTVSGKRY
GWGEFFAYDA TNVRLREISI GYGIPVPSNF FIKSARLSFV ARNLFWIYRG SSLLDIPGIG
KRKMWFDPDV NIGNGNFQGV EYGTLPSNRS LGLNLKLSF
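
The sequences above are laid out in space-separated groups of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that clean-up together with a GC percentage calculation, using only the Python standard library. The helper names are hypothetical, and the record does not say how its own GC content figure is computed, so a plain count of G and C bases is an assumption.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the bases of seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as a short excerpt.
excerpt = "ATGCTGAAAT ACTTACTTAG TTGTTTTATG CTGGTTATGA TCGCGACGGG ACCGCTCTGG"
bases = clean_sequence(excerpt)
excerpt_gc = gc_percent(bases)

print(len(bases))
print(round(excerpt_gc, 1))
```

Passing the full gene sequence block to `clean_sequence` should give a string whose length matches the reported gene length of 3180 bp. Its `gc_percent` value can then be compared with the reported GC content, bearing in mind that the record's figure is rounded.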