Gene Slin_5066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5066 
Symbol 
ID8728831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6189913 
End bp6192918 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content52% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003389840 
Protein GI284039910 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000778295 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCTAT TCTGCGTGAT GAGCTTCAGC GCAATAGCTC AAACGAAGGT TGCCGGGAAA 
GTGATTGCCG ATGATAAAAA AGAGGAGTTA GCTGGTATAA GCATCGCCGT GAAAGGAAAG
GTTATTGGCA CCATTACCGA CCAGAAAGGA AATTTTTCCT TTACAACCAA CACGCCAACA
CCGTTCACGG TCGCTATCTC CGGGGTTGGT TTCGAAACGC AGGAGTATGT GATCAATGGC
AACCGTACGG ACCTGAACGT AAGCCTGAAA GAACAGGTGA CGATTGGTCA GGAAGTGGTC
GTATCAGCCT CCAGGGTCGA AGAAAGTGTC TTGAAATCAC CGGTATCCGT TGAGAAAATG
GATATTCGGG CTATTCAGTC TACACCTTCC GTTAATTTTT ATGATGGCTT AGCCAACGTA
AAAGGGGTCG ATGTGGCCAC ACAGGGAATG CTGTTCAAGT CGATAAACCT GCGGGGTTTT
GGCGCAACGG GTAACCCAAG AACCGTGCAG TTGATCGATG GGATGGACAA CTCGGCACCG
GGTCTGAACT TCCCGGTCGA TAATATCGTA GGTGTTCCGG AGCAGGACGT TGAAAGTGTC
GAGATTTTGC CCGGCGCGGC TTCTGCCCTT TACGGACCCA ATGCCATTCA GGGGCTGATT
CTGATTAATA GCAAAAGCCC GTTCCTGTAC CAGGGGTTGA GCGCTAACGT CAAAACGGGT
ATTATGGATG CATCGAACCG GACAACGTCT ACAACGGGTT TTTATGATGC GTCCATTCGG
TACGCTAAAG CGTTCAATAA CAAGTTTGCT TTCAAGATGA ACCTATCTTA CATAAAGGCA
AAAGACTGGG AAGCAACGAA TTACACGAAC CTGAATGGTG CTGGAAATTC TGATCCCAAC
CGGGGAGCGG GTACGGCTGT CAACTATGAT GGCGTAAACG TGTATGGCGA TGAGAACCAG
CAAAATATGC GTACGGTGGG TCAGGCGTTG ATCGGGGCGG GTCTTTTGCC TGCTGCTGCC
CTGAACATAC TGCCTAACGT AAACATCAGC CGTACGGGTT ATCCCGAAGT GAATCTGGTT
GATTACAACA CCAAAAGCTT TAAGTTCAAC GGCGCGCTGC ACTACCGAAT CTCGGATAAG
GTCGAAGCCA TTGGCCAGCT GAATTATGGT ACCGGTACCA CCGTTTATAC CGCTACCGGC
CGGTATTCGT TACGGGATTT CAGCATAGCG CAGGCCAAGC TCGAACTGCG GGGCGATAAT
TTTATGGTAC GTGCCTACAC AACGCAGGAG CGGTCGGGTA AATCATTCAC GGCAGGTTTG
GCCAGTATTG GCTTTAACGA GGCCTGGAAA CCAAGTGCTA CCTGGTTTGG GCAATATGTT
GGGGCGTATG CAGCCGCCCG GGGAGCCGGT CAGGGAGATG ATGCCGCCCA ACTGTCAGCT
CGTGGCATAG CCGATCAGGG TCGCCCGATA CCCGGTACAG AAGCGTATAA GGCTTTGTAT
GATAAACTAA GCTCGACGCC CATCAGTCAG GGGGGCGGGG CCTTCTCCGA CAAGTCGAAC
CTGTATCATG TAGAAGGGCT GTACAATTTT AAAAACCAGA TTAAGTTTGC CGATGTACTG
GTTGGAGCCA ACTACCGGCA ATACCAGTTG GCTTCGGAAG GAACGCTCTT TGCCGATCAG
GCGGCCGGAC GCAACGGTAC CATTGGTATT ACGGAGTTCG GCGGGTTTAT TCAGGCCAGT
AAATCCCTGT TCAGCGAACA CCTCAAGTTG ACGGCGTCGA CCCGTTACGA CAAAAACCAG
AATTTCGAAG GGCAGTTTAC GCCCCGTGTA TCGGCCGTAG CAACCTTCGG GGAGCACAAT
ATCCGGTTAT CATACCAGAC TGGTTTCCGT ATTCCAACCA CGCAGAATCA GTATATCGAC
CTGAAAACGC CACTGGCCCG GCTGATCGGT GGTTTGCCGG AGTTTTCGGA TCGTTACAAT
CTGGCGAACT CCTACTCCCG TACTGATGTA ACCGCCCTCG GCGCTGCCAT CACCGCCAGT
GCTGCCAGCC CCACCGTTCA GCAGGCCGCC GTACAGCTGA TTACGCAGCA GGTGACAGCG
CAGGTGACGG CGCAGGTAAC TGCTCAGGTA AATGCCGCCG TTGCCGCCGG TCAGATTCCG
GCCAGTGCTG CCGCAGCAGC CATTCAAAGT GCAGTTGCCT CTACCTTAAC TGCGGTGTTG
CCCGGTCAGA TTGCCGCCAA TATCAACAAT GCAGTAACAG CCGTAGCTAT TAACAGCAAT
ATTGGCAACC TGAAGCCTTA CCAGCGGCAG GCATTCAAGC CGGAGCGTGT GGCGAGCTAT
GAAATTGGTT ACCGAAGCGT ACTGGGTAAG CGCCTGTTTG TCGATGCCTA TTATTACTAT
AGTGTGTACA CAAATTTCAT TGGCAGCGTT ATCCTGCTTC AGCCAACGGC TCCGGTGGCT
GCGGGCCTGC CGCTGGCATC CGGTGTGTTA AGCGGAGGAA CGCGGAATGC GTATTCGATG
CCTGCCAACA GCAGCGAAAA AATCAATACG TCGGGTTGGG CGCTGGGTCT GAATTATCAG
TTACCAAAAG GTTATGGTAT ATCGGGTAAT CTGGCCAACA ACAAGCTCAA TAACTTCACG
CCAACGGCGG AGCTACAGAC ATCGGGCTTC AACACGCCGG AATATCGCTG GAACTTAGGC
TTCACCAAAC GGCCTATGGC TAACTCGAAT ATTGGCTTTG CCGTTGCCTT CAAACATCAG
GATGCGTTCA CCTGGGAGGG CTTTGCCGTA CCTACCGAAC TGGTGCCGAA TCTGTACGAG
AAAACAATTG TACCGGCTAT CAGTAACTTC GATGCGCAGG TCAATTACAA GGTGTCAAGC
CTTAAGTCGA TTGTGAAAGT GGGTGCAACC AACCTGTTTG GAAAGCCTTA CTTCCAGGCC
TATGGTAGTT CATACGTTGG TTCGACCTAC TACATCAGCC TGACGTTCGA TCAACTGATG
AACTAG
 
Protein sequence
MGLFCVMSFS AIAQTKVAGK VIADDKKEEL AGISIAVKGK VIGTITDQKG NFSFTTNTPT 
PFTVAISGVG FETQEYVING NRTDLNVSLK EQVTIGQEVV VSASRVEESV LKSPVSVEKM
DIRAIQSTPS VNFYDGLANV KGVDVATQGM LFKSINLRGF GATGNPRTVQ LIDGMDNSAP
GLNFPVDNIV GVPEQDVESV EILPGAASAL YGPNAIQGLI LINSKSPFLY QGLSANVKTG
IMDASNRTTS TTGFYDASIR YAKAFNNKFA FKMNLSYIKA KDWEATNYTN LNGAGNSDPN
RGAGTAVNYD GVNVYGDENQ QNMRTVGQAL IGAGLLPAAA LNILPNVNIS RTGYPEVNLV
DYNTKSFKFN GALHYRISDK VEAIGQLNYG TGTTVYTATG RYSLRDFSIA QAKLELRGDN
FMVRAYTTQE RSGKSFTAGL ASIGFNEAWK PSATWFGQYV GAYAAARGAG QGDDAAQLSA
RGIADQGRPI PGTEAYKALY DKLSSTPISQ GGGAFSDKSN LYHVEGLYNF KNQIKFADVL
VGANYRQYQL ASEGTLFADQ AAGRNGTIGI TEFGGFIQAS KSLFSEHLKL TASTRYDKNQ
NFEGQFTPRV SAVATFGEHN IRLSYQTGFR IPTTQNQYID LKTPLARLIG GLPEFSDRYN
LANSYSRTDV TALGAAITAS AASPTVQQAA VQLITQQVTA QVTAQVTAQV NAAVAAGQIP
ASAAAAAIQS AVASTLTAVL PGQIAANINN AVTAVAINSN IGNLKPYQRQ AFKPERVASY
EIGYRSVLGK RLFVDAYYYY SVYTNFIGSV ILLQPTAPVA AGLPLASGVL SGGTRNAYSM
PANSSEKINT SGWALGLNYQ LPKGYGISGN LANNKLNNFT PTAELQTSGF NTPEYRWNLG
FTKRPMANSN IGFAVAFKHQ DAFTWEGFAV PTELVPNLYE KTIVPAISNF DAQVNYKVSS
LKSIVKVGAT NLFGKPYFQA YGSSYVGSTY YISLTFDQLM N