Gene Slin_6451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6451 
Symbol 
ID8730235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7821239 
End bp7824427 
Gene Length3189 bp 
Protein Length1062 aa 
Translation table11 
GC content56% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003391207 
Protein GI284041277 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.402176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAAT TTATATTGGG CAGTTGGCTT CTGGCCCTTC TTTCCTGCTT ACCCGCTCTG 
GCGCAGGATT TTACGGTGAG TGGCCGGGTA ACATCCTCCG AAGACGGGGG CGGGCTGCCC
GGCGTAAGCG TCCAGCTTAA AGGAACAACG CGCGGCACCA CCACCGACGC CGAAGGCAAC
TATCGGTTAA GCGCGCCCGC TACGGGCCGG TTGGTATTCA GCTTCATTGG CTATGCGTCT
CAGGAGATTG CCATAGGTAA CAAATCGACC ATCGCGGTGA ACATGGTGCC CGATGCTGCT
AATCTGGATG AGGTCATCGT AACCACCTTT GGTACGGCAA AACGGGCATC CTTTACGGGT
TCGGCAGGGA CGCTGTCAAC AACACAGATT CAGAATCGTG GCGTCAGTAA CGTGGCACAG
GCACTTTCGG GTGCGGTGTC GGGCGTTCAG ACAACGGCCG GCAGCGGTCA GCCGGGCTCC
GCTCCCGAAA TTCGTATTCG GGGCTTCGGC TCGATCTCGT CGGGAAATGA CCCGCTGTAT
GTGGTAGACG GCATTCCGTA TTCGGGCAAT ATTGCCAACA TCAACCCCAG CGACATCGAA
AGCGTGTCGG TGCTGAAAGA TGCGGCCTCA ACCGCCCTGT ACGGTGCACG GGCGGCAAAT
GGTGTGGTGG TTGTAACCAC AAAAAAAGGG CTGAAAGACC GGAGTACCAT CAACGTGCGT
TATACGCAGG GCTTTAGTAG CCGGGGGCTG CCCGAATACG ACCGGGTTGG CGTTGGCGAG
TACTACCCGC TGATGTGGGA AACCTACCGG AACAGCATTG CCTACCGGGC CACCAACCCC
GTAGCCCTGG CAACGGCAAA CGCCGATGCC ACAAACCGAC TGGTTAGTCT GGTCGGCTAC
AACGTATACA ACGTGCCCGG CAATCAGTTG GTTAATACCG ACGGGCAGTT CAACCAAAAC
GCCCAGCTTC TGTTTTCGCC CGACGATCTG AACTGGGAGA AGCCGATTAC GCGTCAGGGG
AACCGGCGTG AACTGAACGT AAGCTTCGCT GGCGGGCAGA AAAACTCCGA CTATTTTGTG
TCTTTGGGCT ACCTCAACGA CAAAGGGTAT CTGATCCGTT CCGACTTCGA GCGGTTTACG
GGCCGGATCA ACATCAACTC CCAGATGAAA CCCTGGTTTC GGGTGGGGGC TAACCTGTCG
ACGACCATCT CGAAGTCCAA CCAGGCGGAT GCCGATGGCA GCACCAATTT CGTGAACCCG
TTCTTTTTCT CACGGAATAT TGGTCCTATC TATCCTGTGT ATGCCTACGA CCCAACTAAT
GTCGGCCAGT TTTTGACGCT GCCGAACGGT CAGCGACGGT GGGATTACGG GAACCTGACG
TCCCTGGGCT TACCGGCCCG GCCCCAGTTT GGTGGTCGCC ATTCGGTTGC GGAGACGTTG
CTGAACCAGA ACTTCCTGCG CCGTAACGTA CTGGGCGCAC GGGGTTTCGC CGAAGTTTCG
TTCCTGAAAG ATTTCAAGTT TTCGGTGAAC GTAGGTACGG ACATTACTAA TACGAATGTG
TTCACCTATG GCAACCCCGA AGTGGGTGAC GGCGCTCCGG CGGGCCGTGC GAACCACCAG
TTTCAGAACA TCACCAGCTT CAACCTCAAC CAGCTACTGA ACTACAATAA GTCGTTCGGG
AAAAATACGT TCGATGTGCT GCTGGGCCAC GAGAACTTCA GCATCAACGA CAACAACCTG
GAAGGCTCGC GTTCGCAGCA GATTGTGGAC GGTAACTACG AGTTAGGCAA CTTCACGACC
ACAACCTTTC TGTCGTCGGT GTACAACACG CGTCGGGTAG AAGGCTATTT CTCCCGTATC
AACTACGACT ACGACCAGAA ATACTTCCTC TCTGCTTCGG TTCGGCGCGA TGGCTCCAGT
AAGTTCTATC GGGATTCCCG CTGGGGTACG TTCTACTCCG TGAGTGGTGC ATGGCGTATC
GACCAGGAAG ATTTTCTGCG GTCTATTCCA ACCATCAACT CCCTGAAACT ACGGGCTTCG
TATGGCCAGA CCGGTAATGA TGGCGGAGGA AATACGGCGG CTTTTGCTCA GGACAATACC
ATCAGTTACT ACGCCTGGCA GCCGCTGTTC GGCTTAGGGA GCTGGAACAA CGCATCGGAA
GCGGGTATTC TGCAAACGAG CCTTGGCAAC CAGAACCTGG CCTGGGAATC GAGCAACTCC
TTCGATGCCG CGCTGGAGTT CAGCCTGTTC AAAGGGCGCG TTTCCGGTAC GGTTGAGTAC
TTCGACCGCC GGTCGTCCAA CCTGATCTTC GCCGTACCCT TGCCCCTATC GGATGGTATT
TCGACGGTTA CCCGAAATAT CGGCACGATG TACAACCGGG GTATGGAGAT TGAACTGGGC
ATTGAACCCA TCCGAACGAA GGACTTCACC TGGCGCATCG ACCTGAACGC CACCCGCGTG
AAAAACCGGA TTACGAAGAT GCCCGATGAG AATCCCGAAA TTATTGATGG CACGAAGAAA
CTGGCCGTTG GCCGGTCTAT CTACGACTAC TGGCTCCGCG AATACATGGG CGTGAACCCC
ACAACGGGCG AAGCACAATA CAGGGCGGCA AACTATGTAG CATCGAACTC CCGCATTACC
GAAGGCGGTG ATACGCTGAC AACCAGCGTC AACAATGCGC GCTACCATTA CAATGGCTCG
TCTATCCCCA CGGTGTCGGG TGGATTTACG AATACCTTCC GCTACAAAGG CATTACCCTG
TCGGCGCTGA CCGTATATCA GTTAGGCGGT AAAACCTACG ACGGAGCCTA CGCAGCCCTG
ATGAGTTCGG GCGGGTATGG CAGCGCCAAA TCCGTCGATA TTCTGAACCG GTGGCGGAAC
CCCGGCGACA TCACCAATGT GCCCCGTATG GATGCTGGAC GCACGTCGGA TTTTGATGCC
GCATCGGACC GCTGGCTCAC GAATGCCAGC TACCTGAACC TGCGTACGGT AACGCTTTCG
TACGCGCTGC CCGCTACCTT GTCGCGCAGA GCCTTCCTGG AGAATGCACA GGTGTACATC
ACTGGCGAGA ACTTCCTTAT CCTGTCTCAC CGGAAAGGGA TGAACGTTCA GCAAACCTTC
ACGGGGGTAA CCAGCAACGT ATTCAGCCCA GCCAAAAGCA TTATTCTGGG TGTTTCATTT
ACGCTTTAA
 
Protein sequence
MGKFILGSWL LALLSCLPAL AQDFTVSGRV TSSEDGGGLP GVSVQLKGTT RGTTTDAEGN 
YRLSAPATGR LVFSFIGYAS QEIAIGNKST IAVNMVPDAA NLDEVIVTTF GTAKRASFTG
SAGTLSTTQI QNRGVSNVAQ ALSGAVSGVQ TTAGSGQPGS APEIRIRGFG SISSGNDPLY
VVDGIPYSGN IANINPSDIE SVSVLKDAAS TALYGARAAN GVVVVTTKKG LKDRSTINVR
YTQGFSSRGL PEYDRVGVGE YYPLMWETYR NSIAYRATNP VALATANADA TNRLVSLVGY
NVYNVPGNQL VNTDGQFNQN AQLLFSPDDL NWEKPITRQG NRRELNVSFA GGQKNSDYFV
SLGYLNDKGY LIRSDFERFT GRININSQMK PWFRVGANLS TTISKSNQAD ADGSTNFVNP
FFFSRNIGPI YPVYAYDPTN VGQFLTLPNG QRRWDYGNLT SLGLPARPQF GGRHSVAETL
LNQNFLRRNV LGARGFAEVS FLKDFKFSVN VGTDITNTNV FTYGNPEVGD GAPAGRANHQ
FQNITSFNLN QLLNYNKSFG KNTFDVLLGH ENFSINDNNL EGSRSQQIVD GNYELGNFTT
TTFLSSVYNT RRVEGYFSRI NYDYDQKYFL SASVRRDGSS KFYRDSRWGT FYSVSGAWRI
DQEDFLRSIP TINSLKLRAS YGQTGNDGGG NTAAFAQDNT ISYYAWQPLF GLGSWNNASE
AGILQTSLGN QNLAWESSNS FDAALEFSLF KGRVSGTVEY FDRRSSNLIF AVPLPLSDGI
STVTRNIGTM YNRGMEIELG IEPIRTKDFT WRIDLNATRV KNRITKMPDE NPEIIDGTKK
LAVGRSIYDY WLREYMGVNP TTGEAQYRAA NYVASNSRIT EGGDTLTTSV NNARYHYNGS
SIPTVSGGFT NTFRYKGITL SALTVYQLGG KTYDGAYAAL MSSGGYGSAK SVDILNRWRN
PGDITNVPRM DAGRTSDFDA ASDRWLTNAS YLNLRTVTLS YALPATLSRR AFLENAQVYI
TGENFLILSH RKGMNVQQTF TGVTSNVFSP AKSIILGVSF TL