Gene Slin_2729 details


Gene Information

Locus tag: Slin_2729
Symbol:
ID: 8726479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3298537
End bp: 3301536
Gene Length: 3000 bp
Protein Length: 999 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003387543
Protein GI: 284037613
COG category:
COG ID:
TIGRFAM ID:
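
The coordinates and lengths listed above are consistent with one another, which a short sketch can confirm. It assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(3298537, 3301536)
print(length_bp)                  # 3000
print(protein_length(length_bp))  # 999
```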


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0287081
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAC TTTTATTTGC GGGAGTAGTG TGGCTACTCT CGTGCGGAGG ATTGCTGGCA 
CAATCGACCG CAGCGCGTGT ATCCGGCACC GTTAAAACCG ACAACGGAGA ACCTCTGCCG
GGGGCCAACG TGGTCATCAA AAATCAGACA AAAGGCGCTA CAACCGACGC CAACGGCCTT
TTCAGCCTAG ATGCTCGCTC CGGCGATGAG TTAATGATCT CGGCCATTGG TTACCAGAGC
ACTCAGGTTA AAATCGGCAC AAAAAATACC CTTGAAATTT TTCTGCGCGA ATCGGCATCC
CAACTCAACG AGGTCGTTGT GGTTGGTTAC GGTACACAGG ATCGAAAAAA TCTGGTCGGC
TCCGTTACAC AGGTCAACGC CGATGAGATC AAGAATCGTC CCGTAGCTAG TTTCGACCAA
CAGTTGCAGG GTCGGGCAGT TGGTGTTCAG GTGGCGGCTA ACACGGGCGT TCCGGGCGAC
GGTATTTTCT TTCGTATTCG GGGTACCACA TCCATCAACG CCAGCAACGA CCCGCTGTAT
GTGGTCGACG GGGTATTCGT TAATAATCAA TCGCTCCAGA AAATCACCAC ACAGGGGCAG
GCCAACAATC CGCTGGCTGA CATCAACCCC GCTGATATTG AGTCGATTTC GATTTTGAAA
GATGCCGAAG CGACGGCTAT TTACGGGGCG CGGGCGGCCA ATGGCGTTGT GCTGATCACC
ACCAAACGAG GCAGCTATAA CAGCAAAACC AAAGTTAGTT TGAATGCATC CGTCGGGCAG
GCATGGGCTC CCAAACTGTG GGATCTGGTA ACCGGTCCCG AGCACGCGAC CATCATCAAC
GAAGCCTGGA TAAACGACGG CAAACCAGCC GCTACCCGAC CCTTCCGGCC GATCTCGGAA
GGCGGTCGCG GATTACCGGA AGAGCAACCC ACCTACGACC GGCTGCACGA TATTTTCCGG
ACGGGCGCCC TGCAAAACTA CGATCTGGCC GTTTCGGGCG GCACCAAACA AACCCGCTTT
TACATCGGTG GTGGGTACAC AAGCCAGCAG GCAACGCTGC GTACCAACGA CTTTTCGCGG
GCGAGTTTCA AGCTGAACCT GGATCAGGAC ATTACGGATA AAATCCGCAT CGGCACCAGC
AATATACTCT CTCAGTCGAA CCGGACCAAT GCACGGGTTG GCGATGGACC ACAGGGCGGT
ATTTTACAGG CTGCTTTACA CACGCCGACC TACCTGCCAA AATTCAATAC AGACGGCTCC
TACGCCAAAT GGGCCGGTTT CGACAACCTC GATGTGCTGA TCAACAATAC GGATATGCAC
TCGACCAGCA CCCGCTACAT CGGCAACATT TATGGTGAAT ACGACATTAT CAGCGGCCTG
AAACTACGCA GTAGCTGGAG CATCGACTAC AACGATTACA ACGAATACGA GTACTGGAAC
ACGCTAACCA ACCGGGGTAG CGCGAGCAAA GGACTGGCTA CATCGAGCGT TAGTAAAAAT
ACGATCTGGA TCAATGAACA GACATTGTCG TACCGACGGT CATTCGGTAC CCAGCACAAT
TTCGGCGCGC TGGTTGGGAA TACCTTGCAG GGAAACGTAT CAACGCAGAC GCTGGCTCAG
GGCACCAATT TCCCGTCCGA TGCATTTAAG CAGATCGCGT CGGCATCGGT AACGACCGCT
TCTTCTAACC GGAATCAGTA TAACCTGGTC TCGTTCTTCG GGCGGGTCGA TTATAATTTC
TCGAAAAAGT ATTTTCTGGA AGCCAGCCTT CGGGCCGATG CGTCGTCCAA GTTTGCCGAG
GGGCATCGCT GGGGCTATTT CCCCTCGGCG GGGGTGGCCT GGCAGCTTAA GCAGGAGAAT
TTTTTGCGGG ATGTCAATTT CCTGAGCGAC CTAAAAATTA GAGCCAGTGT GGGCTGGACG
GGCAACCAAA ACGGCATCGG CAACTATGCC TCCCGCGGCT TGTGGGGTGG CGGCAACAAC
TACCTCGACA ATCCGGGAAC GGTACCCGTT CAACTGGCCA ACCCGGAACT GAAGTGGGAA
ACCACCCGCC AAACCAACGT AGGATTGAAC GTCGGTCTAC TGAGCAACCG CATTGGCCTG
GAAATAAACG CCTATTCCAA ATACACCTAC GACCTCCTGC TTCAGGTGCC ACTGGCGCAG
AGTTCGGGTT TCTCCAGTAT CTACCGGAAC GATGGCGAAA TCAGCAACCG GGGGCTTGAA
TTTGGTATCA ATACCCAGAA CATCAACAAA AGCAGCTTCC AGTGGAATAC CAGCTTTAAC
ATTGCCGCTA ACGTAAACCG CATCGAAAAG CTTTCCATCC CGGTCGATGC CAGCTATGCG
GCCGAACGCA TGGCGCAGGG ACAGGCGTTT CACTCCTTTT ACGTCTACCG ACAACTGTAT
GTCGATCCCA AAACGGGCGA CGCGGTCTAT GACGATGTCA ATAAAGACGG CAAAATCACC
GTGGCCGACC GACAATTTTA CGGCAGTGCG TTGCCCAAAT TTTTTGGTGG GCTGAACAAC
ACCTTCGCTT ACAAAGGGTT CGATCTGTCG GTATTTTTCA ATTTCAGCTA CGGCAGTAAA
GTCTTTAACA ACAACCGCTT CTTCCACGAG TCGGGCGGGA CGCGGGATGA CCGGCGGGCC
ATCAACAAGA ATCAGCTAAA GCGCTGGCAG AAAGAGGGCG ATATCACGGA TGTGCCTCGC
GTGACGACCA TTGGCAACAA CTACAATCTG AGCCCCACCA GCCGATTTGT AGAAGACGGG
TCGTTTCTCC GACTGAACTC GCTCGTGTTG GGCTACACCA TTCCAAAAGC CGTTTTGCGC
AAAGTGGGCA TTTCGTCGGC GCGGGTGTAC TACAGCGGCT CCAACCTGTG GCTGCTGAGC
AACTACCAGG GTCCTGACCC TGAGGTAAAC GTCACCGCCG ACCCTACCAC CCAGGGATAT
GATCTGGGCA CCCCTCCACA ACCCCGAACG GCACAATTTG GCATCAACCT CACTCTCTGA
 
Protein sequence
MQKLLFAGVV WLLSCGGLLA QSTAARVSGT VKTDNGEPLP GANVVIKNQT KGATTDANGL 
FSLDARSGDE LMISAIGYQS TQVKIGTKNT LEIFLRESAS QLNEVVVVGY GTQDRKNLVG
SVTQVNADEI KNRPVASFDQ QLQGRAVGVQ VAANTGVPGD GIFFRIRGTT SINASNDPLY
VVDGVFVNNQ SLQKITTQGQ ANNPLADINP ADIESISILK DAEATAIYGA RAANGVVLIT
TKRGSYNSKT KVSLNASVGQ AWAPKLWDLV TGPEHATIIN EAWINDGKPA ATRPFRPISE
GGRGLPEEQP TYDRLHDIFR TGALQNYDLA VSGGTKQTRF YIGGGYTSQQ ATLRTNDFSR
ASFKLNLDQD ITDKIRIGTS NILSQSNRTN ARVGDGPQGG ILQAALHTPT YLPKFNTDGS
YAKWAGFDNL DVLINNTDMH STSTRYIGNI YGEYDIISGL KLRSSWSIDY NDYNEYEYWN
TLTNRGSASK GLATSSVSKN TIWINEQTLS YRRSFGTQHN FGALVGNTLQ GNVSTQTLAQ
GTNFPSDAFK QIASASVTTA SSNRNQYNLV SFFGRVDYNF SKKYFLEASL RADASSKFAE
GHRWGYFPSA GVAWQLKQEN FLRDVNFLSD LKIRASVGWT GNQNGIGNYA SRGLWGGGNN
YLDNPGTVPV QLANPELKWE TTRQTNVGLN VGLLSNRIGL EINAYSKYTY DLLLQVPLAQ
SSGFSSIYRN DGEISNRGLE FGINTQNINK SSFQWNTSFN IAANVNRIEK LSIPVDASYA
AERMAQGQAF HSFYVYRQLY VDPKTGDAVY DDVNKDGKIT VADRQFYGSA LPKFFGGLNN
TFAYKGFDLS VFFNFSYGSK VFNNNRFFHE SGGTRDDRRA INKNQLKRWQ KEGDITDVPR
VTTIGNNYNL SPTSRFVEDG SFLRLNSLVL GYTIPKAVLR KVGISSARVY YSGSNLWLLS
NYQGPDPEVN VTADPTTQGY DLGTPPQPRT AQFGINLTL
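
The sequences above are printed in blocks of ten characters. A minimal sketch of how such a listing might be cleaned, translated, and checked for GC content follows. It is not the database's own tooling, and the helper names are hypothetical. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares for elongation. Table 11 additionally permits alternative start codons, which this sketch ignores because the gene above begins with ATG. Ambiguous bases such as N are not handled and would raise a KeyError.

```python
# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCAAAAAC TTTTATTTGC GGGAGTAGTG"
print(translate(first_blocks))  # MQKLLFAGVV
```

Applied to the full gene sequence, translate should reproduce the protein sequence listed above, and gc_fraction can be compared against the reported GC content.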