Gene Slin_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0203 
Symbol 
ID8723931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp268973 
End bp272116 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content52% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003385067 
Protein GI284035137 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.671806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAAA TTTTATTGGG AAGCTGGTTA CTTTCTCTGT TATTCTGTTT GCCCGTACTG 
GCGCAGGATA TAGCAGTTAG TGGCCGCGTC ACTTCATCAG ACGACGGCTC CACCCTACCC
GGTGTGAGCG TGCAGGTAAA AGGAACAACC CGTGGTGCTA TTACAGACGC CGACGGAAAC
TATCGAATCA GCGTACCGGC CAATGCGCGA CTCGTATTTA GTTTTATTGG TTATACCGGA
CAGGAGGTTG CCGTTGGCAA CAAAACAACG ATCAACGTTA CGCTGGTAGC GGGTTCGCAA
AGCCTCGATG AAATCGTTGT GACGGCTCAG GGTATCGAGC GCGACAAGCG TTCACTGGGC
TACGCTACGC AGGAAATTGG CGGTAATATT CTAGCACAAC GCTCAGAACC AAACCTGCTT
AATGCCTTAC AGGGTAAAGT AGCGGGCGTC AATATCACGG GTTCCAGTGG CACACCAGGC
GCATCAACAA ACATCAACAT TCGGGGTATT ACATCATTCA ATGGCAGTAA CCAGCCCTTG
ATCGTGGTCG ATGGGATCAT CTTTAGCAAC GACGTTAACC TGACACAGAA CACGCTGTTT
GGTACGCAGC CTTCTAACCG TCTGGCCGAC ATCAACCCGG AGAGTATTGA ATCGGTGAAC
GTACTGAAAG GACCCGCAGC TGCCGTTCTG TACGGTTCGC GGGCTTCGGC TGGTGCTATC
GTTATTACGA CGAAATCGGG CCGCAACCAG AACAACAAAA CCGAAGTAAC GGTCAATTCG
TCGTACAACG TACAGAATGT ATACGGCATT CCAAGGTTCC AGAACGACTA CGGGCAAGGA
GCAAACAACC TGTTTACCCC AACGTCCAAT AACTCATGGG GACCTCCGTT TGTAGGTGGA
CCAACATCGG TTACTAATAC CCAGAACCAG GTTGTGCCCT ATCAGGCGTA TCCGAACAAC
GTACGGGACT TCTACCGTCA GGGCAGTATT CTGCAGAACT CGGTTAACAT TGCCTCGGGC
GATGCCACAC GCAACTACAT TATTGCTATT GGTAATACCC TACAGAATGG TATCGTTCAG
AACACGAAAT TCAACCGGAC AAACGTACAG TTAGGCGGTG AGTCGAAACT GAAAAACGGC
CTGAAAGTGA GTGGTACGGG TACGTACGTA CAAACGGTAT CGCGCGCAGT GCCGGGTGGT
AACGGCGCCA GTGCGTTTGG GCAGATTACC CGTATTCCGC GGAGTTATGA CCTGGCAAAC
GAGCCCTACC AGGGTGCTGA CGGAAAAAGT ATCTACTTCA CCCCGTCGAC AAACAACCCA
CAATGGAGCG TTAACAACGA ACGTCTCGAC AGCCAGGTTG ACCGTTTCTT CGGCAATTTC
CAACTGAGCT ACGACGTAGC CAGCTGGTTA AACGTCGCTT ACCGGGTAAC GGGTGATACT
TACACCGACC GTCGTAAGTT GATTCTGCCC ATTAGCTCGG GTCGTGCTCC GGCAGGCCAG
GTTCAGCAGG ATAACTTCTT CCGGAATGAG TTGAACGGCG ATCTGTTGAT CACGGCCCGC
AAGGACAACC TGTTTATGGA AGGGCTGAAC GCGAACCTGC TGCTGGGTAA CAACATCAAC
CAGCGGAAAA CCCAGGAGTC GGCGGCCGAT GCAACCTCCC TGACCCTGCC CGGTTTTTAT
AACATCAACA GCGGTACGGT GTTCACGGGT ACGTTCGAAA GCTCGACCCT GCGTCGTCTG
GTGGGTTACT ATGGCCAGTT GTCGCTGAAC TACAATAACT ACATCTTCCT GGAATTATCC
GGACGTGCCG ACCAGTCGTC TACTCTGCCG AAAGCGAACA ACACCTATTT CTATCCGGGT
GCCTCGGTTA GCTTCGTGCC AACAGACGCC TTCAAGATCA ACTCCGACGT GCTTTCTTAT
GCGAAAGTGC GGGCCAGCAT TGCTAAAGTT GGTCGTGATG CCGACCCTTA CCAGTTGAGC
ACCGTTTACA ATAAGTCTAG CTATGGTAAC AACGTTGCTA ACATCGTTTA CCCACTGGCA
CCAAACAACA CACCCGGCTT TAGCATTGAT ACCCGAATTG GTAACAACAG CCTCAAGCCT
GAGTTTGTAA CGTCGTATGA GTTCGGTATC AACCTTGGCT TCTTCAAGAA CCGCCTGAGC
GTCGATGCTA CGTACTTCGA CTCAAAGAGT ACGCAGCAGA TTTTCAACGT AGCCGTTTCG
AACTCATCGG GTTTCGACAC CCGGACAACC AACGTTGGTG AATTACAGAA CCGGGGGGTT
GAATTGATCC TGAGTGCAAC GCCCGTGCGA GTTGGTGGTT TCAAATGGGA TGCTACGCTG
AACTACACGC TGATCCGCAA CAAGGTGGTT TCCATTGCTC CCGGTGTTAA GTCGTCTAAC
ATAGCGGGTA ACTCATTTAT CGGTATTGCT CCGTCTATCT ACGAAGGCTA TCCGTATGGT
GTTATTGTTA GTACGGCCAA CTCACGGGCT CAGAATACCG ACCCCAATGG TTTGTACTAT
GACGCAACCG GTCAGTTTGC CGGTCAGTAT ATTATCAATG GAACCAACGG GCAGTTTGCG
CCGGGCTTGG CCAACTCGGT TATTTCGAAC CCGCAGCCTA ACTACATCGC GGGCTTAACC
AACACCTTCT CGTACAAAGG TATAGCCCTG TCTGTACTGG TTGATACCCG TCAGGGTGGT
CAGATTTTCT CGTTCAACGC CGTTGATGCC CGCCAGAATG GCTCGATGTA CGTAACGGGT
ATTGACCGCG ATCAGCCGCG TATTTTACCC GGCGTGATCC AGAATGCAGA CGGCACCTTC
CGCCCGAACA ACATCCAGCT ACCGGCCCAA ACCTACTGGG GCGCCCTGGG CGGTCTGGCT
TCGGAAGCCG CCGTTTACGA CGCAACGGTC TATCGTCTGC GCGAGGTGGC GCTGAACTAC
TCCTTACCGA AAACCTTACT TGGCAAAACG CCATTTGGTG CTATTTCGGT GGGCGTGAGC
GGTCGTAACC TGTTTTTCTA CGCGCCCAAC TTCCCGGCCG ACCCGGAGGT CAACACGCAG
GGAGCGGGTA ATATTCAGGG CCTTGACCTG AACGGGCCGC CAAATACACG GAACTTCGGC
GGTAACATTC GGCTCACGTT CTAA
 
Protein sequence
MQKILLGSWL LSLLFCLPVL AQDIAVSGRV TSSDDGSTLP GVSVQVKGTT RGAITDADGN 
YRISVPANAR LVFSFIGYTG QEVAVGNKTT INVTLVAGSQ SLDEIVVTAQ GIERDKRSLG
YATQEIGGNI LAQRSEPNLL NALQGKVAGV NITGSSGTPG ASTNINIRGI TSFNGSNQPL
IVVDGIIFSN DVNLTQNTLF GTQPSNRLAD INPESIESVN VLKGPAAAVL YGSRASAGAI
VITTKSGRNQ NNKTEVTVNS SYNVQNVYGI PRFQNDYGQG ANNLFTPTSN NSWGPPFVGG
PTSVTNTQNQ VVPYQAYPNN VRDFYRQGSI LQNSVNIASG DATRNYIIAI GNTLQNGIVQ
NTKFNRTNVQ LGGESKLKNG LKVSGTGTYV QTVSRAVPGG NGASAFGQIT RIPRSYDLAN
EPYQGADGKS IYFTPSTNNP QWSVNNERLD SQVDRFFGNF QLSYDVASWL NVAYRVTGDT
YTDRRKLILP ISSGRAPAGQ VQQDNFFRNE LNGDLLITAR KDNLFMEGLN ANLLLGNNIN
QRKTQESAAD ATSLTLPGFY NINSGTVFTG TFESSTLRRL VGYYGQLSLN YNNYIFLELS
GRADQSSTLP KANNTYFYPG ASVSFVPTDA FKINSDVLSY AKVRASIAKV GRDADPYQLS
TVYNKSSYGN NVANIVYPLA PNNTPGFSID TRIGNNSLKP EFVTSYEFGI NLGFFKNRLS
VDATYFDSKS TQQIFNVAVS NSSGFDTRTT NVGELQNRGV ELILSATPVR VGGFKWDATL
NYTLIRNKVV SIAPGVKSSN IAGNSFIGIA PSIYEGYPYG VIVSTANSRA QNTDPNGLYY
DATGQFAGQY IINGTNGQFA PGLANSVISN PQPNYIAGLT NTFSYKGIAL SVLVDTRQGG
QIFSFNAVDA RQNGSMYVTG IDRDQPRILP GVIQNADGTF RPNNIQLPAQ TYWGALGGLA
SEAAVYDATV YRLREVALNY SLPKTLLGKT PFGAISVGVS GRNLFFYAPN FPADPEVNTQ
GAGNIQGLDL NGPPNTRNFG GNIRLTF