Gene Slin_0553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0553 
Symbol 
ID8724281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp681252 
End bp684362 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003385416 
Protein GI284035486 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAACT ATTTACTCAA TAGGCTTCAA CAGTCGATAC CCTACTGTTG GTTAGCTCTG 
CTGTTGACGA TCGGCATTGC CAACGGGCAG ACAACCACTT ACTCGTTTAG CGGGCGGGTA
CTGGATGAGA AAAATACGGG CCTCCCCGGC GCTACGGTCG TCCTGAAAAA CAATAACAAG
ACGGGCACTA CCACCGACGC CAACGGTAAG TTCACGATCA GCATGCCCAC GGGCGGTGGC
ACACTGGTGG TATCGGCCAT TGGTTACCTG GCCAAAGAAG TTGCCGTTAC GAGCGAAACC
ACCATCGACG TACCGATGGC TCCCGACGTA AAGACGCTCA ACGAAGTGGT GGTAGTTGGT
TACGGAACTC AAAAGAAGGA AAACCTGACA GGGGCCGTGG CGGCCATTAC CATCGATGAT
AAAATATCGA GCAGGTCGCT GTCAAACGTC TCCTCGGCGC TGTCGGGCCT GATTCCGGGT
CTGGCCGTTC AGCAGTCGAC GGGGCAGGCG GGGCGCAGCG GGGCCGCGCT AGTGATCCGG
GGGCTGGGAA CGGTCAATAA CTCGGGGCCG CTGATTGTCG TAGATGGTAT TCCCGATGTC
GACATCAACC GGATCGACAT GAACGACGTT GCCAGTATCT CCGTCCTGAA AGATGCGGCC
TCGGCTTCCG TTTACGGGTC AAGGGCAGCT AACGGCGTTG TGTTGATCAC CACCAAAAAC
GGCTCGCAGA ACAAGAAACC GGTCATCAGT TACACGGGCA CGTACGGCCT TTCAGAACCG
ACCAATTTCT ATAACTACTT CGATGACTAC GCCCGTTCGC TTACCATGCA CCTGCGGGCC
TCGGGGGCGG GGGCATCGTC GACCACCTTC CGGTATGGCA CCGTGGAGGA CTGGCTGTCG
AAGAGCATGA TCGACCCGAT CAAATACCCC AGCACCAACT GGTGGGATGT GGTGCTGCGC
GATAAGGGCC GGATTCAGAC ACATAACCTG TCGGCAGCGG GTGGCAATGA GCGTTCCAAT
TTTTACCTGT CGGCGGGGAT ATATGATGAG TTGGGTATCC TGATCAATCA CGATTACAAG
CGATACAACA CCCGATTTAA CCTGGACTAC AAACTGAGCG ACCATATTAA AGTCGGCATT
CGGATGGATG GACAATGGTC GAAGCAGACC TACGCCAACT CCGAAGGGCT GATTACCTAT
ACCGGAACCG GGGGCTACGA CATTCGCTAT GCCGTGGCCG GTATTCTGCC GCAAAACCCG
CTTACTGGTC AGTATGGGGG TGCAATGGCC TATGGCGAAG ATGCCCTGGC GTATAATATG
CTGGCGGCCA TGAACGTCAA CCATAACCTA CGCGACCGCC AGGAAGCCAA CGGTAATTTA
TACGGCGAGT GGACGCCCAT CACGGGCTTG ACCATCCGCG GTGATTATGG CCTGCGGTAT
TACAATCAAT TTACAAAAAG CTACGCCGAC CCCTCGGATG TCTTTAACTT CCAGACGAAC
CAGATTTCAC GCAATCTCGT ATCCAGCAGC GCTGGTATTA GCAACGCCAT CAATTCGGGC
TATAAAACGC TGCTTCAGGG CCGGGTAACG TACAACAAAA CGCTCTTCGG CAACCATCAG
CTGAGTCTGT TGGGGGCTTA TACGGAAGAA TACTGGTTCA ACCGAAACCT GTCGGCCAGT
CGCCTGGAGC GCATCAACCC GCTCCTGAGT GAAATCGACG CGGCTCTTAC CACGACGCAG
GCTGCCGGGG GTAACTCCGA CGCCGAAGGG TTGCGGTCGG GCATCGGTCG ACTCAATTAC
GTCGTCAACG ATAAATACCT GTTTGAAGTG AACGCCCGCT ACGATGGGTC CAGCAAGTTT
CTGCCGGGAT TTCAGTACGG CTTTTTTCCG TCGGCATCGG CAGGCTGGCG TTTCTCGGAA
GAGCCTTTCT TCAAGCGGTT CAGTTCCGTG GTATCGTCGG GTAAAGTCCG GGCTTCCATT
GGTAAGCTGG GCAACAACTC GGGCGTGGGG CGGTACGAAC AGCGCGATAT TTTCAACCTG
ACCAATTATA TCCTGAACGG GAAAATCACC AAAGGCTTTA GTTCGGCCAA AATTATCAAC
GAGGATTTCT CCTGGGAAGA AACCAACGTA ACCAACCTTG GGCTGGACCT GGCTTTCTTC
GGCGGTCGCC TGACAACCGA TATTGATTAC TACAACAAGC TGACTTCCGG GATGATCCGC
CCGTCGTCGC TGTCGACTTT CCTGACCGGC TACAACGCCC CGCGTGTGAA CATCGGCAAG
CTCCGGAATA CGGGTGTAGA AGTCAATGTC ACCTACCGGG CCAAGGTTCG TGATGCTAAC
GTTGGAGCAA CGCTCAACAT GGCCTTTAAC CAAAATAAAC TGCTGGAGTG GAATGAGTTT
CTGAGCAAAG GGTACACCTA CCTGAACCTG CCTTATCACT TCGCTTACAG CCGCGTGGCC
ACGGGCATTG CCCAGAGCTG GGAGGATATT GCCAACGCAC CCTATCAGGG ACAGTACTTT
TCGCCGGGCG ATATTCTGTA TAAAGACCTC AATGGCGACG GTCAGGTGAA CGACGAAGAC
CGTAAAGCCG AACCCAAATT TAACCGGGAT CAGCCTACGG GTACCTACGG CCTCAATCTG
TTTGCCAACT GGCGGGGTTT CGATGTCAGC GTACTCTGGC AGGCTGCCAC CGGCCGGAAA
GATTTCTGGC TGGAGCCGTT CAACAACGTC AACATTCCGG CCGCACGCAA CGCCTTTCAG
GACTTTTTAT GGAACGATAC CTGGAGCCTC GACAACCGGC TGGCGTCGCT GCCCCGGCTT
GTTACGGGTT CGGGTGGTAA CAACCAGGCC GAATCGACCT TCTGGCTCGA CAACTTTGGT
TACCTGCGGC TTAAGAATAT TCAGTTGGGC TACAACATTC CCACCAAATA TATCAGCCGG
TTGGGATTGA GTAAGGTCCG TATCTACGGA ACATCCGAAA ACCTGCTGAC CTTCACAAAA
TACCGCGGTG TCGACCCGGA AAAGAGCACG AGTGTGTCGG GAGCCGATAA TAATGACGAC
CCGTTTCCGC TGCTCAAATC CTATTCGTTC GGTCTTAACC TCAGCTTTTA A
 
Protein sequence
MPNYLLNRLQ QSIPYCWLAL LLTIGIANGQ TTTYSFSGRV LDEKNTGLPG ATVVLKNNNK 
TGTTTDANGK FTISMPTGGG TLVVSAIGYL AKEVAVTSET TIDVPMAPDV KTLNEVVVVG
YGTQKKENLT GAVAAITIDD KISSRSLSNV SSALSGLIPG LAVQQSTGQA GRSGAALVIR
GLGTVNNSGP LIVVDGIPDV DINRIDMNDV ASISVLKDAA SASVYGSRAA NGVVLITTKN
GSQNKKPVIS YTGTYGLSEP TNFYNYFDDY ARSLTMHLRA SGAGASSTTF RYGTVEDWLS
KSMIDPIKYP STNWWDVVLR DKGRIQTHNL SAAGGNERSN FYLSAGIYDE LGILINHDYK
RYNTRFNLDY KLSDHIKVGI RMDGQWSKQT YANSEGLITY TGTGGYDIRY AVAGILPQNP
LTGQYGGAMA YGEDALAYNM LAAMNVNHNL RDRQEANGNL YGEWTPITGL TIRGDYGLRY
YNQFTKSYAD PSDVFNFQTN QISRNLVSSS AGISNAINSG YKTLLQGRVT YNKTLFGNHQ
LSLLGAYTEE YWFNRNLSAS RLERINPLLS EIDAALTTTQ AAGGNSDAEG LRSGIGRLNY
VVNDKYLFEV NARYDGSSKF LPGFQYGFFP SASAGWRFSE EPFFKRFSSV VSSGKVRASI
GKLGNNSGVG RYEQRDIFNL TNYILNGKIT KGFSSAKIIN EDFSWEETNV TNLGLDLAFF
GGRLTTDIDY YNKLTSGMIR PSSLSTFLTG YNAPRVNIGK LRNTGVEVNV TYRAKVRDAN
VGATLNMAFN QNKLLEWNEF LSKGYTYLNL PYHFAYSRVA TGIAQSWEDI ANAPYQGQYF
SPGDILYKDL NGDGQVNDED RKAEPKFNRD QPTGTYGLNL FANWRGFDVS VLWQAATGRK
DFWLEPFNNV NIPAARNAFQ DFLWNDTWSL DNRLASLPRL VTGSGGNNQA ESTFWLDNFG
YLRLKNIQLG YNIPTKYISR LGLSKVRIYG TSENLLTFTK YRGVDPEKST SVSGADNNDD
PFPLLKSYSF GLNLSF