Gene Slin_2474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2474 
Symbol 
ID8726218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2991081 
End bp2994368 
Gene Length3288 bp 
Protein Length1095 aa 
Translation table11 
GC content52% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003387292 
Protein GI284037362 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.840021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.838838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAAA TGTTTACACA ACAAAAGCTC CTCTTGTTTA GAACAGGGGC TTCCCTGACA 
AACGCAGACG TCAGTCGACG GTTTTCGTTT TGTTTGTTAT TCCTTCTGGG CCTCCTGGTC
TCAGTAGGTG CTTACGCCCA GACGAAGGTT TCGGGTAAGG TGGTTGATGC GCAGGGGCTA
GCTCTGCCGG GGGTGAGTAT CGTCGTGAAA GGTACGACAA TGGGTACGGT TTCGGCCGCA
CAGGGTGATT ATACCCTTAA TCTGGCCAAA GGGAATGAAA CGCTGGTTTT TTCCTACATC
GGATTCCTGA CTCAGGAAAT ACCCGCTAAC AACCGGTCTA TGATTAACGT TACACTCGCT
TCAGACGATA AAATGCTGAA CGAGGTAATT GTGGTTGGTT ACGGCGAGCA GAAGAAAGAG
ACGGTAACGG GCTCGGTTGC CACCGTAAAA GGTAGTGAAT TGATCAAGTC GCCGGCGGTC
AACCTGTCGA ACTCGATTGC AGGCCGGATG CCGGGCGTTA TCGCCACTAA CGCCAGTGGT
GAGCCGGGTT ATGATGGAGC CGCTATCAAG ATCCGGGGTT CTAACACGCT GGGTAACAAC
GACGCGCTGA TCGTAATTGA CGGTGTACCG GCACGGGCCG GGGGTATCGA CCGCTTGAAT
CCCAACGACA TCGAGAGCAT CTCGGTATTG AAAGATGCGT CGGCTGCCAT TTACGGATCA
CGCGCTGCTA ACGGGGTTAT CCTGGTAACT ACCAAGCGGG GTAAAACGGG TAAGCCTGAC
ATTTCGTACA GCTTCAACCA GGGCTTCGCT CAGCCAACCG TCATTCCTAA GATGGCGACA
GCCTCTCAGT ATGCGGAGTT GAACAATGAG ATCAACGTTT ATAACCTGCC TTCTCAGTAT
TGGAAAGATG CTTCGGCTGC ATTCAAGGCT ACGGGTAGCT ACACCCGCCC TGACAACGGC
TCGATTGCCA AAGCGGCTTT CACGCCCGAT GATATGAAAA AGTATCAGGA TGGTTCCGAT
CCCTGGGGTC ATCCCAACAC TGACTGGTTT GGTGCGGCTC TGAAAAACTG GTCGCCACAA
ACCCGGCATA CCCTGCAACT GGTGGGTGGA AATGACAATG TTAAGTATTT AACCTCTGTC
AATTATCAGA ATCAGGATGG CTACTACAAG AACTCGGCCA CGGGTTACAA GCAGTATGAC
TTCCGCATGA ACCTGGATGC TAAGGTTAAC AAGTACATTA ACACAGTGGT TGGCGTGGTA
GGTCGTCAGG AGAACCGTTT CTTCCCAACA GTAGGGGCGG GGGCGATCTT CCGTATGCTG
ATGCGGGGTT ACCCCAACAA GCCAGCTTTC TGGCCAAACG GTCTGCCCGC TCCGGATATC
GAGAACGGAC AGCAGCCTGT ACTTGTTACG ACCGATGCTA CTGGTTATGA CAAAGACACT
CGCTATTATC TGCAAAGCAA TGCCAGCGTA ACGGTAACTA ACCCTTGGAT TGCCGGTCTG
AAGTTTGTAG GTAGTGTGGC CCTGGACAAA TACATCCAGC AGGGTAAAAC ATGGCAGACG
CCGTGGTTCG TATATAGCTG GGATTATACC TCCTACGATG CCAACAAGCA ACCACTTCTG
CAACGCGTTC AGAAAGGACC TGCTCAGGCT ACGCTGAATC AGTACACCAA CGACCAGTTC
AACTCGCTGT TGTCGGGTAT CCTCTCGTAT GACCACGTCT TCGGGGGTAA CCACGCGGTA
ACGTTACTGG CCGGTATTAC CAAGGAGCAG TCCAATTCAA GCGGTTTCTC AGGCTTCCGG
AAGTACTTTG CCTCGACCGC CATCGACCAA CTGTTTGCTG GTGGTAGTGC CGAGAAAAAC
TCGAACACCA CCGCTGCCTG GCAACGGGCT CGTATGAGCT ACTTCGGTCG GGCGGGCTAC
AACTTCAAGG AGAAGTATCT GGCCGAGTTC CTGTGGCGTT ATGATGGTTC CTATATGTTC
CCTTCAGCTA CCCGTTGGGG CTTCTTCCCC GGTGTAACGG CGGGCTGGCG TATTTCGGAA
GAAAACTTCT TCAAGAAGAG TCTGCCAGCA GTAAGTTCGT TGAAACTGCG GGCTTCATGG
GGTCAGTTGG GTAACGACCA GGTATACTTC AACGGTTCGC TGCGTGAGTA TGACTACCTG
CCTACTTATG CCTATGGCGA CGTAGTGAAC TCGAACTGGG GTTATGTAAC GGGTGGTCAG
GTGTCGCAGA CACTGTATGA GAACGGGGTG CCTAACCCAA CCCTCACCTG GGAAGTGGCT
AACAACGCCG ACATCGGTCT GGAAGGCTCG CTGTTGAACG GGAAGATCTT CTTCGAATTC
GACGTATTCC AGAATAAGCG GTCGAATATT CTGTGGCGCA AGAGTGCTTC CATTCCTCAG
ACCACGGGTA TGACCCTGCC AGCAACGAAC ATTGGTAAGG TGACCAACAA AGGGTATGAG
TTCAACATTG GTTACAATGG TCAAACGAGC GGTGGTCTAA AGTATAGCGT TAGCGTCAAT
GGTGGTTATG CCAAGAACGA AATCACGTTC TGGGACGAAA CGCCAGGTGC ACCCGAGTGG
CAGCGGTCGA CGGGTAAGCC CATCCCAAGT AACGTGAACG ATCCGAACCA GCAAAACGGT
ACACTGCTTT ACCAATATGA CGGTATCTTC TCGACACAAG CCGACATTGA TGCCAACAAG
CTGGATTACA GTGGTGTTGG AGCCAGCCTG CTACGTCCTG GCGACATGAA GCTGAAGGAC
ATCAACGGCG ATGGCAAGAT CAACGGCGAT GACCGGGTTC GGGCCGACCG CAACAACCAG
CCTCGTTTCC AGGGTGGTTT CAACGCCAAC CTGCGCTATA AGAACTTCGA CCTGAGCATT
CTGGTACAAG CTTCGGCTGG TGGTCAGATC TTCCTGCAAA CGGAGTCGGG TACCATTGGT
AACTTCCTCG CCTGGAGCTA TGACAACCGC TGGACGGTCG ATAACCCAAG CACGGTTAAC
CCCCGCATCG TTGACCGGAG CAACCAGTAC TTCTCCAACG GTACCAGCTA CTGGTTGAAG
AGCACGGACT ACGCTCGTCT GAAAAACCTG GAGTTAGGCT ATACCTTACC GAGCACCATT
GGTAGTAAAA TTGGTCTGAA CAACCTGCGC GTTTATGTTA ACGGTCTGAA CCTGATCACC
TACGCACCTG CCATGAAGGG TCTGTTTGAT CCTGAATCGA CCAGCGGTAG TGCTCAGTAC
TATCCTCAGG CACGGGTTAT CAACACAGGG GTATCCGTTA GTTTCTAA
 
Protein sequence
MQKMFTQQKL LLFRTGASLT NADVSRRFSF CLLFLLGLLV SVGAYAQTKV SGKVVDAQGL 
ALPGVSIVVK GTTMGTVSAA QGDYTLNLAK GNETLVFSYI GFLTQEIPAN NRSMINVTLA
SDDKMLNEVI VVGYGEQKKE TVTGSVATVK GSELIKSPAV NLSNSIAGRM PGVIATNASG
EPGYDGAAIK IRGSNTLGNN DALIVIDGVP ARAGGIDRLN PNDIESISVL KDASAAIYGS
RAANGVILVT TKRGKTGKPD ISYSFNQGFA QPTVIPKMAT ASQYAELNNE INVYNLPSQY
WKDASAAFKA TGSYTRPDNG SIAKAAFTPD DMKKYQDGSD PWGHPNTDWF GAALKNWSPQ
TRHTLQLVGG NDNVKYLTSV NYQNQDGYYK NSATGYKQYD FRMNLDAKVN KYINTVVGVV
GRQENRFFPT VGAGAIFRML MRGYPNKPAF WPNGLPAPDI ENGQQPVLVT TDATGYDKDT
RYYLQSNASV TVTNPWIAGL KFVGSVALDK YIQQGKTWQT PWFVYSWDYT SYDANKQPLL
QRVQKGPAQA TLNQYTNDQF NSLLSGILSY DHVFGGNHAV TLLAGITKEQ SNSSGFSGFR
KYFASTAIDQ LFAGGSAEKN SNTTAAWQRA RMSYFGRAGY NFKEKYLAEF LWRYDGSYMF
PSATRWGFFP GVTAGWRISE ENFFKKSLPA VSSLKLRASW GQLGNDQVYF NGSLREYDYL
PTYAYGDVVN SNWGYVTGGQ VSQTLYENGV PNPTLTWEVA NNADIGLEGS LLNGKIFFEF
DVFQNKRSNI LWRKSASIPQ TTGMTLPATN IGKVTNKGYE FNIGYNGQTS GGLKYSVSVN
GGYAKNEITF WDETPGAPEW QRSTGKPIPS NVNDPNQQNG TLLYQYDGIF STQADIDANK
LDYSGVGASL LRPGDMKLKD INGDGKINGD DRVRADRNNQ PRFQGGFNAN LRYKNFDLSI
LVQASAGGQI FLQTESGTIG NFLAWSYDNR WTVDNPSTVN PRIVDRSNQY FSNGTSYWLK
STDYARLKNL ELGYTLPSTI GSKIGLNNLR VYVNGLNLIT YAPAMKGLFD PESTSGSAQY
YPQARVINTG VSVSF