Gene Slin_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2101 
Symbol 
ID8725839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2532200 
End bp2535316 
Gene Length3117 bp 
Protein Length1038 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003386935 
Protein GI284037005 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.693236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.298537 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTTA CCCGATTTGT CGGATTTTGG ATTGTGTGGA CATGCTTTTT ATTAAGCAGT 
CAACCAGTTT TAGCGCAGGA AAGCCATACC GTAAAAGGAT TGGTACTCGA CGAAAACCAG
AAACCGCTAT TTGGTGCCTA TGTGATCCTG AAGAACACAA CAACCGGCAC GACAACGGAT
GTCAATGGAG AATTTGCCCT GAAAGTACCC GCCGGTAAGC AAACGCTGGC CATCTCTTAT
CTTGGGTCGA AGCCACAGGA CATTACGGTG GAAAAGGGGG GGAATGTTAA GGTTGTACTC
TCGGGCGGTG ATATTACCCT GGGCGAGACA GTCGTGGTGG GCTATGCGCA GCAAAAAAAG
CAAAGCGTTG TGGGAGCCAT CTCACAAACC ACCGGGAAAG TACTCGAACG AGCCGGTGGC
GTATATAGTG TAGGGTCAGC ACTGACCGGT AATGTGCCGG GGGTGATTAC TACCTCCAGC
ACAGGCATGC CCGGCGAAGA AGATCCCCGC ATTCTGATTC GGGGGTTAAG CTCCTGGAAC
AACAGCTCTC CGCTCATTCT GGTCGATGGG ATCGAACGCC CCATGAACAG CGTTGACATT
TCCTCGATCG AAACGATTTC TGTACTGAAA GATGCGTCCG CAACGGCTGT ATTTGGCGTG
AAGGGTGCCA ACGGCGTTAT TCTGATCACC ACTAAACGGG GTAAGGAAGG TAAAGCGAAC
ATCAGCATAC GGGCCGACTA CACCCTAAAG GCACCCTCTA CCCTACCCAA CAAGTATGAC
GCCTACGATG CCCGACGCAT CCGTAACCAG GTCATCGAAA ACGAATTAGG GCTGGAGTCG
GCAGGCTGGG CAACCTACAC GCCCTACGAT ATTCTAACGA AATACCGCAA CCCGGCCAAC
CTGGCCGAAG CCGAGCGCTA CCCGAACGTA AATTGGGCCA AAGAGATGTT CAAGCCGGTT
ACCCAAACGT ATAGCGCCAA CCTCAACGTA TCGGGAGGAA CGTCGTTCGT CAAGTATTTC
GCATCGGCCG ACTTTCTGAT GGATGGCGAC ATTTTTAAAG AGTGGGATAA CAACCGGGGG
TATCAGGCGG GCTACGGATT CAACCGGATC AACGTACGGA CAAACCTTGA TTTTCAGCTG
ACACCCACGA CGTTGCTGGT CACCAACCTG TCCAGCTCAA ACGGGGTCAA AAAGAGTCCC
TGGGGGGCTA CCGGTGGCGA ATACGATATG TGGCAAGGTG CCTATGGCGT AGCGCCTGAT
GCCATGCTGC CCCGCTATTC CGACGGTACC TGGGGCTATT ATGCGCAGGA CCCCGTGGCG
GCTACCAACT CCATTCTGAA TCTGGCCCGC AGTGGTATCA TGAAGCGAAC CACGAGCCGG
ATCACCACGG ATTTTACGCT GAATCAGGAT TTGAAATCAA TTGCGAAAGG CCTAAGTACA
TCGGGCACCA TTTCGTGGGA TAATAGCTTT GTGGAGAGCC AACGCGGCAT CAACGACCTC
TACAATGATC CGCAGACGAA GTGGATCGAC CCCGCTACGG GGGAGTCGCG CTACCGGTTT
ACGTATGATG CCGCTAATAT GTTCGACTTC CAGGAGGCTA TCCGCTGGGC ACCACAGGCG
GGTACAATGG ATAACGGAGC GACTTATCGG CGATTATTTT ACCAGCTCAA ACTGAATTAC
AACAGGACCT GGAATAAGCA CACGGGCTCG GCTATGGGGC TGTTCAGCCG CGAAGACCGG
GCGTCGGGCA GCGAGATTCC CAATTATCGG GAAGACTGGG TGTTCCGTAC CACCTACGAT
TATGCCGGTA AGTACTTCGC CGAAGTCAAC GGGGCCTATA ACGGTTCCGA AAAGTTTGGT
AAAGACAACC GCTTTCACTT TTTCTCGTCG GGTGCGGTGG GCTGGCTGCT ATCGGAAGAG
GCATTTTTCA AGCGGGCCAA GTTTCTGGAT CTGCTCAAGC TGCGGGTGTC CTACGGAAAG
ATCGGTGATG ACAACATCAA CCAGCGATGG CTCTATATGA CTCAGTGGGG ATATGACGGC
CAGGCAAAAA TAGGCGAGAA CAACTCGGAT GTCAGCCCCT ATGTCTGGTA TCGGGAGTCC
TCGGTGGGTA ACCCGTCGGT ACACTGGGAA ACGGTAACGA AAACGAACCT GGGAGCCGAT
TTCTCCCTGT TCAACGGGAC TATTTCGGGT AGTGCGGATT ACTTCAACGA CTACCGCACG
GATATTCTCA TTGCTGGTGG GTCGCGGGCT ATTCCGTCTT ACTATGGCAC CAATGCCCCC
GTTGCCAACC TGGGTAAAGT GCGGGTGAAA GGCTATGAGC TGGAAGTAAA GGTAAACCAT
CAGCTGACCA ACGGCATCCG GCTCTGGGCC AACGCAAACA TGACCCATGC CAAAGACCGG
ATCATTGAAG CGGATAACCC AAGCCTGCTG CCCGACTACC GCAAGAGCGA GGGCAAGCAG
ATCGGACAGA CCTATTCGTA CGTTAGCCAC GGCTACTACA ACAACTGGGA TGAGCTGTAT
GGCAGCACCG AACAGAACAC CAACGACCGG CAAAAGCTGC CGGGTAACTT CCAGATCGTT
GATTATAACG GCGATGGCAA GATCGACACT TACGACAATA TCCCATATGG TTTCCCCGAG
CGTCCGCAGA ATACTTACAA CGGTACGGTT GGTTTTGAGT GGAAAGGATT CAGCGCCTTC
GCGCAGTTTT ATGGCGTCAA TAACGTAACC CGTCAGGTGG TGTTCACCAG CTTTGGCTCC
CGAATGAACA CTGTTTATAA CCAGGGCGAG TACTGGACGA AAGACAATCC AACGGCGAAT
TCACCTCTGC CCCGTCTGAT GACGGCCACC GACCCATCGA CCTACGGCAA CTTCTATATG
TACGATGGCT CCTACGTGCG CCTGAAGAAC GCCGAGGTCG CCTACACCTT CAACCGGGGC
TGGATCGAAC GGTTCGGGCT GCGCACGCTG CGAATCTACG CCAACGGGAA CAACCTCTGG
CTCTGGACCA AAATGCCCGA CGACCGCGAA TCGAATTTCG CGGGTACGGG CTGGGCCGCC
CAGGGCGCTT ACCCGACAGT GAAGCGGTAC AATTTCGGCA TCAACATTAC CCTGTAA
 
Protein sequence
MNVTRFVGFW IVWTCFLLSS QPVLAQESHT VKGLVLDENQ KPLFGAYVIL KNTTTGTTTD 
VNGEFALKVP AGKQTLAISY LGSKPQDITV EKGGNVKVVL SGGDITLGET VVVGYAQQKK
QSVVGAISQT TGKVLERAGG VYSVGSALTG NVPGVITTSS TGMPGEEDPR ILIRGLSSWN
NSSPLILVDG IERPMNSVDI SSIETISVLK DASATAVFGV KGANGVILIT TKRGKEGKAN
ISIRADYTLK APSTLPNKYD AYDARRIRNQ VIENELGLES AGWATYTPYD ILTKYRNPAN
LAEAERYPNV NWAKEMFKPV TQTYSANLNV SGGTSFVKYF ASADFLMDGD IFKEWDNNRG
YQAGYGFNRI NVRTNLDFQL TPTTLLVTNL SSSNGVKKSP WGATGGEYDM WQGAYGVAPD
AMLPRYSDGT WGYYAQDPVA ATNSILNLAR SGIMKRTTSR ITTDFTLNQD LKSIAKGLST
SGTISWDNSF VESQRGINDL YNDPQTKWID PATGESRYRF TYDAANMFDF QEAIRWAPQA
GTMDNGATYR RLFYQLKLNY NRTWNKHTGS AMGLFSREDR ASGSEIPNYR EDWVFRTTYD
YAGKYFAEVN GAYNGSEKFG KDNRFHFFSS GAVGWLLSEE AFFKRAKFLD LLKLRVSYGK
IGDDNINQRW LYMTQWGYDG QAKIGENNSD VSPYVWYRES SVGNPSVHWE TVTKTNLGAD
FSLFNGTISG SADYFNDYRT DILIAGGSRA IPSYYGTNAP VANLGKVRVK GYELEVKVNH
QLTNGIRLWA NANMTHAKDR IIEADNPSLL PDYRKSEGKQ IGQTYSYVSH GYYNNWDELY
GSTEQNTNDR QKLPGNFQIV DYNGDGKIDT YDNIPYGFPE RPQNTYNGTV GFEWKGFSAF
AQFYGVNNVT RQVVFTSFGS RMNTVYNQGE YWTKDNPTAN SPLPRLMTAT DPSTYGNFYM
YDGSYVRLKN AEVAYTFNRG WIERFGLRTL RIYANGNNLW LWTKMPDDRE SNFAGTGWAA
QGAYPTVKRY NFGINITL