Gene Slin_4782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4782 
Symbol 
ID8728546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5824338 
End bp5827772 
Gene Length3435 bp 
Protein Length1144 aa 
Translation table11 
GC content57% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003389559 
Protein GI284039629 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCAAC ATGAACCAGC GTTAGTGAAT CCTGACGTGC ATCGGACCGA CTGCGAATCG 
GGCCCCGACA AGCCGGATTG TTTTGCCGAT TTCCTATCTA ACAAAACCCT TAAAGGTATG
AAAAAAAGAC TACCTGTGCC CACCGGCGGC TTACCCGGTT GGTCAGCCGA TCTACTTCCA
CGATTAATGA ATCTTTCGCT TACCCAGCTT TTCCTGATGA TTGCCTGCAC GAGTTTCTCG
TTTGCCTTCG ATGGCCAGGC GCAGGAATTA ATGAACCGTC CCGTAACCCT GAAGGTAGAG
GGGCAGCGGC TCCGCGTGGT GCTGGCGCAA ATCGAACAGC AAACAACGGC CCGCTTCGTT
TACAGCTCGA AGTCGATTGG TGTCGACCGC CCCATAACCA TCACCACCCG CGACAAACGG
CTGGCCGACG TGCTGACTGA ATTACTCCGG CCGCTGAAAC TGAGCTACCG GATGGTGGGC
GGTCAAATCG TGCTGGAAAG CGATGCCGAC GCCCATTCGC TGGTAACACC GGCAAACGAA
GCCGCCGACC GCGCGCTGTC GGGAATGGTG ACCGACGAGA AAAACGCGGC CCTACCGGGC
GTAAGTGTCG TGATTAAAGG CTCGAACCGG GGCTCCACTA CGGATGCCAA CGGGCAGTTC
AAAATCACGG TGCCGGACGG TAACGCCGTT ACGCTGACGT TCTCATTTGT TGGCTACCAG
AGCCAGGATG TGGTGGTTGG CAGTAAAACG ACGGTCAACG TGTCGATGGT ACCCGACGTC
AGTGCGCTGG ACGAAGTTGT GGTCATCGGC TACGGGGCCG TTCGCAAAAA AGACTTGACC
GGCTCGGTGG TGCAGCTCAA GAGTGAGCAA CTAAAAGAAG TACCGACCTC CAACGTGCTC
GAAGCCGCGC AGGGTAAAAT TGCCGGGGCC GACATTACCC GCAGCAGCGG TCAGGCGGGG
GCGCGAATAA ATATCTCCAT CCGGGGCAAC CGCTCCATTG GCGGCAACAA CTCCCCGCTC
ATTATCGTGG ATGGTATCCA GTACAGTAAC CTGGAAGACA TCAACGCCAA CGACATCGAG
ACGATGGATG TCCTGAAAGA TGCGTCATCT ACGGCCATTT ACGGGTCGCG CGGGTCGAAC
GGGGTTATTC TGATCACGAC CAAGAAAGGT AAGCTGGGCA AACCCGACAT TTCGTTCAAC
GCCTATTCCG GTATCTCGCA GGTGACGATG TACCCGAAGG CGATGGACAT TACCGGTTTC
CGGGATTTCA AACGGGAAGC GTGGCGGGCC GCCGGTATCT GGAAAAGCCC CGCCGATGAT
GCCGCCATTT TCACCAACGT AGCCGAATAC GACGCCCTGC AAAAGGGTCT CTGGACCGAT
TATCAGGACG CGCTGATTCA CAACGGGCTT CAGCAAAACT ACCAGGTGGG TATTCGCTCC
GGCACCGACC GGCTGAAATC GTACGTTTCG GTCGACTATT TCAATGAGAA AGGCATTCTG
AAACTGGACG AACTGAGCCG CTACACGGGC CGACTCAATG TCGACTTTAC CATTAACGAC
TGGATGAAAA TCGGGTTGCA AAGCCAGCTG ACGTACTACA ACCAGAGCGT ACGCCGGGAC
CCGCTCAACC AGGCCAACAA GATCAGTCCG CTGGGATCGC TCTACGATGC CAACGGCAAT
TTCAATTTCA TTATGCTCGA TGGACAGACC GCCAACCCCC TCTCCGACGA GCAGCCCAAT
GTGTTTAACA ACTCAGTGCT GACCACCCGG GTGCTCACCA ATGGGTACCT CGAACTGACG
CCCTTCAAAG GGTTCTCGTT CCGGAGTACG CTGGGCGTCA ACCTGGCTTC CATACGTGAT
GGGGCCTATT CATCGCCCAA ATCCATCGAC CGCTCGCTGA CGGGCAAATC GCTTTCTACG
TACAACACCA GCAACGGCCG CACCGTGAAC TGGGAGAACG TCATGACCTA CCAGCGGACC
TTCGGCCAGC ACGCCGTTAC CATGACGGGC ATCGCCAGTT ACCTGGGCAA CACCTCCGAC
AACTCGGCGG CTTCGGGCGT CAATCAGTTG CTGCCTTCGC AGTTGTTTTA CTCGCTGGGC
AGTGCCACGG AAGAGATCAA GATCAATTCG GCGTTTTCCA AGAACAACCT GGTCTCGTTT
GCCGCCCGCC TGAACTACGC CTTCCGCGAC CGGTATCTGC TCACGCTGAC CGCCCGCGAA
GATGGTTCGT CGAAGCTGGC AGCGGGTAAT AAGTGGACGT TCTTCCCGTC GGCGGCTTTC
GCGTGGCGGG TTATCGAGGA GAAATTCATG CAGGACGTAA AGGGCCTGAG CGACCTCAAA
ATCCGGGCAA GTTATGGCGT AGCGGGTAAC GACCCATCCG GCCCTTACGC GACCCAGACA
ACCCTGACCC GGCTGGCTTT TGGTTTCGAT GACATCTCGG CTCCGGCCTA TACCTTCTCC
CGAAACGTGG GCAACACCGC CCTCGGCTGG GAATTGTCGA ACACGAAGAA CCTGGGCGTG
GATTTCGGGC TGTTCAACGG GCGCGTCAAC GCATCGCTCG ACTACTACGA CACCCGCACC
TCCGATCTGC TGCTGGACCG GGGACTTCCA CCAACGACCG GCGTAACGAC GGTGAAACAG
AACATCGGCA AAACCCGCAA CCGGGGCCTT GAGCTGTCGC TGGGAAGTAC CAACATCCGC
ACCCAGAACC TGACCTGGAG CAGCAACGTC ACCTTCACTA AGAACAAGGA AGAAATCACC
GAACTGGTGA CCGGCTCCAA CGACATCGGC AACGGCTGGT TCATTGGCTC GCCCATCAGC
GTGTATTATG ACTACGAGAA ACTAGGCATC TGGCAAACTT CGGAAGCCGA CCTGGCCGCC
AAACTCGCGC CCACGCAGCT ACCCGGCGAA ATCAAAGTCA AAGACCAGAA CAACGATGGC
AAGATCGACG CCGTCAACGA CCGGATTATC CTGGGAACAC CCCGCCCAAA ATGGAGTGGC
GGCTTCGACA ACACGGTTAA ATTCAAAGGA TTCGATCTGA ACGTGTTCCT GTATGCCCGT
GTGGGCCAGA TGATCAACTC CGACCGGTCG GCGCGTTTCG ACCAGCAGGG AGTCGGCAAC
AGCACGGCTG GGCTAGACTA CTGGACGCCC GAGAATTCAA CCAACGCCTA TCCGCGCCCG
AACAAGAACG GCGGTTTGAA ATACCTCTCC ACGCTGGGTT ATCAGGACGG CACCTACGCC
CGCATCCGGA ATATTACGCT GGCGTATAAC GTCCCAGTCA AAGTGCTTCC CAAGGTGGTT
CGGGGCGTTC GCGTGTATGT AACGGGTAAG AACCTGGTCA CCTTCACGAA GCTGAATTAC
GACCCCGAGC GGGGTGGTTC GGAAAACTTC CCCATGACCA AACTGTACGT CTTCGGCCTG
AACGTCAACT TATAA
 
Protein sequence
MRQHEPALVN PDVHRTDCES GPDKPDCFAD FLSNKTLKGM KKRLPVPTGG LPGWSADLLP 
RLMNLSLTQL FLMIACTSFS FAFDGQAQEL MNRPVTLKVE GQRLRVVLAQ IEQQTTARFV
YSSKSIGVDR PITITTRDKR LADVLTELLR PLKLSYRMVG GQIVLESDAD AHSLVTPANE
AADRALSGMV TDEKNAALPG VSVVIKGSNR GSTTDANGQF KITVPDGNAV TLTFSFVGYQ
SQDVVVGSKT TVNVSMVPDV SALDEVVVIG YGAVRKKDLT GSVVQLKSEQ LKEVPTSNVL
EAAQGKIAGA DITRSSGQAG ARINISIRGN RSIGGNNSPL IIVDGIQYSN LEDINANDIE
TMDVLKDASS TAIYGSRGSN GVILITTKKG KLGKPDISFN AYSGISQVTM YPKAMDITGF
RDFKREAWRA AGIWKSPADD AAIFTNVAEY DALQKGLWTD YQDALIHNGL QQNYQVGIRS
GTDRLKSYVS VDYFNEKGIL KLDELSRYTG RLNVDFTIND WMKIGLQSQL TYYNQSVRRD
PLNQANKISP LGSLYDANGN FNFIMLDGQT ANPLSDEQPN VFNNSVLTTR VLTNGYLELT
PFKGFSFRST LGVNLASIRD GAYSSPKSID RSLTGKSLST YNTSNGRTVN WENVMTYQRT
FGQHAVTMTG IASYLGNTSD NSAASGVNQL LPSQLFYSLG SATEEIKINS AFSKNNLVSF
AARLNYAFRD RYLLTLTARE DGSSKLAAGN KWTFFPSAAF AWRVIEEKFM QDVKGLSDLK
IRASYGVAGN DPSGPYATQT TLTRLAFGFD DISAPAYTFS RNVGNTALGW ELSNTKNLGV
DFGLFNGRVN ASLDYYDTRT SDLLLDRGLP PTTGVTTVKQ NIGKTRNRGL ELSLGSTNIR
TQNLTWSSNV TFTKNKEEIT ELVTGSNDIG NGWFIGSPIS VYYDYEKLGI WQTSEADLAA
KLAPTQLPGE IKVKDQNNDG KIDAVNDRII LGTPRPKWSG GFDNTVKFKG FDLNVFLYAR
VGQMINSDRS ARFDQQGVGN STAGLDYWTP ENSTNAYPRP NKNGGLKYLS TLGYQDGTYA
RIRNITLAYN VPVKVLPKVV RGVRVYVTGK NLVTFTKLNY DPERGGSENF PMTKLYVFGL
NVNL