Gene Slin_6529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6529 
Symbol 
ID8730315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7921930 
End bp7925157 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content54% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003391285 
Protein GI284041355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAC ATATGCTTTC AGGCGCGTTC GTCCTGAGTA GCTGGCTTAC TGTCCACGCT 
CAGGATCGAA CCGCACCCAG ATTAGCCCAG TTGGACAATC CGGCTGTTCT GGCCACAAAC
AGGGGTGCGA AAACGCGCAT CGACATCCAG TCGGATGTGA AAGCCCTGAT AAAAGGGACG
GTGTCCGACG AAAAAGGGAA TACGTTGCCC GGTGCTACGG TATCGGTGAA AGGGACACAG
TTAGGAACGA CGACGGACGT GAATGGAGCC TTCTCGATCA ATATGCCCGC TGGTGCTAAA
GTACTCGTCA TCTCGTTTAT CGGGATGAAA ACGCAGGAGG TTGAAGTGGG TAGCCGCACA
ACGCTCAACA TCACTCTCCA GACCGGCGAT CAGTCGCTGG ACGAAGTGGT CGTTATTGGG
TACGGTACGG CCAAACGGTC GGATGTGACC TCGTCCATCA CAACGGTAAA AGCGGCCGAT
CTGAAAGATA TTCCGGCGGC TGGTATTGAC CAGCTTCTAC AAGGTAAAGC GGCCGGTGTA
ACGGTAACCA GCAACGGTGG TCAGCCGGGC GGTGGCGTAT CGGTAAAAGT GCGCGGAGTT
ACGTCCATCA ACAGCAATGA CCCGCTGTTC GTGATCGACG GCGTACCGTT CGTGGGCGGT
AACACCTCGA ACAGCACGGG CTATGCCGGT CTGGGCGGTG GCGATGGCCA AACCGGAAAC
AGCGTAATGG CCATGCTGAA CCCTAACGAT ATTGAGTCCA TCGACGTCCT GAAAGATGCG
TCGGCGCAGG CTATCTACGG GTCACAGGCG GCTAACGGCG TTATTATGGT GACCACCAAG
AAAGGTAAGC AGGGCGAAGG CAAGATCAAC TACGAAATGT ACACCGGCGT TTCGGAAGTG
GCTCGTCGGC TGAACCTGAT GAAGCTGCCC GATTTTGCCC GATACCAGAA CGAAGTGCTT
CCAATCATTG GCAACCCGGT TGCCGATGAG TTCAAAAATC CGGATCTGCT CGGGCCCGGT
ACCGACTGGC AGGAAGCGAT GTTCCAGCAG GGTAAAATCA ACAACCACCA GTTGAGCTTC
TCTGGTGGAC GGGACAAGAC GACCTACTAC CTGTCGCTGA ACTACTTCGA CAACAAAGGG
ATTCTGCTGG GTTCGGATTT CAAACGGTAT TCGTCGCGTT TCAGCCTCGA TACACAGCTG
AAGAGCTGGG TTAAAGTAGG TCTGAGTGCC AACGTATCGC GCAGTATCCA GAACGTATCC
CTGGCCGATG CGGCCGAAGG AACCATCTGG TGGGGTGCGT CGACTAGCCC GCTGATTCCC
GTAAAAAACC TCGATGGTAC GTGGGGTGGT GGCCAGACGG TGGGTGGTGT GCAATATAAC
AACGCCAACC TGGTCGGTAA CAGCCAGTTC CGGGGCAATA CCAAAACATC GAACAACGTG
TTTGGTAGCT TGTACGCAGA ATTTCAACTG CTGAAAGGCC TATCGTTGCG CAATGAACTG
TCGTACTCGC TGGGGCAGGA TAACAACATC GCGTTCCAGA AAGCCGGTAA CGTGGGTGGT
ACGTCGTTCC GTAGCAAACT GATCGACTCC CGGTCGGATA GCTACTACTA TTCGATTACC
AACTACCTGA ATTACAACCT GTATTTCAAG AAACACGGTA TTCAGGCCAC GCTGGGCCAT
CAGGCGCAGC ACTCGTATTA CCAGTCGATT TCGGGTACGA AAGTGGATTT GCAGGCCAAC
ATCTTCGACT TGAACACGGG TAGCTCGGAC CAGACGACCT GGGGCTTGAG TGGTGGTAAA
GGCCAGTGGG CTATGGAGTC GTATTTCGCC CGGGCTAACT ACACGTATGA TGATCGCTAC
TCGATTTCGG CCAGTTTCCG CGCCGATGGT TCGTCGAACT TCGGTCCTAA CAACCGCTGG
GGGTATTTCC CCGGTGTATC GGCGGGCTGG ACGATCTCGA ACGAGAAGTT CATGAAAGGC
AACATCTCGA AAGTGTTGAG CTATGCTAAA GCACGTCTGG GCTACGGTAT CGTAGGTAAC
CAGAACTTCC CCGGTGGCGC ACCGAACCCA GCCTATGTGG GTGCGGTTCA GTTCTTCTCC
GGTCCGGTGG GCTTCGGCTC ATCGAACATG ATCAACGGGA TACCAAACCC GAACCTGAAA
TGGGAATCGG TGAAAACGGC CAACGCCGGT GTTGACCTCG GCTTCTTCAA CGGCCGCATC
GACGCTACCA TCGATGTCTA CAAGAAAGTA ACCTCCGACA TGATCATCTT CCTGACCGGC
CCGAACCTGA TCGGGGTAGG CGACCAGTGG GATGATCTGA AAGCACCGCT GGGCAACGCG
GGTCAGATGA CCAACACGGG TATCGACATT GGCCTGACCA CGACCAACAT CAAAAAGGGT
AACTTCAGCT GGAAGAGCAA CGTTGTGTTC ACGCAGTTTA CGAACCGTTA CGACCGGGCC
GCCAGTGCTG CTTCGGCTCT CGATGGTAAG GTGTACTACA ACAACTACCT GATTACGCAC
ACCACGCCAG GCACGCCTGT TGGCTCGTTC TGGGGATTGG TAACGGACGG ATTGTTCCGT
ACGCAGGCCG ATCTGGACGC CAGTCTGCCG CAGTTCGGCT ACAAAGTGAA CCAGACAGAA
ACCTGGTTGG GCGACATCCG GTACAAAGAC ATCAACGGCG ACAAAAAGAT CGACGCGCAG
GACCTGACAT TCATTGGTAG CCCGCTGCCG AAGTTCACCT GGGGCTTCAC CAACACGCTG
AACTACGGGG ATTTTGATTT CACGCTGTTC TTTCAGGGAA GCCAGGGCGC CAAAGCTTAC
AACTTCCTGC GCTGGCAGTT GGAAGGGCTG AACAACGCTT ATACTAACCA ACTGAATACG
GTAACGGACC GGTACACCGA AAAGAATCCG AATGGTGCGC TGCCACGCTT CACGAATACC
AACAAGAACA ACACGGCCAT GTCGGACCGG TATGTGGAGG ACGCATCGTA CGCCCGGATT
CAGAACATCA CGCTGGGTTA CCGGCTGCCA AGAACGCTGC TGAGTAAAGT GAAGATTACT
AACCTGCGGG TATACGGGTC GATTCAGAAC CTGAAGACGT TCACCAACTA CTCGGGATAC
GACCCAGAAA TCGGGGCCTT CAATAACAGC ATCAAACTCA TGAACGTGGA CACGGGCCAC
TACCCGAACC CACGGACGTT CACCGTAGGC GCAAATCTGC AATTCTAA
 
Protein sequence
MIKHMLSGAF VLSSWLTVHA QDRTAPRLAQ LDNPAVLATN RGAKTRIDIQ SDVKALIKGT 
VSDEKGNTLP GATVSVKGTQ LGTTTDVNGA FSINMPAGAK VLVISFIGMK TQEVEVGSRT
TLNITLQTGD QSLDEVVVIG YGTAKRSDVT SSITTVKAAD LKDIPAAGID QLLQGKAAGV
TVTSNGGQPG GGVSVKVRGV TSINSNDPLF VIDGVPFVGG NTSNSTGYAG LGGGDGQTGN
SVMAMLNPND IESIDVLKDA SAQAIYGSQA ANGVIMVTTK KGKQGEGKIN YEMYTGVSEV
ARRLNLMKLP DFARYQNEVL PIIGNPVADE FKNPDLLGPG TDWQEAMFQQ GKINNHQLSF
SGGRDKTTYY LSLNYFDNKG ILLGSDFKRY SSRFSLDTQL KSWVKVGLSA NVSRSIQNVS
LADAAEGTIW WGASTSPLIP VKNLDGTWGG GQTVGGVQYN NANLVGNSQF RGNTKTSNNV
FGSLYAEFQL LKGLSLRNEL SYSLGQDNNI AFQKAGNVGG TSFRSKLIDS RSDSYYYSIT
NYLNYNLYFK KHGIQATLGH QAQHSYYQSI SGTKVDLQAN IFDLNTGSSD QTTWGLSGGK
GQWAMESYFA RANYTYDDRY SISASFRADG SSNFGPNNRW GYFPGVSAGW TISNEKFMKG
NISKVLSYAK ARLGYGIVGN QNFPGGAPNP AYVGAVQFFS GPVGFGSSNM INGIPNPNLK
WESVKTANAG VDLGFFNGRI DATIDVYKKV TSDMIIFLTG PNLIGVGDQW DDLKAPLGNA
GQMTNTGIDI GLTTTNIKKG NFSWKSNVVF TQFTNRYDRA ASAASALDGK VYYNNYLITH
TTPGTPVGSF WGLVTDGLFR TQADLDASLP QFGYKVNQTE TWLGDIRYKD INGDKKIDAQ
DLTFIGSPLP KFTWGFTNTL NYGDFDFTLF FQGSQGAKAY NFLRWQLEGL NNAYTNQLNT
VTDRYTEKNP NGALPRFTNT NKNNTAMSDR YVEDASYARI QNITLGYRLP RTLLSKVKIT
NLRVYGSIQN LKTFTNYSGY DPEIGAFNNS IKLMNVDTGH YPNPRTFTVG ANLQF