Gene Slin_0441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0441 
Symbol 
ID8724169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp547991 
End bp551095 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content50% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003385304 
Protein GI284035374 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000408448 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTCAA CAATTACCCA ACCACCACGC CTGGCGTGGC TTCCTCTGCT TGGCTTAGCG 
GCACTTACGC TGGTAAGCCA ACCGGCGTTT AGTGCGCCAC CAACAGTAAA ACTTAAGTTA
GCCAGCCCAA CTCAGGAACG CTCCGTTGCT GGTAAAGTAT TATCAGGCGA TGATAACACT
GGATTACCGG GTGTAAGCGT TGCCGTGAAG GGCACCACGC GCGGTACAAC TACTGACGCT
AACGGCGAGT ACAAAATCAG CATACCTAAC GAACGGGCTG TTCTGGTTTT CTCCGCTGTT
GGCTTTATTA GCCAGGAAGT TACTATCGGC AATAAGTCAA CGGTTAATCT AACCCTAAGC
ACTGATACAC GCGCCCTGAA TGAAGTCGTT GTTATTGGCT ACGGTTCTCA GAAAAAGAGC
CAGACAACGG GAGCTATTTC GTCAGTTACG CCAAAGCAAA TTACAGAACA GCCTATTACC
AACATTGGTC AGGCCATGCA AGGCCGGGTA GCAGGTGTCG ACGTAGCACA GTCGGGTAGC
CGACCAGGTT CCGTACCAAC AATCCGGGTT CGTGGGCGTC GTTCGTTCAA TGCCGGTAAC
GACCCGCTCT ATGTAGTTGA CGGGATTCCC CTTTCAGAAG GTTATGAAGA CATTAACCCG
AACGATGTGG GTTCGATGGA AATCCTGAAA GATGCTACCG CAACGGCCAT TTATGGTGCC
AGAGGTGCCA ATGGCGTTAT TCTGGTTACA ACCAAGCGGG GTAATCCGGT TGGTAAAACA
ACCATCAGCT ACGATAACTA CGTCGGTTTT ACCGATGCGC TGGATAAAGT AAAGCTGTTC
AGTGGCTCTG AATTTGCCGA ATTTGTTCGG GAAGCTTACC GGACTACAGG CAACTACAAA
GACGCGAACG GCAATCCCGT TCCAACGGGT GTGGCCGATC CATATGCCGA CTCCAAAGTG
GCGGTACTGG GTGGTGACCC GAACGTTGCA GCTGGCCTTG CCGCCAACCG GAATACTGAC
TGGCAGTCGT TGATTCTGAA GCAGGGAGTT CAGCAGAATC ACTCGTTGGG TATTCAGGGC
GGCAACGAGA AAACGCAGTT TTATATATCG GCTGGTTTTT TCCAGGACAA AGGGATTATG
CCTGGTCTGG ACTTTACCCG TCAGTCGCTG CGTGCCAATA TTGATCACCA GATCAACAAG
GCTCTTAAAG TGGGGATTGC CTCGTATATG ATGTATAGCG TACGGAACGG AGAGACGCTG
AACCCCTATA ACTTTACCCT TCAGCAAAAT CCGCTTGGTC GGCCTTACGA CGATAACGGT
AACCTGATCT TCTCGCCTAC GAACGATGCG CTGCTTACCA ATCCACTCGC CGAAGTTGTG
CCGGGTGCTC AGGTAGAGAA TAGAAAGAAA TACCGCATTT TCAACAGCGT TTACGCAGAA
GTAAACATCC TTGAGGGCTT AAAATACCGC GTTAACTTCG GGCCAGACTT TACCATCAAC
CGATTTGGCC GCTTTATCGG TGCGCAAACA AACGCCCGGA AAGGTGGTGA CCCACAGGCG
CAGACGGCCA GTGCATTTGG CTTCAACTAC ACGCTGGAGA ACGTGGTGAC GTATAACAAA
AAAGTGGGCG ATCACAACTT CGGTTTTACC GCCCTGCAAT CCATTCAGCG GGATAACTTC
GAGCAGAATA ACATCTCTGT TCAAGGTGTG CCAGCCGAAT CGCAGCAGTT CTACAATGTA
GGCAACGCCA GTGCTGTATT GGGAGTAGGT AGTGGATTGC GGCAGTGGAC CATTAACTCG
TACATGGGTC GTATCAACTA CGATTATAAA GATAAGTACC TGGTAACCGC TACGTTGCGC
CGGGACGGAT CGAGCCGATT TGGCGAAAAT ACCAAATATG GTAATTTCCC CGGTATCGCC
CTAGGCTGGA ATGTCAGCAA CGAAGACTTC ATGAAGGGAT CTAGCTGGGT CGATCTGCTA
AAAATCCGGG CCAGCCGTGG TTCGGTAGGT AACCAGGGTG TAGCTCCCTA TCAAACGCAG
GGATTATTGG ACCGCACGGT ATATGCCTTT GGCAATACAC CCGCTTATGG CTATCGCCCT
AACACGATTG GCAACCCTGA TTTGCGCTGG GAAACGTCAA CCAGCACAAA CATTGGTATT
GACTTCAGTC TCTGGCGGGG CCGGGTATCA GGTGCTATTG AATTATATAA TACCCGCACG
ACCGACCTGC TACTATCCGA TCTGCTGCCT ACATCAATCG GTTTCAACTC TGTGACCCGC
AACATTGGCG AGACCCAGAA TAAAGGGATA GAAGTGAGTG TATCAACGGT GAACGTGAAT
TCAAAAAGTG GATTCAAATG GACATCCGAC ATTGTGTTCT CTAAAAATTC GGAAGCCATC
ATCTCCCTTT TCAACGGACC GGTTGATGAC GTGGGTAACA AACGCTTCAT TGGCAAGCCT
TTGACGGCCA TGTATGATTA CAAAAAAGCG GGTATCTGGC AAACCAGTGA AGCAGATGCC
GCTAAATCCT ACCAGAGTGC AGTTGGCCAG ATTAAAGTGC AGGACACCAA CGGCGATGGT
AAAATCACGG CTGATGACCG GGTATACTTA GGCTCTGACA TTCCAACCTG GAGTGGCGGT
ATCACGAACC GGTTCAGCTA TAAAGGATTT GACCTGAACT TCTTTATTTA TGCCCGTATT
GGCCAGACCA TTCTAAGCGG TTTCCACCGC GACAACAACC AGTTGGCTGG TCGTTATGAG
CAAATCAAAG TTGACTACTG GACACCTAAC AACCCAACGA ACGAGTTCCC ACGGCCTAAC
TCCAGCCAGG AGTTCCCGGT CTATAACTCA GCTATCATCT ATTTCGATGG ATCGTTTGTG
AAAGTACGGA ACATCAACTT TGGTTATACG TTCCCATCGA GCATTACGTC GAAACTGCGC
ATGCAGTCGC TACGTCTGTT CAGTAGCATT CAGCAGCCGT TCATCTTCTC GTCGTACCGG
TCGAAGTACA ACGGTGTTGA CCCAGAGACA AGCGATGGCA CGGTAAGCAA CGGTGTTACG
CCTGCTACCC GCGTAGTAAC CTTTGGTTTG AACGTCAAAT TCTAA
 
Protein sequence
MHSTITQPPR LAWLPLLGLA ALTLVSQPAF SAPPTVKLKL ASPTQERSVA GKVLSGDDNT 
GLPGVSVAVK GTTRGTTTDA NGEYKISIPN ERAVLVFSAV GFISQEVTIG NKSTVNLTLS
TDTRALNEVV VIGYGSQKKS QTTGAISSVT PKQITEQPIT NIGQAMQGRV AGVDVAQSGS
RPGSVPTIRV RGRRSFNAGN DPLYVVDGIP LSEGYEDINP NDVGSMEILK DATATAIYGA
RGANGVILVT TKRGNPVGKT TISYDNYVGF TDALDKVKLF SGSEFAEFVR EAYRTTGNYK
DANGNPVPTG VADPYADSKV AVLGGDPNVA AGLAANRNTD WQSLILKQGV QQNHSLGIQG
GNEKTQFYIS AGFFQDKGIM PGLDFTRQSL RANIDHQINK ALKVGIASYM MYSVRNGETL
NPYNFTLQQN PLGRPYDDNG NLIFSPTNDA LLTNPLAEVV PGAQVENRKK YRIFNSVYAE
VNILEGLKYR VNFGPDFTIN RFGRFIGAQT NARKGGDPQA QTASAFGFNY TLENVVTYNK
KVGDHNFGFT ALQSIQRDNF EQNNISVQGV PAESQQFYNV GNASAVLGVG SGLRQWTINS
YMGRINYDYK DKYLVTATLR RDGSSRFGEN TKYGNFPGIA LGWNVSNEDF MKGSSWVDLL
KIRASRGSVG NQGVAPYQTQ GLLDRTVYAF GNTPAYGYRP NTIGNPDLRW ETSTSTNIGI
DFSLWRGRVS GAIELYNTRT TDLLLSDLLP TSIGFNSVTR NIGETQNKGI EVSVSTVNVN
SKSGFKWTSD IVFSKNSEAI ISLFNGPVDD VGNKRFIGKP LTAMYDYKKA GIWQTSEADA
AKSYQSAVGQ IKVQDTNGDG KITADDRVYL GSDIPTWSGG ITNRFSYKGF DLNFFIYARI
GQTILSGFHR DNNQLAGRYE QIKVDYWTPN NPTNEFPRPN SSQEFPVYNS AIIYFDGSFV
KVRNINFGYT FPSSITSKLR MQSLRLFSSI QQPFIFSSYR SKYNGVDPET SDGTVSNGVT
PATRVVTFGL NVKF