Gene Slin_3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3040 
Symbol 
ID8726792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3690422 
End bp3693745 
Gene Length3324 bp 
Protein Length1107 aa 
Translation table11 
GC content50% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003387850 
Protein GI284037920 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0955373 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAAG GTTTACAAAG GCTTATTATC TTATTGTGGG TAATAAGTAC CCCCGTATTC 
GCTCAGACGA TATCGGGTAG GGTCACGGCT GGTACCGATG GACAGCCATT ACCGGGTGTT
TCCATTCTGG TAAAAGGAAC AACGTCTGGT ACCATAACGG ATACGGATGG GAAGTACAGC
CTTGCTGCGG CAAAGAATAA AGTACTTGTA TTTTCCTTTA TCGGTTACAA GAGCAAGGAA
GTTGTTATCG ACAACAAAAC GACGGTCGAC GTTACGCTGG ACGAAGATGC ATCCGTGATC
AATGAAGTAG TTGTCACCGC CTTAGGCATT CCCAAAGCAG AGCGTGCACT GGGTTATGCT
ACGGCGGTTG TCAAAAATGA TGCGCTGATC AAAACTGCGA CACCAAACTT TGCCACGGCG
CTTTATGGTA AAGCTCCTGG TGTAACAATC AATGCAACAC CGGGTGGAGC CACTAGTGGC
GTAAGCATCA GCATTCGCGG GTTAAGTTCG ATAACCGGCA ACACACAGCC GCTCATCGTG
ATGGATGGTA TCCCTATCCG GAATGGCGAA GCCCGCAATA CTGACTACTG GGGCGACCAG
CGGATTCGGG GTAACGGCCT TCTGGACCTC AACCCTGCCG ACATCGAAAA CATCTCGATT
CTGAAAGGGG CATCTGCGGC AGCTTTGTAC GGTTCGGAAG CTGTAAACGG GGTTGTACTG
GTGACTACTA AAACCGGAAA AGGCCGCAAA GGGCTGGGCG TTGATTTCAG CGCGAGCTAC
AGTGCCGATA AAATTGCTTA CCTGCCACGT TACCAGAATG TTAGAGGCCC TGGCTATTTC
CAGAACTACG CCAATGGGGG GCAGGATGCC AATGGTTTCA TTTCATACGA TACCGATGGA
GATGGCAAAG GGGATACCCG CGGTCTGTTG GGTGCTACGG TTAACTTCGG CCCCAAGTTC
GATGGACAAC CGGTTATGGC CTTTGATGGC GTCATTCGGC CTTATGTGGC ATCCAATAAC
AGTTATGCCA ACTTGTTCCA GAACGCAAAT AGCGCGAACA TCAACCTGGC TGTTTCCAAA
GCAACGGATA ATTCGACGAT TCGCTTTTCG TACACGCGGC AGGATAATGG CATGATTAGC
TATGCGGCAA AAAACGAAAA GAATATTATG AACCTGAATG CCAGCTTTAG TCTCAACAAA
AAGCTGACAA CGGATCTGAT GGTCAACTAC GTAAACCAAT ACACGCACAA CCGGCCCTTT
AAAGTGGATC GTATGATCAA CAACTTCTCG GGGATGATGA ACCGGTTCGA ATCAGCCGAT
TGGTACTTCA ATAAATACCA GACCAGCCAG GGCTATAAGT ATGTAACCGG TACCAACCAA
AGCCTGACTC CCAAAGAGAA CATCATTCGT AACGGATTCA AAGGCGATAT TGGTGATTAC
GTTTGGAGCA CCCGTGCCAA CACCTATGAT GAATACAGCA ACCGGGTTAT TGCCAGCATC
ACGCAGCATT GGCAAATTCT GGACAACCTG AAGCTCCGAG GCCGGATTGG TACTGACCTT
ACATCTGAGC GGCTCGAAGA CAAGCAGCGG AGTTCTATTC CTCTGGCCTT TGGTTACTCG
GGTTACTTTG CCATGAACAA CAACCTGTAC AGTAATGTCT ATGGTGATGT GTTGCTTACC
TATACCAAAA AGCTGAATCC GGACGTAACC GTGATGGCAT CGGGAGGCTA TACCGCCAAC
AAGATGCTGA ATACGTACGT GGGCCGGTCA ACTAACGGAG GACTGAGCAC CGAGAATTTC
TTCGATATCT CCGCGTCGGT GAATACACCG AACGGCAGCA ACAGCCGCGA CAAGTCTATC
CGGGATGCAT TCCTGGGTAC GGTGAACTTT GATTACAAAA ATTTCTTCTT TATCGAAGGT
ACCTTACGTC GTGACCGCAC GTCTACACTC GCACCGGGTA ACAACGCCTT CGTGTATCCG
TCTTTGAACT CCAGCCTTGT ATTCAGTGAT CTGTTCCGGT TACCGGCGGT TATCGACTAC
GCCAAGCTGA GAGGTTCGTG GGGTATTGTG GGTAACTACC CAACCATCTA TAGCGCCAAT
AATGCCTATA ACCAGGGTAA CCTGAGCATC CAGCAAACCG GTGGCAGCTC GGTATTGTAT
ACCAACATCA GCAGCGACTA TGGCAACGAC AAGATCCGTC CCGAGCAGAA ACATGAGTTT
GAGTTCGGCC TGGAAGCCAA GCTGTTCAAG AACCGGCTGG GTGTAGACCT GTCGTATTAC
AACGCCCAGA TTGTTGACCA GATTCTGCCG TTAACGATTG CCGCTACATC CGGTGCCAAG
TCGATCCTGG CCAACATCGG TACATTGAGA AACCAGGGTG TCGAACTGGC CCTTAACTTT
TCCGCCCTAA AAAGCGCGGA CCCTAACGGT CTGAACTGGG ACGTTACGTT GAATCTGGCT
AAAAACAGCA ACAAAGTAGA GAAGTTGACC AACAACTCAA CCGAGCTGCT GCACGCCGAT
TATGATGGCA ATGCCGCTCA GCTTCGTTCG GTGGTTGGCC AGCCAATGGG CGATATTTAT
GTGCATGGTA TTCTCAAAAA TGCCGATGGA CGCAATGTCG TTGGGCCGAA TGGTATCTAC
CAACTCGATG GTGCCAACTG GATAAAGGCT GGTAACGCCA TGCCAAAACT GACGGGTGGC
TTGCTGAACA ACATAGGCTA CAAAGGTTTC AATCTGGATG TGGTTGTTGA CTTCCGGTAT
GGTGGCTCTA TTATGCCAAC GGGTATCAAC TGGTTGACAT CGCGCGGGCT GACTGAGGAG
AGCCTCACTG CTATGGACGC CGAGCACGGC GGGTTGCGTT ACTACAAAGA TGCCAACGGT
AAAGGCATTG CAACTACAGG TTCTGCCGGG CCAAACGGTG AAGTGGTGTA TAACGACGGT
ATGTTAATGG ATGGCGTACT GCCAACCGGC GAAGCTAATA CCAACATTAT CTCTCAGGCT
GTGTATTACA ATAACACCTA CAACTGGGGT GGACCGCAGT ACAGCAGCTC GCGTTATGAG
CTGTACGTAA AGGAAAATAC GTACATAAAA ATGAGAGAGA TCTCGCTGGG CTATCGGATT
CCGGCCAGTA TTACCCGTAA GATTGGTACC CAGAACCTGA CCCTGTCGGT ATTTGGTCGT
AACCTGTTCT TCATCTACAG AACTATTAAG GATCTGGACG CCGAACAAAC CAATTCGAGT
ACACGCTGGG CCGAAAACAT CAATAACGCT GGTAACAACC CGTCGTTCCG CACCATGGGG
GTAATGCTAC GCGCCAGCTT CTAA
 
Protein sequence
MVKGLQRLII LLWVISTPVF AQTISGRVTA GTDGQPLPGV SILVKGTTSG TITDTDGKYS 
LAAAKNKVLV FSFIGYKSKE VVIDNKTTVD VTLDEDASVI NEVVVTALGI PKAERALGYA
TAVVKNDALI KTATPNFATA LYGKAPGVTI NATPGGATSG VSISIRGLSS ITGNTQPLIV
MDGIPIRNGE ARNTDYWGDQ RIRGNGLLDL NPADIENISI LKGASAAALY GSEAVNGVVL
VTTKTGKGRK GLGVDFSASY SADKIAYLPR YQNVRGPGYF QNYANGGQDA NGFISYDTDG
DGKGDTRGLL GATVNFGPKF DGQPVMAFDG VIRPYVASNN SYANLFQNAN SANINLAVSK
ATDNSTIRFS YTRQDNGMIS YAAKNEKNIM NLNASFSLNK KLTTDLMVNY VNQYTHNRPF
KVDRMINNFS GMMNRFESAD WYFNKYQTSQ GYKYVTGTNQ SLTPKENIIR NGFKGDIGDY
VWSTRANTYD EYSNRVIASI TQHWQILDNL KLRGRIGTDL TSERLEDKQR SSIPLAFGYS
GYFAMNNNLY SNVYGDVLLT YTKKLNPDVT VMASGGYTAN KMLNTYVGRS TNGGLSTENF
FDISASVNTP NGSNSRDKSI RDAFLGTVNF DYKNFFFIEG TLRRDRTSTL APGNNAFVYP
SLNSSLVFSD LFRLPAVIDY AKLRGSWGIV GNYPTIYSAN NAYNQGNLSI QQTGGSSVLY
TNISSDYGND KIRPEQKHEF EFGLEAKLFK NRLGVDLSYY NAQIVDQILP LTIAATSGAK
SILANIGTLR NQGVELALNF SALKSADPNG LNWDVTLNLA KNSNKVEKLT NNSTELLHAD
YDGNAAQLRS VVGQPMGDIY VHGILKNADG RNVVGPNGIY QLDGANWIKA GNAMPKLTGG
LLNNIGYKGF NLDVVVDFRY GGSIMPTGIN WLTSRGLTEE SLTAMDAEHG GLRYYKDANG
KGIATTGSAG PNGEVVYNDG MLMDGVLPTG EANTNIISQA VYYNNTYNWG GPQYSSSRYE
LYVKENTYIK MREISLGYRI PASITRKIGT QNLTLSVFGR NLFFIYRTIK DLDAEQTNSS
TRWAENINNA GNNPSFRTMG VMLRASF