Gene Slin_3400 details

Gene Information

Locus tag: Slin_3400
Symbol:
ID: 8727153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4112434
End bp: 4115607
Gene Length: 3174 bp
Protein Length: 1057 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003388207
Protein GI: 284038277
COG category:
COG ID:
TIGRFAM ID:
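
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both are assumptions, but they agree with the reported values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4112434
end_bp = 4115607

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp. Assumption: the last codon is a stop codon and
# contributes no residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 3174 0 1057
```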


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.101583
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTTT TATTTTGCCT GTTGTCCGGC TTTTCAGCCG TAGCCCAGAC GGTTACTGGA 
CGCGTAACAT CGGGCGATGA TAATCAGCCG CTGCCGGGTG TATCCATCGT TGTGAAAGGC
ACTAATGCCG GTACTACTTC CCGCGCCGAT GGCACCTACT CTATCAATGT ACAGCCATCC
AGCACCCTGA CATTCTCGTT CATTGGCTAC GCCACGCAGG AAATTGCCGT AGCCAATGGC
AACGGCCAAC CGCGCACCAA ACTGGACGTG ACGATGCTGG CCGGCGACCG GACGCTGAAC
GAAGTGGTGG TAACGGCTCT CGGTATTAAG AAAGACGTTC GCAACATCGG CGTCTCGATT
CAGTCAGTAG ATGGTTCACA ACTGCTCAAA GCCCGGGAAC CAAACGCGGT TAATGGTCTG
GTTGGTAAAG TGGCCGGTCT GACCATCGGT GCCTCCGCCG AACTGCTGCG TCGGCCAAAC
ATCGTTCTGC GTGGTAATAC CGACGTGCTG TTCGTAGTAG ATGGCGTGCC CATCAACTCC
GATACCTGGA ATATCAACCC CGACGATATT GATACATACT CTGTTCTGAA AGGGGCATCG
GCGTCGGCAC TCTATGGGTT CCGGGGTAAA AACGGAGCCA TCCTGATCAC GACCAAGCGC
GGCACGAAAG ATAAGCGCGG CTTTGAGGTG GCTGTAAACA CGAGCCAGAT GGTCGACAAT
GGCTTCCTGG CCATTCCTAA AGTACAGGAC GAATATGGCC CAGGCGACCA CGGCGTTTAT
GAGTTCGTGG ACGGTAAAGG GGGTGGTAAA AACGATGGTG ACTACGACAT CTGGGGCCCC
CGTTTCGAAG GTCAGTTGAT TCCCCAATAC GACAGCCCTA TCGACCCTGT AACAGGCAAA
CGGATCGGTA CGCCCTGGGT AGCTCGCGGT AAAGATAACC TCAAGCGGTT TCTGCAGGCA
GGTATACTCT CGACCAACAA CATCTCGGTG TCGTCGTCGG GTGAGAAATA CGACCTGCGT
TTCTCGGTAT CGCATAACTA CCAGCGCGGT CTGGTGCCCA ATACCAAGCT GAATAGTACG
ACCTTCAAAG TATCGACAGG TTATAATTTC TCGAATCGCC TTCGGTTTGA AGGCGATGTG
CAGGTAAACC GTCAGTTTAC GCCCAATATA CCTGACGTAA ACTACGGCCC TAACTCCATG
ATCTATAACA TCGTGATCTG GGGTGGTGCC GACTGGGATG TTGATCAGTT GAAAAACTAC
TGGCAGCCAG GTAAAGAAGG TACGCAGCAA ATCTATGCTG AATATCAACG GTATAATAAC
CCCTGGTTTA CCGCTAAAGA GTGGCTGCGC GGGCATTATA AAACGGATAT CGTTGGTCAG
ACCTCCCTCA AGTACAACAT TACCGATGGC CTTGACCTAA CCCTGCGGAC ACAGGTATCG
ACCTGGAACC TGCTGCGTAA CGAGAAGTTT CCGTACTCAG CCACCAGCTA TGGCCGCGAA
GAAACGAAAG GCGATTACCG CGAAGACCGC CGGAATCTGC TGGATAACAA CACCGACTTA
CTGCTCAAGT ACGCAAAGCG TGTTAGTCCG TTGCTGAACG TAAACGCCAT TGCCGGGGGT
AACCTGCGGG TGTACAACTA CAACTCGAAC TACACATCGA CCAACTACCT GAACGTGCCT
GGCGTTTACA ACTTCGCCAA CTCGCTCAAC CCGGTCATTG CGTCGAACTT CCAGTCTGAT
ATGCGGGTAC TTTCGGCCTA TTACTCGGCT GACTTTACGC TGAAGGATTT CGTAACCCTG
TCCACAACGG GCCGGATGGA TAAACTGTCT ACCCTGCCAA AAGGAAACAA CACGTTCTTC
TATCCGTCAG TAGCGTTGAG CACAGTATTG TCGGATTACC TCCGGTTACC ACAGGCTATT
TCGTTCCTGA AATTCCGGGC GTCGTATGCC AATGTAAAAG ACGGTTTGAC TCAATCGACC
ATCGGTGTAC CAAACTGGTC GCTGGGCTAT GGTGAGCAGT ATCAGTCTTC TTACGATGGG
CCAACGTATC AGAACTCAGC CGTGTATAGC ACACCATACA CGGTTGGTAA TACGCCAACG
GCTTACTTTA CAAACGCCCT GAACAACCCA AACATCAAAC CGAACAGTAC CTCGCAGACC
GAGGTGGGTT TAGATGTTCG TTTCCTGAAC AACCGGATTG CGTTTGATGC TGCTTACTTC
ATCAGTGACG ATGGTCCGCG TATTTTCAAC CTGCCTATTT CTGAAACAAC CGGCTATTCA
TCAGCTCTGG TAAACGGTAT CAAAACCCAG AAGAAAGGGA TTGAATTGTC GCTGACGGGT
AAAGTTCTGT CAAATCCAGG TGGCCTGAAC TGGGATGTGC TGGCTAACTG GTCGACCTAC
AAAGAAATCT ATAAAGACTT CTATCCGGGT GTAACGGCGC TGAACACCTT CTTCAAAGTG
GGCGACCGGA CGGATAAATA CTATACCTCG ACCTACGTGC GTACGCCCGA CGGGCAGATT
ATCAATGACG CCGGTGGTCG CCCAATTCGT ACCACAGTGG CACAGTATGT AGGAAACCTG
CTCCCTGATT TTGTTTTTGG CTTGAACAAC CGGTTTAGCT ACAAGAATCT AACCTTTAGC
TTCCAGTTCG ATGGTCGTGT AGGCGGTGTT ATCTCCGACT ATGTGCAGCA GAAGACCTGG
GCCGGTGGCC GGATCATTAA TACCGTGCAG GGCGATATGG CCGCAGCGCG TCTGAACGAT
ACGAAAGGAA TCAAGTCCTA TCTGGGCGAA GGCGTACAGG TAAGCAACGG AGCTGCCATC
AACTATAATT CGGATGGGTA CGTGACCAAC TACGCTGAAC TTCAGTTCAA ACCAAACGAA
ACCAAAGCGT ATTTGCAGGA TTACATTGCC CGGCGTTACG GCTTCGACGG TGGCAACATC
ATCAGCCGCT CATACGTAAA ACTTCGTGAA GTAGTGGTTG GGTATTCGCT GCCGCAGGTG
TTCACCAGCC GACTGGGTAT CAAGCAGGCA TCAGTTTCTT TAGTTGCCCG TAACTTGCTC
TACTTTGCTG AGAAGAAGGA CATCGACATC GACCAGTTTA CGAGCGGTGG TCGTTCGGAT
CTGCAAACGC CAACGACGCG TCGCTACGGT ATCAACCTGA ATCTGACATT CTAA
 
Protein sequence
MAFLFCLLSG FSAVAQTVTG RVTSGDDNQP LPGVSIVVKG TNAGTTSRAD GTYSINVQPS 
STLTFSFIGY ATQEIAVANG NGQPRTKLDV TMLAGDRTLN EVVVTALGIK KDVRNIGVSI
QSVDGSQLLK AREPNAVNGL VGKVAGLTIG ASAELLRRPN IVLRGNTDVL FVVDGVPINS
DTWNINPDDI DTYSVLKGAS ASALYGFRGK NGAILITTKR GTKDKRGFEV AVNTSQMVDN
GFLAIPKVQD EYGPGDHGVY EFVDGKGGGK NDGDYDIWGP RFEGQLIPQY DSPIDPVTGK
RIGTPWVARG KDNLKRFLQA GILSTNNISV SSSGEKYDLR FSVSHNYQRG LVPNTKLNST
TFKVSTGYNF SNRLRFEGDV QVNRQFTPNI PDVNYGPNSM IYNIVIWGGA DWDVDQLKNY
WQPGKEGTQQ IYAEYQRYNN PWFTAKEWLR GHYKTDIVGQ TSLKYNITDG LDLTLRTQVS
TWNLLRNEKF PYSATSYGRE ETKGDYREDR RNLLDNNTDL LLKYAKRVSP LLNVNAIAGG
NLRVYNYNSN YTSTNYLNVP GVYNFANSLN PVIASNFQSD MRVLSAYYSA DFTLKDFVTL
STTGRMDKLS TLPKGNNTFF YPSVALSTVL SDYLRLPQAI SFLKFRASYA NVKDGLTQST
IGVPNWSLGY GEQYQSSYDG PTYQNSAVYS TPYTVGNTPT AYFTNALNNP NIKPNSTSQT
EVGLDVRFLN NRIAFDAAYF ISDDGPRIFN LPISETTGYS SALVNGIKTQ KKGIELSLTG
KVLSNPGGLN WDVLANWSTY KEIYKDFYPG VTALNTFFKV GDRTDKYYTS TYVRTPDGQI
INDAGGRPIR TTVAQYVGNL LPDFVFGLNN RFSYKNLTFS FQFDGRVGGV ISDYVQQKTW
AGGRIINTVQ GDMAAARLND TKGIKSYLGE GVQVSNGAAI NYNSDGYVTN YAELQFKPNE
TKAYLQDYIA RRYGFDGGNI ISRSYVKLRE VVVGYSLPQV FTSRLGIKQA SVSLVARNLL
YFAEKKDIDI DQFTSGGRSD LQTPTTRRYG INLNLTF
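
The sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before they can be used. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. The function names are hypothetical and not part of any database tool. The codon table is the standard one. Translation table 11 is assumed here to use the same codon-to-amino-acid assignments and to differ only in which start codons are permitted.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
# Assumption: table 11 assigns amino acids exactly as the standard code does.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above, used as a small worked example.
first_line = "ATGGCGTTTT TATTTTGCCT GTTGTCCGGC TTTTCAGCCG TAGCCCAGAC GGTTACTGGA"
dna = clean_sequence(first_line)
print(translate(dna))  # MAFLFCLLSGFSAVAQTVTG
```

The printed residues match the first two blocks of the protein sequence above. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the reported protein sequence and GC content to be checked in the same way.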