Gene Slin_4222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4222 
Symbol 
ID8727981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5084658 
End bp5087690 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content53% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003389006 
Protein GI284039076 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAT TTTGTGGTAC TATTTTGGGA CTGGCGCTGC TGGTTCTGCT GCCGGGTTAT 
CTGTTTGCTC AGCAGCTGCG TATAACGGGT AGAGTAACAT CACAGCAGGA TGGGCTGCCG
ATACCTGGTG TCAACATCTC CGTTCGGGGA ACAACAAACG GGGTCAGTAC AGATGCCAAC
GGTAATTACA GCATCACCGT TTCGGGTAGT TCGGCCGTGC TGCTGCTTAC TTCCATTGGT
CTGGTGCAGC AGGAGATAAC GGTGGGTAAC CGCACGGTGA TCAACGTCCA GATGAAGGAA
GCCGTTAATG AGCTGAGTCA GGTTGTCGTT ACGGGTTACA ACACTACACA GCGAAAAGAT
ATTACCGGCT CTATCGCATC CGTTTCTCCC GATAAATTCA AAGATATTCC CGTTGCCAGC
TTCGACCAGG CTTTGCAGGG CCAGGCGGCT GGTGTGCAGG TAACACAGTC GTCGGGTACA
CCCGGTGGCG GACTTACCGT GCGGGTGCGG GGCAATACGT CCATATCGGC CAGTAACCGC
CCGCTGTTTA TTGTCGATGG GGTGCCCGTA TCAGACGGGG GCTTATCGGG TCGGGAGTTT
GGCGGGCAAA CAGACAATGC GCTGTCGCTC TTTAACCCCA ACGACATCGA ATCCATCCAG
GTACTGAAAG ATGCCTCCGC CAAAGCGATC TATGGCTCAC GGGCTGCGAA CGGCGTGGTG
CTGATAACGA CCAAGCGCGG GAAGGCCCAG AAAACGAGCT TCACCGCCGA TGTGCAGCGG
GGGTTAACGG ACGTGGTAAA GCGGCCGGAT CTACTTAATT CGGTCGAGCT GCTCGAATTA
CAGCGCGAAG CGGTTACCAA TGCCGGGCTG GACCCCGACA AACTGGGTCT GATAAAAGGG
GTTACCGACG GGCAGAATAC GGACTGGATA GATGCCGTAC TGCGAAGAGG GGTTTACCAG
CAATATCAGC TGTCGACGCA GGGCGGTAAC GACCGCACAC AGTTTTACCT CAGTGGCAGC
TACCGCGATG AACAGGGTGT ACAGCTGAAT AACCAGTTTA CCCGGTATAC AGGCCAGTTG
AAACTGGATC ATAAAGCAAC CGACAAATTA TCGTTCGGGA CAAACGTGAC CCTGTCAAGG
GCGCTGAACA AACGGGTAAA AGGCGATAAC TTTCTGGATG GTGTGTACTC TGGTGCCATG
AAAAGTCTGC CGTACTATTC GCCTTACAAT GAGCAGGGGC GACTTTACGG CCCCGCCGAC
GCCGAATACC CTGGATTCCC AAATTTCAAC CCCGTTGCAC AGGCTGTGCT GCCGCGATTC
AACGCCTACA CGGTGAAAAT ATTGGCGGGT CTGTATGCCG AATACGAAAT CCTCCAGAAC
CTCCGCTTTC GGTCGAAAGT AAATATCGAC TACAACAACG TAACCGAAGA TCAATTTGAA
CCGTCCACAA CGGCAATTGG AGGGTTTCTG TCCAGCGTAG GCGGGCAGGG CTACGGGGTG
TTCATCAATC AGTCGTCATC GACCTTTGTC AATACAAATA CCCTTACCTA TAATTTTCAG
CTGGCTGAAA AGCACCAGTT CAACGCGCTG GCAGGGGTAG AGATTCTACA GGCTACCGCC
CGGGACGGTA ATGTTCAGGG TCGATTATTT CCCAGCGATG ACTTTACCTA CATAAATTCA
GCGGGTATTG TCGATCAGGG GGGCTCTTCC GTAACGAACA ACGGCCTGCT GTCGACCTTC
GGCGAAGTCC GCTACAGTTA CGATGAAAAA TACCTGGCCA CGATTACCGC CCGTTACGAT
GGATCGTCGC GTTTTGGGCA GAGCCGCCGG TTCGGGGTGT TTCCGTCAGC CTCCTTTGCC
TGGCGTATTT CGAGCGAGAA ATTCATGGAA CGCTTCCGGT TCCTGAGCGA CCTGAAGTTA
CGGACGAGCT ACGGCTTTAC GGGCAACGAG CGCATTGGCG ATTTTCAGTT TCTGGGCACT
TGGGCATCAG TTACCTACAG TGGCGCAACG GGCGTGGGTC CGGCCACGCT GGCTAATGCA
AACCTGCAAT GGGAGCGCAC CCGCGAAGCG AACATAGGCC TAGATGCTTC GTTCTTTAAC
GGGCGGCTTA ATTTTATCGT TGATGCCTAT GATAACCTGA CGGATAAACT CCTGTTTGCC
CAGCCGATTC CGCAAACCAC TGGCTTCAGC ACCGTGCAGG GCAACATCGG GAAAGTATCC
AACAAAGGCC TTGAACTAAC CATTTCGACG GTGAACGTCA ATAAGGCTGT TCGCTGGAGT
ACCGATTTAA ACCTGTCCCA CAATGTAAAC AAAGTGGTGG AACTGGCCAG TACAGAGCCT
GTTCTGCGGG GCTATCAGGG CAATGGGGTA GCCACCACCA ACGTGGTAAT ACCAGGTCAG
CCACTGGGTA CATTCTGGGG GTTGAAATTC CTGGGAGTTG ACCCCGCTAC CGGCGACGCG
ATCTATGATG ATAAAAACGG CGATGGGCGT ATTACTCCCG CCGACGGACA GGTTATTGGC
AATGCCCAGC CCAAGGTGTA TGGCGGGTTG ACCAACAAGA TTTCCTGGAA AGGGATTGAC
CTGAGTGCGC TGCTTCAGTT TTCGTACGGG AACAGCATTC TCAACTTCTC GAACCAAACG
CTCCTAAACT CGGGTGCCGA CATTCAGAAT AACCAGACGC GGCAGGCACT CAAACGCTGG
CGTAAAGAAG GCGATATCAC GAGCGTACCC CGTTACGAAT ACCAGAATAC CTATAATAAC
TACACCAGCA GCCGGTTTGT GGAAGACGGG TCTTATCTGC GGCTGAAAAA CGTTTCGCTG
GGCTACAACA TTCCCAAGAC CTGGATCAAT AAATACAAAG TGGCCAACGC CCGTCTGTAC
GTCTCGGCTA CGAACATCCT AACCTGGAGC CGGTATTCTG GCGCAGATCC GGAAGTAAGC
ACGCTCGATG GCTCTACCAC GGCGCAGGGC ATTGACTTTT TCACCTTCCC TCAGATCAAA
ACGGTATTGG TAGGGGCAAC CCTTAGCTTT TAA
 
Protein sequence
MRKFCGTILG LALLVLLPGY LFAQQLRITG RVTSQQDGLP IPGVNISVRG TTNGVSTDAN 
GNYSITVSGS SAVLLLTSIG LVQQEITVGN RTVINVQMKE AVNELSQVVV TGYNTTQRKD
ITGSIASVSP DKFKDIPVAS FDQALQGQAA GVQVTQSSGT PGGGLTVRVR GNTSISASNR
PLFIVDGVPV SDGGLSGREF GGQTDNALSL FNPNDIESIQ VLKDASAKAI YGSRAANGVV
LITTKRGKAQ KTSFTADVQR GLTDVVKRPD LLNSVELLEL QREAVTNAGL DPDKLGLIKG
VTDGQNTDWI DAVLRRGVYQ QYQLSTQGGN DRTQFYLSGS YRDEQGVQLN NQFTRYTGQL
KLDHKATDKL SFGTNVTLSR ALNKRVKGDN FLDGVYSGAM KSLPYYSPYN EQGRLYGPAD
AEYPGFPNFN PVAQAVLPRF NAYTVKILAG LYAEYEILQN LRFRSKVNID YNNVTEDQFE
PSTTAIGGFL SSVGGQGYGV FINQSSSTFV NTNTLTYNFQ LAEKHQFNAL AGVEILQATA
RDGNVQGRLF PSDDFTYINS AGIVDQGGSS VTNNGLLSTF GEVRYSYDEK YLATITARYD
GSSRFGQSRR FGVFPSASFA WRISSEKFME RFRFLSDLKL RTSYGFTGNE RIGDFQFLGT
WASVTYSGAT GVGPATLANA NLQWERTREA NIGLDASFFN GRLNFIVDAY DNLTDKLLFA
QPIPQTTGFS TVQGNIGKVS NKGLELTIST VNVNKAVRWS TDLNLSHNVN KVVELASTEP
VLRGYQGNGV ATTNVVIPGQ PLGTFWGLKF LGVDPATGDA IYDDKNGDGR ITPADGQVIG
NAQPKVYGGL TNKISWKGID LSALLQFSYG NSILNFSNQT LLNSGADIQN NQTRQALKRW
RKEGDITSVP RYEYQNTYNN YTSSRFVEDG SYLRLKNVSL GYNIPKTWIN KYKVANARLY
VSATNILTWS RYSGADPEVS TLDGSTTAQG IDFFTFPQIK TVLVGATLSF