Gene Slin_3031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3031 
Symbol 
ID8726783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3669574 
End bp3672642 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003387841 
Protein GI284037911 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.258909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAC ACTTACTTTA TTTCAAAATC CTGGTTCTTA CACTGCTTAC GACGCTCAGT 
TTTGCCCAGA GCGTTGAGGT GAAGGGACGC ATAACGGGCG AAGGAGGAGC GCCTATCTAC
GGGGCCAACG TAGTGGTGAA AGGTAGCCGT CAGGGGGCTA TTTCAGACGA AAAGGGCGAT
TACCGCATTC AGGTACCCAA AGGCTCTACC TTAACCGTCA GTTTTATTGG CTATGTAGCC
AAGGATGTTG TTGTCGGCAA TAGCTCGGTT ATCAATGTGT CCCTGGCTCC TCAGTCGTCG
GTATTGGATG AGGTGGTGGT TACGGCTCTT GGTATCAAGA AGGAGAAAAA AGCACTTGGT
TATTCTGTAA GTGAAGTAAA GGGCGAGGAA CTCACGCAGG CCCGGACGGT CAACGTAGCG
AACTCACTAC AGGGTCGGGT AGCGGGCCTG AACATCGCCA ATACCGCCAC TGGCCCGGGG
GGCTCGGCAC GAATCATTAT CCGGGGTAAT GGATCGATCT CCGGAAATAA CCAGCCGCTG
ATCGTTGTTG ATGGCACGCC CATCAATAAT GACAACCAGG GCTCTGCAGG TATGTGGGGT
GGTGGCGACG GTGGCGATGG TATCTCCAGT CTCAACCCGG ACGAAATCGA AACCATCAGC
GTACTGAAAG GAGCTACGGC TTCGGCGCTG TATGGCTCAC GGGCTTCGAA CGGGGTTATT
CTGGTGACGA CGAAAGGAGG TAAAGCCAAC AAAGGCATTG GTGTTGAGGT CAACAGCAAC
TTTGTTGGTG AGAGCCTGTT GCTACCTACC TACAAAGATT ATCAGTACGA GTACGGTATG
GGAAACAATG GCATTAAGCC AACAACCATT GCCGAAGCGC TGACATCGAA TAGCTGGGGT
AGCAAATTGG ACGGAAGCAG TGTTATTCAA TTTGATGGCG TTTCCCGGCC TTACTCGGCT
GTTCGGGATA ACCAGCAGAA TTTCTACCGG GTAGGAAGCA CCTTTACCAA CTCGGTTGCG
CTTACGGGAG CTACCGAATC GATGACGTAC CGCCTGTCGA TGAATGACCT CAACAACAAG
GCGGTTGTTC CCAATAGCGG TCTGCGCCGG AACAATTTCG CCTTGAATCT CAATGCGAAT
CTGGGTAAAA ACCTGTCTGT CGTTACCAAC GTCAAGTATA TCCTGGAACG CACCAACAAC
CGTCCCCGCT TATCTGACTC ACCGGGTAGC GCAACCTATG CTCTGAATGC CATGCCTACA
TCGCTGGGTA TAGCAGCACT GGAGCAAAGC CGGTATAATG CGGATGGTTC GGAGAAAACC
TGGTCGGATA ACATCTATAT TCAAAACCCT TATATCGCTG CCTACGACTG GCGTCAGGAG
GACAAAAAAG GCCGGATCAT CGGTGTAATA GAGCCTCGGT ACAACTTCAC TGACTGGCTG
TTCTTACGGG GTCGTCTGGG CTTCGATAAC TTCAACTACC GGAACCTGAG CATTACGCCG
TATGGTACAC CCTTCCAGCC TCGGGGCGGT ATGAACGTTG CCAACCGCAA CTTCACGGAA
ACCAATACCG AATTGCTGTT GGGTGTTAAC CGGAAGTTTG GCGAAGCATT TGGTGTAAAT
GCGCTGTTCG GTGGTAACCT GATGCGTCAG GTGTACCAGA ACTCAAACTA CGGCGGCAAC
AACTTCAATA TCCCGTACTT CTACGATATA TCGAACATCG ACCCGGCTGC CCGTAACTCC
AGCGAAAACT ACATCGAGAA GCGGATCAAC TCGGTATATG GTTCGGCTGA ATTCTCGTAT
AAAGGTTATC TGTTCGTAAC GGCTACGGCT CGTAACGACT GGTTCTCGAC GTTGGCTAAA
GGTAACAACA GTATTCTGTA CCCATCCGTT GGTGGTAGCT TTGTGCTGTC AGAAGCGGTT
AGAATGCCGA AAGCCGTTAA CTATCTGAAA TTTAGAGGCT CGTGGGCGCA GGCCGGTGGT
GATACCGACC CGTACAACCT GTCTCTGTAT TACGGACTGG CCGGTGCTCA CCTGGGCGCT
CCGCTGGCAC AGATCAATGG TGACCGCGTA CCGAACTCCA ATCTGCAGCC GCTCACCTCG
ACAACTTCGG AAGCCGGTCT GGAAACACGT TTGTTCAACA ATAAGCTTAG CATCGACTTC
GCTGTATATT CCCGTAAAAC AACCAACGAC ATCGTTGGTG CCACCATTTC CAACACGTCG
GGCTACAACA GCGCCCTGTT TAACGTGGGC GAAATTTCCA ACAAAGGGAT TGAGCTTTTG
CTGACCTACC GGTTGGCAAG CAGCAAGGAT TTTAGCTGGG ATGCTTCGTT CAACATGGGC
TACAACAAAA GCGAAGTGGT TAGCCTGTAT GGTAACCTGA CAACCCTGCG GGTAGATGAA
AACCGGACGC GTATTGCGTA TATCCACCAG GACGTAGGAC TGCCCTACAG TCAGGTAAAA
GGGTTTACCT ACAAGCGGAA TTCGGCCGGG GCTATTGTCT ATGATTCGCA GGGTTACCCA
ATGCAGGGTG ATCTGGTTAA TTTCGGCACG GGTGTAGCGC CAACTACGCT TGGCTTCAAT
AACTCTTTCC GATACAAAGG GATTGGCATT AGCTTCCTGA TCGATGGTAA ATTTGGTGGC
GTGATTTACT CCGGTACCAA CGCGTACGCA AACCGCCGTG GTTTGCTCAA GTCGACGCTC
GAAGGTCGTG AAACGGGTAT CGTTGGCGTA GGTGTAAACG AAAAAGGTGA ACCTAATACG
GTTAAGGTGC CAGCACAACA GTACTATGAG CGCCTGTTCA ACATTGCTGA TCCCTTCGTT
TACAGCGCTG ATTTCCTGAA ACTCCGCCAG GTAATTATCG ACTACACGAT TCCGGCGCGG
GTATTCGGCA AATCGCCTAT CAAAGGGGCT TCGATTTCTA TTGTTGGTCG TAACCTGGCT
ATCCTGATGA AGCATACGCC AAACATCGAT CCTGAATCGA CTTACAATAA TTCAAACGCA
CAAGGGCTTG AACTGGCAGG CGTACCCGCC ACACGCACTT TGGGGGTTAA CCTGAATTTG
AAATTCTAA
 
Protein sequence
MQKHLLYFKI LVLTLLTTLS FAQSVEVKGR ITGEGGAPIY GANVVVKGSR QGAISDEKGD 
YRIQVPKGST LTVSFIGYVA KDVVVGNSSV INVSLAPQSS VLDEVVVTAL GIKKEKKALG
YSVSEVKGEE LTQARTVNVA NSLQGRVAGL NIANTATGPG GSARIIIRGN GSISGNNQPL
IVVDGTPINN DNQGSAGMWG GGDGGDGISS LNPDEIETIS VLKGATASAL YGSRASNGVI
LVTTKGGKAN KGIGVEVNSN FVGESLLLPT YKDYQYEYGM GNNGIKPTTI AEALTSNSWG
SKLDGSSVIQ FDGVSRPYSA VRDNQQNFYR VGSTFTNSVA LTGATESMTY RLSMNDLNNK
AVVPNSGLRR NNFALNLNAN LGKNLSVVTN VKYILERTNN RPRLSDSPGS ATYALNAMPT
SLGIAALEQS RYNADGSEKT WSDNIYIQNP YIAAYDWRQE DKKGRIIGVI EPRYNFTDWL
FLRGRLGFDN FNYRNLSITP YGTPFQPRGG MNVANRNFTE TNTELLLGVN RKFGEAFGVN
ALFGGNLMRQ VYQNSNYGGN NFNIPYFYDI SNIDPAARNS SENYIEKRIN SVYGSAEFSY
KGYLFVTATA RNDWFSTLAK GNNSILYPSV GGSFVLSEAV RMPKAVNYLK FRGSWAQAGG
DTDPYNLSLY YGLAGAHLGA PLAQINGDRV PNSNLQPLTS TTSEAGLETR LFNNKLSIDF
AVYSRKTTND IVGATISNTS GYNSALFNVG EISNKGIELL LTYRLASSKD FSWDASFNMG
YNKSEVVSLY GNLTTLRVDE NRTRIAYIHQ DVGLPYSQVK GFTYKRNSAG AIVYDSQGYP
MQGDLVNFGT GVAPTTLGFN NSFRYKGIGI SFLIDGKFGG VIYSGTNAYA NRRGLLKSTL
EGRETGIVGV GVNEKGEPNT VKVPAQQYYE RLFNIADPFV YSADFLKLRQ VIIDYTIPAR
VFGKSPIKGA SISIVGRNLA ILMKHTPNID PESTYNNSNA QGLELAGVPA TRTLGVNLNL
KF