Gene Slin_3654 details


Gene Information

Locus tag: Slin_3654
Symbol:
ID: 8727407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4410943
End bp: 4413375
Gene Length: 2433 bp
Protein Length: 810 aa
Translation table: 11
GC content: 47%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003388459
Protein GI: 284038529
COG category:
COG ID:
TIGRFAM ID:
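
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon (assumption stated above).
    return gene_len_bp // 3 - 1


# Values taken from the record for Slin_3654.
start_bp, end_bp = 4410943, 4413375
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (2433 bp) and Protein Length (810 aa) fields of the record.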


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTA TAAACATACT TTTGGTTTTG ATGCTCCAGT TGTGTGTACA CACGGGTATC 
GCCCAAAATC AAGCAGCAGG CTATTCGATT CGCGGAATCA TTTCCGATTC GGCAAGCCAT
AACGCATTGA CTTTCATAAC GGTAAATCTG ATAAAGAGCC CAGCTACCGT TCTGAAAGTC
GATTATTCCA AAGCGGATGG CTCATTTTCG TTTGCGGGGT TGGAATCGGG AAGTTATACG
CTGGCTGTTG TCGGTGTAGG CTACAAGACG ACCAGAATCA CGGTGGAGTT GCCGGATTCC
AGTCAAAAAG CAGGGGATTT AGGCAACATA CCGCTGATGC CTGACTTGGT GGGCCTGAAG
GAAGTGGTTG TAAAAGCGTC AAAACAGATT GTAAAGCAGG AAATTGACCG TATCACGTAC
GATATGCAGG CCGATCCTGA AAACAAGGTC TTTAATGCCT TGGAAATGAT GCGGAAAGTG
CCCCTGCTGT CGGTGGACGC GGACAATAAC GTTTATCTAA AAGGAAATGC TGATTTCAGA
ATTCTGATTA ACGGCAAACC GTCAAGCATG ATGGAACGAA ATTACCGCGA CATCCTGCGG
AGCATGCCTG CCTCCTCGAT TGAACGGATT GAAGTGATCA CTACCCCCCC GGCAAAATAT
GACGCGGAAG GTCTGGCGGG CATCATCAAC ATCATTACCC GTAAACCTGT CGACAATGGC
TATAGTGGCA CCGTCAACGT TAGCGAGCGT TTTCCGGTGG GCGGGCCTGG TGTAGGTGGC
ACAGTATCCG CCAAATTCAA TAAGCTAGGG ATGACCTTGA TGACGGGGGG CAACCAGTAT
CGAATACCAA GTATCCGAAC GATGTCCGAA CGCGCTACCC GAGGCCAGGA CCCGACCTAT
CTTACCCAAA ACGGCACTAA CCGATCAACT AATTATAGCG GTTACCTGGG CTACGAAATT
AGTTACGAAT TTGACACGCT TAACCTGATT AGTGCGCAGT TCAACATTAA TGGCAGTCAG
GCTAGAGGGT TTGTCTCCCA AAACTCCCTC CTGATGGCCG ACTCCGGGCT TCTTGAGCAA
TATGGGCTGG CCAACACCAG TCGTGATCTT GGTCGCGGGA TGGATGCTGC CTTTAACTAT
CAACGAAGTG CTAAAGTGAA CAAAAACCAG CTGCTTACAT TTTCCTACCG GTATTTTGGT
TACAGCAATA ACCAAAAGGG CCAGATAGAC ATCACCGAAC GGATCAACGT TAACCTACCC
GATTACCGCC AGGTCAACAA TCAGCAATTC TCGGAGCAAA CCCTCCAGGT GGATTATGTG
TATCCGATAA AAAAACTCAA CGTCGAAGCG GGATTGAAAG CCATCATGCG TGATAACCGA
AGCGATTTTC AGTTTGAATC ATTTGTCACG GCAGCGAATG CTTTTGATCC GGACCCATTC
CGAAGCAATC GGTTTACCAA TACGCAGAAC GTTTTCGGTG CTTATAACAC GTATCAATAT
ACATTCAAAA AATGGGGGCT AAAAGCGGGG GCAAGGATTG AACAAACCTT TGTAAAAGCG
GATTTTGTAT CCACGGGTTC AAGAGTAAAT CAGAATTACG TCAATGTCAT TCCGTCGCTG
AGCATCAATC GAAAGTTGAA AAACAACGCC GGTTTCAATT TTGGCTACAC CCAGCGCATT
CAACGTCCTG GTATCTACCA ACTCAATCCG TTTGTGGATC GCTCAAACCC CAGTTTTGAG
CGTAGCGGCA ATCCGGTCTT ACGGCCCGCT TTTGTAAATG ATTTTCAGCT CGGCTTCAGT
ACGTCAAAAA AAGTCTCCCT AAACATCGGT TTAGGCTATA CTCACTTACG GGACATGATC
ATGGCGGTGG CCGTTTTCAA TCCCCAGACC AATATTACCC GCTTAACGTT TGGTAATACA
GGACGAGCGC GGTTATTTAT GCTGATGCTG AATAGCAATT ATTCGATTAA TAAGAACTGG
AATTTAAGCC TGAACTCTCG GGTTGCCCAT GGTAGGGTCT TTGGCGTGGT CGATGGTCAG
GCGGTTTCTA ATCAGGGCTT TATGTACCAA GCGAACTTGT CAACGGGCTA CAAACTGGCC
AAAGGCTGGC GGGTGAACGG CAACCTGAAC GTAGTTGGCC CCAATATCAA CATTCAGGGT
ACTACTAATT CAATAGTGAG TTCCTCGTTG AGCGTGAGTA AAGATATGTT TAACTACAAA
TTAGCCCTCT CAGCAGCCGT CAATAATCCG TTTACCCAGT TCCGGAATGC ATATCGTGAA
ACGTTCGGGC CGAACTTCAA CCAAGTAGAT CTTCGACGCG ATTATTTTCG GTCCTTTACT
GTTAGCCTGA ACTATAAATT CGGCAAGCTA AAAGAAGCGA TCAAGAAAAA TAAACGAGGC
ATCCGGAATG ACGATGTACA AACCGATAAT TAG
 
Protein sequence
MKIINILLVL MLQLCVHTGI AQNQAAGYSI RGIISDSASH NALTFITVNL IKSPATVLKV 
DYSKADGSFS FAGLESGSYT LAVVGVGYKT TRITVELPDS SQKAGDLGNI PLMPDLVGLK
EVVVKASKQI VKQEIDRITY DMQADPENKV FNALEMMRKV PLLSVDADNN VYLKGNADFR
ILINGKPSSM MERNYRDILR SMPASSIERI EVITTPPAKY DAEGLAGIIN IITRKPVDNG
YSGTVNVSER FPVGGPGVGG TVSAKFNKLG MTLMTGGNQY RIPSIRTMSE RATRGQDPTY
LTQNGTNRST NYSGYLGYEI SYEFDTLNLI SAQFNINGSQ ARGFVSQNSL LMADSGLLEQ
YGLANTSRDL GRGMDAAFNY QRSAKVNKNQ LLTFSYRYFG YSNNQKGQID ITERINVNLP
DYRQVNNQQF SEQTLQVDYV YPIKKLNVEA GLKAIMRDNR SDFQFESFVT AANAFDPDPF
RSNRFTNTQN VFGAYNTYQY TFKKWGLKAG ARIEQTFVKA DFVSTGSRVN QNYVNVIPSL
SINRKLKNNA GFNFGYTQRI QRPGIYQLNP FVDRSNPSFE RSGNPVLRPA FVNDFQLGFS
TSKKVSLNIG LGYTHLRDMI MAVAVFNPQT NITRLTFGNT GRARLFMLML NSNYSINKNW
NLSLNSRVAH GRVFGVVDGQ AVSNQGFMYQ ANLSTGYKLA KGWRVNGNLN VVGPNINIQG
TTNSIVSSSL SVSKDMFNYK LALSAAVNNP FTQFRNAYRE TFGPNFNQVD LRRDYFRSFT
VSLNYKFGKL KEAIKKNKRG IRNDDVQTDN