Gene Slin_0744 details


Gene Information

Locus tag: Slin_0744
Symbol:
ID: 8724474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 902723
End bp: 905989
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003385606
Protein GI: 284035676
COG category:
COG ID:
TIGRFAM ID:
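
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon (the gene sequence on this page does end in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Protein length, assuming the CDS length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(902723, 905989)
    print(length, protein_length_aa(length))  # prints "3267 1088"
```

Both results match the Gene Length and Protein Length fields reported for Slin_0744.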


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTGC TCCTGGTAGT CTCAAGTTTT TCGAATGGAT GGGCGCAAGG CCGGAAGGTC 
AGCGGCACGG TCACTGCTGA TGAAGGGATC GGCGGATTTG CCGGGGCTAC TATTGTTGTG
AAAGGCACAT CTATAGGAAC CGTAACGGAC GCAAAAGGTG AGTATACCTT GACTGTGCCG
CAGACCGGTA CAATTCTTGT TTTTTCTGCC GTAGGTATGC AAAGTGTTGA AGAAGCAATT
GGCGGGCGCA GTCAAATTAA CGTTCAATTG AAAACCGACA CAAAACAGCT CAACGAAGTC
ATCATCACTG CACTGGGGGT AAAAGAAGAA CGTGATAAGT TTGCGTCATC CGTTTCAACA
GTTGGCGGAA AAAACATAGC TCAGTCTGGC GAAACGGGTC TGTTATCCGG CTTAAGCGGT
AAGGCATCGG GCGTTGTTAT TACCAAGAGT GGCGGTGACC CGGGCGCAGG AGCCTATATT
CAGATTCGGG GCCAGAATAC CATCAACGGT AATGCCCAGC CACTTTTCAT TGTCGATGGA
ATACCGGTCA GTAACTCAAA CTTTAATGAC GGCTCGGCCG CTGGCAACTC TATCGTTCAG
CAATCCCGGA TCAACGACAT CAACCCCGAA GATGTTGAAA GCATGGAAGT GCTGAAGGGG
GCATCGGCAG CCGCTTTGTG GGGTACCCGT GCCGCCAACG GGGTTATTAT CATCACGACA
AAGAAAGGAA AGGACTCGAA AGGCAAAGTC AACATTTCGT TCAAATCGAC GGTTTCGTTT
GACAAAGTCA ATAAAATGCC GCCCCTTCAA ACTACGTACG GACAAGGGAG TGGTGGCTTT
TTCAGACAGG GTAACAAGTA TAGCTACGGC GACCTGATTG CCGAGCGGTC GGGAGGGCAG
GACACCTACA TCACCGACCC CAACGCAGCA GGCTATCAGG GCTTTGTTAC GTTCCCGGAT
GGAACCAAAC GGTATGCTAT TGCTTCTGGT AATGCTGCAA ACCCGCATGG TGGCAAAAAC
TCGAAAGATA CGTTTGATCA CACAAGCGAT GTTTTCCAGA CAGGTCATTA CACAGACAAT
GCCGTTAATA TCAGCGGAGG CAATGCCCGG TCTAACATCG CCATTAGTTA CTCGAACTTA
AGTCAGGACG GGATCATTAA AGCGTTTAGC AACTACCAGC GCAACACGGC CCGGCTTAAT
GCCAGCAACC AGTTTACGGA GTGGTTCCGG GCATCGGCAT CAGCTTCGTA TACAAAAGTA
TCGTCGTCCC GGGTTCAGCA GGGTGATAAC CTCGATGGTA TTCTGCTAGG TGGTACCCGT
ACTCCTTCTG ATTTTAATAA CCAATATTAT ACCGGCACCT ACACGGATGA GTCGGGGCAG
GTATTTAACG ATGCCCACGT CTCGTACCGG AATCCGCTCG GTATTGATCA GAACACCATT
TACTCCAACC CGGTCTGGAA TATCAACAAC AATAAAAATA CCAGCGATGT CGACCGGATC
ACGGGTAATG TCGAGTTGAA CATTACGCCA AAATCCTGGC TGTCCATTAC GGGCCGCACC
GGGATTGATA ATTACACGGA TACCCGTCTG GAACGGTTTG CCCGAAATTC TGCGTCGTTT
CTGACGGGTC TTTTGTCGAA AAACTGGATA ACGGAGAAGC AATTCAACAC CGACGTATTT
GCCAATGCCA ACAAGACATT CAGCAATAAC TTCAGCGGGT CTGTGCTGGT TGGTGTGAAC
TACAATAGCC GCCGACTGGC TACTTTATCA GACCAGATCA CCAGTCTGAT CGTACCAACA
GCCCCCGATA TTCTGACCAA TGCGCTGAAC TCAAACCTAT CGGCCAACAA CTACAACCAG
CTCATTCGAA CATACGCCTA TTACGGCCAG GCTGAAGTAC AGGCCTATAA TATGCTGTTC
CTCACCCTAA CCGGCCGAAG TGAAAGTGCT TCGACATTCG GCGCTAAAAC GAACAGCAGC
TTCTTTTTTC CATCGGCGGC TCTGGCCTGG CAGTTCAGTA AGCTGAAAGG TCTGGAAAAC
AGCTCGGTTC TGAGTTTCGG TAAACTTCGC CTTACCTGGG GACAGGTGGG TATTCAGCCC
CAACCGTATC AGAACTTTAC CACCTTTACG CCAGCCACTT TCGGCGACAG TTACGCAAAC
GGTCTTTCAT CGGCCAGTTC GTTATATGGT GGTGGTTATG TGCGCAGTAC AACAGCAGGC
AACGACTTCC TGAGACCGGA GCGTAAAACT GAAAGCGAAG TCGGGGTAGA CCTGCGCTTC
CTGAACAACC GGTTCACCTT TTCTGCCACC TACTACAACA ACCAAACGAA CGACGTAATT
TTGTCGTTGC CTGTGCCGTA CGAAACCGGC TACACGGTCC GTAACATCAA CGCAGCCCAG
TTGTCGAACA AAGGCATAGA ACTCGATGCC AGTGCTGATG TGGTGCAGAA AGGGGACTTT
AAGTGGAACC TGTCGGCCAA CTTCTCCGAG AACCGCAACA AAGTATTATC GCTGGCTGGC
GCTACAGCTT ACACACTTCC CAACAGTTAT GCCGGGCAGT CGCTGATTGC CGGTCAGCCT
TTCGGGGTGT TCTATGGCAC TGACTTTTTG AAAGATGAGT CGGGCAAGTA CATTTTGGAT
GCGAACGGTT TTCCACAGGG CGGCACCAGC AACGAAATTG TTGGCAACCC AAACCCCAAA
TGGCGGGGTG GTCTGGGGAG TACTTTCTCC TACAAAAATC TATCGCTGTA TGTGTTGTTC
GATAAAGTAT ACGGCAATGA TTTCTACAAC GGTACGCGGG GAGCCCTTTA CCTGATCGGT
ACACATGGCG ATGTGGGCAA TACATCCGTA GCCCCAGCCG GTGGCATAAA AGATTATAAC
GGCAAGACGA TTGCTGCCGG CACGAAGTTT CAGGGCAACA TTACCGACTT CGGCGCCGGG
CCGGTAGCCC TCACCCAAGC CTGGTATCAG GGACCCGGCA CGTCGTTCAG CTCGGCTTCG
ACCAAGCAAT TTGTAGAAGA TGGCGGTTCG ACCCGTTTGC GCGAAGTAAC GCTCACCTAT
AACCTGCGCA GTCCTGGTTT CCGTCGGATC ACTCACCTGT CTTCCGTTGA TTTTTCGCTG
ACTGGCCGGA ACGTTCTGCT ATGGACCAAC TACCGGGGAA CCGATCCCGA AGTCAGCATT
ACCGGCCCCA GCTTGTCACG TGGGCAGGAC TGGTTCACCA ACCCGAACAC CAAATCGCTC
TTGTTTTCGA TCAAGATCAA CTACTAA
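
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. A minimal sketch of that cleanup, together with a GC-fraction calculation that could be compared against the GC content field, is shown here. The helper names are hypothetical, and how this database rounds its GC percentage is not stated on the page.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record above.
    first_line = (
        "ATGAGTCTGC TCCTGGTAGT CTCAAGTTTT "
        "TCGAATGGAT GGGCGCAAGG CCGGAAGGTC"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(100 * gc_fraction(seq), 1))
```

Running the same two functions over the full sequence block would give the value to compare with the reported GC content.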
 
Protein sequence
MSLLLVVSSF SNGWAQGRKV SGTVTADEGI GGFAGATIVV KGTSIGTVTD AKGEYTLTVP 
QTGTILVFSA VGMQSVEEAI GGRSQINVQL KTDTKQLNEV IITALGVKEE RDKFASSVST
VGGKNIAQSG ETGLLSGLSG KASGVVITKS GGDPGAGAYI QIRGQNTING NAQPLFIVDG
IPVSNSNFND GSAAGNSIVQ QSRINDINPE DVESMEVLKG ASAAALWGTR AANGVIIITT
KKGKDSKGKV NISFKSTVSF DKVNKMPPLQ TTYGQGSGGF FRQGNKYSYG DLIAERSGGQ
DTYITDPNAA GYQGFVTFPD GTKRYAIASG NAANPHGGKN SKDTFDHTSD VFQTGHYTDN
AVNISGGNAR SNIAISYSNL SQDGIIKAFS NYQRNTARLN ASNQFTEWFR ASASASYTKV
SSSRVQQGDN LDGILLGGTR TPSDFNNQYY TGTYTDESGQ VFNDAHVSYR NPLGIDQNTI
YSNPVWNINN NKNTSDVDRI TGNVELNITP KSWLSITGRT GIDNYTDTRL ERFARNSASF
LTGLLSKNWI TEKQFNTDVF ANANKTFSNN FSGSVLVGVN YNSRRLATLS DQITSLIVPT
APDILTNALN SNLSANNYNQ LIRTYAYYGQ AEVQAYNMLF LTLTGRSESA STFGAKTNSS
FFFPSAALAW QFSKLKGLEN SSVLSFGKLR LTWGQVGIQP QPYQNFTTFT PATFGDSYAN
GLSSASSLYG GGYVRSTTAG NDFLRPERKT ESEVGVDLRF LNNRFTFSAT YYNNQTNDVI
LSLPVPYETG YTVRNINAAQ LSNKGIELDA SADVVQKGDF KWNLSANFSE NRNKVLSLAG
ATAYTLPNSY AGQSLIAGQP FGVFYGTDFL KDESGKYILD ANGFPQGGTS NEIVGNPNPK
WRGGLGSTFS YKNLSLYVLF DKVYGNDFYN GTRGALYLIG THGDVGNTSV APAGGIKDYN
GKTIAAGTKF QGNITDFGAG PVALTQAWYQ GPGTSFSSAS TKQFVEDGGS TRLREVTLTY
NLRSPGFRRI THLSSVDFSL TGRNVLLWTN YRGTDPEVSI TGPSLSRGQD WFTNPNTKSL
LFSIKINY
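
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a sketch, not the database's own pipeline: the `translate` function is hypothetical, and it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code, differing only in which codons may act as start codons. Since this gene starts with ATG, that difference does not matter here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# assumed here to be shared by translation table 11); '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the 10-base block layout
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAGTCTGC TCCTG"))  # prints "MSLLL"
    print(translate("AACTACTAA"))         # prints "NY"
```

The two example inputs are the first fifteen and last nine bases of the gene sequence, and the outputs agree with the start (MSLLL) and end (NY) of the protein sequence listed above.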