Gene Slin_4206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4206 
Symbol 
ID8727965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5065280 
End bp5068351 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content52% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003388990 
Protein GI284039060 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.43352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAAT TACTACAAAT TGGTTTCCTG ATTTTGATAA CCGTATGGGC AACCTACGCA 
CAGGGTCAGG CCGTTTCAGG CAGAGTTACA TCATCAGACG ACGGTAATCC CCTACCCGGC
GTATCTGTCA GTGTCAAAGG AACGACACAG GGAACGTTAA CCGATGCCTC CGGTAACTAC
CGCATTAACG CAGGTAACAA CGCGGTAGTC GTTTTCAGCT TTATTGGCTT CACGACTCAG
GAAGAAAAGG TGGCGAATCG GTCAGAAATC AATGTCCAGT TAAAAACCGA TGTACGTAAC
CTGAGTGAGG TTGTTGTAAC AGGTTACGGG CAGCAGATCA AACGGGACCT AACCGGCAAC
ATCGCGAAAG TTAAAGCCGC CGATATTCAG GATCAGCCCG TAACGACCTT CGATCAGGCA
TTACAGGGCA AAGCGGCCGG CGTTCAAATC AATTCCGGCT CCGGCAAACT GGGTCAGGGA
ATACAGGTTC GGGTACGGGG CCAGTCGTCG GTATCGGCAT CGAACCAACC GCTTTACATC
ATCGACGGTA TTCCCGTCAC GACAGACAAC CTAAGTATCA CCAGTTCAGC CACCAATCCT
TTGGCCGATA TTAACCCTCA GGATATTGAG TCGGTCGATA TTCTGAAAGA TGCGTCGGCC
GGAGCTATTT ACGGTGCCCG GGCGGCCAAT GGTGTCGTGC TGATTACCAC CAAACGCGGC
AAAGCCGGAC GTACCAACGT TAATTTCGGT GCTCAGTACG GGTCAAGCAA GCCAACCCGC
AAGCTGGAGT TTCTGAATAC GGAACAGTAC GTTAAGTTTT ACAATCAGGC AGCGGCCAAT
TCCGACCGAA TTGAGGGGCT AGATCCGAGC GACCCTGACT CGTATACCAC GTATATGAAG
GATTTCTACC AGACGCAGGG ATTAGGTACC TACGGCACGT CTAACCAGGC AAGTACCAAC
TGGGGTGATC TGGCCTATCA GGATGCCCCC TATCAGCAGT ATGATCTGAA CCTGAACGGT
GGTAACGAAA AAACGACGTT TTACCTCTCG GGCCAACTGC TGGATCAGAA AGGTATTCTG
GTTGGTAATG CCCTGCAGCG CTATGCCGGC CGTCTGAATA TCGAACACCA GGTATCCAGC
CGGTTCAAAG CAGGCTTTAA CATGGGACTG ACTCGTACTC TGAACCAACG CATCTCGGGC
GATAACCAGT TCGATAACCC CATGCAGATG GTGGCCCTGC CGCCAATGAC ACCCGCAACG
GATGCCACAA CGGGGCTCCC TGTGGGCTCC CCTCCCGGCG ACATCAGCAT TCCGGTTTAC
TACAACCCAC TCATTAACAT CGGCAATGCG TATTTCAACA CCACCGTTTA CCGGAATATC
AGCAATGTAT TTGGGCAATT GCAGATTATG AAAGGCCTAA CGTTCCGAAC AGAGTTTGGC
CTCGATGTAC TGAATCAGCA GGAAGAGTTG TACTACAACA GCAAAACGCA GCGGAACTTT
GGCTCACCGC TGGGCCTGGG CCGGAACCGT TTCGCCCGGG TAGAAAACTA TACGACGAAC
AACTTCTTCA ATTATTCCAC CGCTTTTGGC CGGAGTAACC TCGACGCTAC GGTGGGGATG
TCGTATCAGC AATCGCAGCA GAAAACGAAC TTCACCGAAG GCCGGGATTT CCCGTCTGAT
GCTTACCGGA TGATTGCCAG TGCGGCCCGC AAAACCGACG GTAGCTCGTC GCAAACGGAT
TACCGCTTCC TGTCTTACTT TGCCCGCGCC AATTACAAAT TTGCCGACCG CTACCTACTT
GGTGTGAGTG CGCGGGTAGA CGGTTCATCC CGCTTTGGTA ACAACAGCCG GTATGGCTTC
TTCCCATCTG TTTCGGCAGG CTGGGTGCTT AGTGAGGAGG GGTTCATGAA AAACACAACG
GCTATCAGCT TCCTGAAACT TCGGGCCAGC TACGGCCAGA CGGGTAATGC CGAAATTCCG
AATTTCCCGC AGTTGGGTCT CTTCACCGGC GATGCCGGCT ATGGTACGCT GCCTGGTCAG
CGCCCATCGC AGTTGGCCAA CCCCGACCTG AAATGGGAAA CAACCAATCA GTTCGACATT
GGTATCGACT TTGGTATTCT TAACAACCGC ATCAATGGCG AAATCGACTA TTACAACAAA
CAAACGTCGG GTCTGCTGCT CAATGTAAAC GTGCCGGGAA CGACAGGCTT TGCCACGCAG
TTCCGCAACG TAGGCAGCCT CGAAAACAAG GGGGTCGAAA TTGTTATTAA TACCGAAAAC
CTGACGGGTG CTTTCCGCTG GACAACGAGC TTCAACGCAG CTACGAACCA GAACAAGATC
AACAACTTAC AGGGCCAGAT TATCGAAGGC GGTATCAATG CGATGAGTCG TGCGGTAGAA
GGCCAGCCAC TGGGCGTTTA TTTCACGCAG GAATATGCCG GTGTTGATCC GGCCAATGGC
GATGCTCTTT GGTTCAAAAA CACCACCAAT ACCGACGGTA CTATTGACCG GAGCACCACT
AAAACCTACA ACCAGGCTCA GCGGGTTGTT GCTGGTAGCC CGTTACCCAA GTGGACGGGT
GGTATTACCA ACACGTTCAG CTACAAAGGT TTCTCACTGA GTGTACTGTT CAACGGTGTT
TTTGGCAACA AGATCAACTT CTACGGTGTA GGTCGCTACT CATCGGCCAA CGGTCGTTTC
GAGGATAACC AGACGGTCAA CCAGTTGGCA GCCTGGACGA AAGAGAACCC CAACACCAAC
GTTCCGGAAG CCCGTCTGTT CTACAACAAC GGTGCCCAGT CGTCCAGCCG TTTCATTCTT
GATGGTTCAT TCGTTCGGTT ACGTACGGCT ACCTTATCTT ACTCGCTGCC CAAAACGCTC
GTTAACCGGG TTAAGATGAA TAGCGTCCGT CTGTTCGTTA CAGGACAAAA CCTGCTAACG
TTTACCAACT ATGCCGGATG GGACCCTGAA GTCAACGCCG ACTACATTGT GTCGAACATT
GCGCAGGGGT ACGATTTCTA CACGGCTCCC CAGGCACGCA CCATTACGGG CGGTATTAAC
ATTGGTTTCT AA
 
Protein sequence
MRKLLQIGFL ILITVWATYA QGQAVSGRVT SSDDGNPLPG VSVSVKGTTQ GTLTDASGNY 
RINAGNNAVV VFSFIGFTTQ EEKVANRSEI NVQLKTDVRN LSEVVVTGYG QQIKRDLTGN
IAKVKAADIQ DQPVTTFDQA LQGKAAGVQI NSGSGKLGQG IQVRVRGQSS VSASNQPLYI
IDGIPVTTDN LSITSSATNP LADINPQDIE SVDILKDASA GAIYGARAAN GVVLITTKRG
KAGRTNVNFG AQYGSSKPTR KLEFLNTEQY VKFYNQAAAN SDRIEGLDPS DPDSYTTYMK
DFYQTQGLGT YGTSNQASTN WGDLAYQDAP YQQYDLNLNG GNEKTTFYLS GQLLDQKGIL
VGNALQRYAG RLNIEHQVSS RFKAGFNMGL TRTLNQRISG DNQFDNPMQM VALPPMTPAT
DATTGLPVGS PPGDISIPVY YNPLINIGNA YFNTTVYRNI SNVFGQLQIM KGLTFRTEFG
LDVLNQQEEL YYNSKTQRNF GSPLGLGRNR FARVENYTTN NFFNYSTAFG RSNLDATVGM
SYQQSQQKTN FTEGRDFPSD AYRMIASAAR KTDGSSSQTD YRFLSYFARA NYKFADRYLL
GVSARVDGSS RFGNNSRYGF FPSVSAGWVL SEEGFMKNTT AISFLKLRAS YGQTGNAEIP
NFPQLGLFTG DAGYGTLPGQ RPSQLANPDL KWETTNQFDI GIDFGILNNR INGEIDYYNK
QTSGLLLNVN VPGTTGFATQ FRNVGSLENK GVEIVINTEN LTGAFRWTTS FNAATNQNKI
NNLQGQIIEG GINAMSRAVE GQPLGVYFTQ EYAGVDPANG DALWFKNTTN TDGTIDRSTT
KTYNQAQRVV AGSPLPKWTG GITNTFSYKG FSLSVLFNGV FGNKINFYGV GRYSSANGRF
EDNQTVNQLA AWTKENPNTN VPEARLFYNN GAQSSSRFIL DGSFVRLRTA TLSYSLPKTL
VNRVKMNSVR LFVTGQNLLT FTNYAGWDPE VNADYIVSNI AQGYDFYTAP QARTITGGIN
IGF