Gene Slin_0116 details


Gene Information

Locus tag: Slin_0116
Symbol:
ID: 8723844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 141672
End bp: 144143
Gene Length: 2472 bp
Protein Length: 823 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003384987
Protein GI: 284035057
COG category:
COG ID:
TIGRFAM ID:
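
The start, end, gene length, and protein length values above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 141672
end_bp = 144143
gene_length_bp = 2472
protein_length_aa = 823


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```
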


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.797457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.0887465
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAAA TTTATATAAT GCGAAAAACT TTACTGCTAC TTTCAATTAC GCTGCTCGGT 
CTGGTGTCCG TTAACGCGCA AACCCTCGTG AGCGGGCGAA TTCTGGATAA AGTTACCAGT
CAGCCGGTTC CCTTTGTCAG TGTGGCCCTC TATCGCCAGC CCGATTCGGT GGCCATTGGC
GGAGCCATAA CCGATTCCTC CGGCCAGTTT CGGATTGTAG AAGCGAAAGC CGGAGTCTAT
ACACTGCGTA CGTTTTTTGT TGGCTATAAA CCGGTTTCGA TGCCACTCAC GGTTGTCCGA
AATCAGCCAC TTGAATTGGG TGCGTTGCTG CTGGAAGGCG ATTCCCGGTT ACTTAACGAA
GTCCGCGTTG CTGGACAGCG AACCGATGTC GTCATGCAGG CCGACCGCCA GACCTATCGG
GCGGCTCAGT TTCAATCTGC TGCCGGTGGT ACGGCTACCG ATATTCTGCG TAATCTGCCC
GGCCTGAGCA TTAATGCCGA AGGGGATGTG AGTCTGCGGG GTGCCAATGG CTTTCTGGTG
CTGCTAAATG GAAAACCTGT TCAGGCTAAT TTGGGAACGC TCCTCAATCA GATGCCAGCG
AACAGCATCG AGTCGATCGA AGTAATCACG ACCCCCAACG CCCGCTACGA CCCCGATGGG
AAAGCGGGCA TTATTGCCAT TGTGACCAAG AAAGGTGTCG ATACCGGCTG GTCGGCGCAG
GTAAACGGCC TGATTGGTCT GCCCAGCACA AACGCGTTCG GGAACGCCTA CAACCCCGTT
CGGTATAGTC CAGATGTGAC GCTCAACTAC CGTTCGGCGC GGTGGGATGT AACACTAAGT
TCGGCTTACA TCCGCAACGA CATTGCCGGA CGGCGCGAAG GCGATGTGAA CACTACCATC
GGTAATCGCC TGACCCGCTT TCCTTCCATA GGGGAGCGGA GTTTTGACCG CTATACCTTT
ACCAACCGCA TCGCTGTTGC TTTTGTACCC AATAAAAACA CAACCTGGAC GCTGGGACTT
TACCAGAGCC AGCGTACCGA AGACCGGCTG GCCGATATTG CCTACACCAA CAGTACAACC
AATCTACTGA CGGGCCAAAC CTACAACCAG CGCGCTTATT TCAACAGCAA TCTAGTGCGC
AAGGGTGGCC GGTTCTATAC TGCCAACCTC GATTATGCCC ACACCTTTGC CAATAAGGCT
ACGCTGAGCG CGGGTGCCCT GTATGAGTAT GACCTGATCG ACGGCTTTAC CAGTAACCGC
AACGTCAATC GGAATAATTA CCGCGATACG CTCCAGTATT CGTACTCGAC CACCAACCGG
CCTATTCAAA ATTTCCGGGC CAACGTCGAT GGTAGCCTGC CTTTTTTAGG CGGAAAGCTG
GAAGGCGGTT ATCAATACCG CTACCAGGAA GATACGGGCG ATTACCGGTA TCGGCAGCAG
GATGGAAATG GGTCGCCGTT GCTGGTCGTT CCGGCCTTTA CCGGCCGTAC CGCCATTAGC
AGCCGTATCC ATAGCTGGTA CACGCAATAT GGCGCGGTGG CCAAAAAGCT GGAATATACG
CTGGGCCTGC GCTATGAATA CGCCGTACGG GAGGTGCTGT CGCTGCCTGC CAATCAGACC
TATGTGCTGA CGCTGAACAA CCTGTTCCCG TCGTTCAACA TGCTTTACAA ACCCAAAGGC
GGACTGGCCC TGCGCGCTGG ATTCAGTCGG CGGGTGCAGC GCAACAGCAA TTTTGCTCTC
AATCCACTGC CCGAACGCGA ACACTCCGAA ACCCTCGAAC AGGGTGATCC GAACCTGTTG
CCCGAGTTTG TTAATCTGGC TGAATTGGGC GTGAGTAAAG ATGTTGGTCG GAGTACGCTG
CTGGCGACGG TCTATTATCA GGGCGTACAG AACACCATCA ACCGGGTCAA CCGGGTGTTT
GCCGATACCA TCCTGAGCCG GATATTTACA AACGCTGGTT TAACGCAACG GTTGGGCGTC
GAGCTGGCCA CCGATCTGAA ACTCACCAAA AGCTGGAAGC TTTATGTGGG GGGAACGGTC
TACCGATTCA CACAGCAGGG GCAGCTTTTC CAGAATGAAG TTATTTTCGA CCGGGCAGCC
TGGGTGTATT CGGTCAATGC CAACTCGACG GTGCAAATTA GTCCAACGCT TATCTGGCAG
GCCAACGTCA ACTACCTCTC CCGGCGTATC ACTGCCCAGG GCGAAGATTC CCGGTTTCTG
ATACCTAATA TGTCGGTGAA AAAGAGTTTG ATGGGCAGTC GCCTGACCAT TATGGCGCAA
TGGCAGAACA TCGGTCTGGG ATTCCTGCCC ACCAACGAGC AGCGGATCAC CACCTACGGG
AAGGACTTCT ACACGACGAC GAACTACATT CAGGAGAAGG ATATTTTCCT GATCAACCTA
AGCTATTCAT TCCGGCAACT CAGTAAACGG GCAAAACTGC CGGGAAATGA TTTCGGCGAA
AAAGAATTTT GA
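
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as the GC content reported in the Gene Information section. The following is a minimal sketch, assuming the sequence is pasted in as plain text. `clean_sequence` and `gc_percent` are hypothetical helper names, and the page does not say how its GC figure was computed or rounded.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# Example using only the first row of the gene sequence above;
# paste the full sequence to compare against the reported GC content.
first_row = "ATGCCGGAAA TTTATATAAT GCGAAAAACT TTACTGCTAC TTTCAATTAC GCTGCTCGGT"
print(len(clean_sequence(first_row)))
print(gc_percent(clean_sequence(first_row)))
```
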
 
Protein sequence
MPEIYIMRKT LLLLSITLLG LVSVNAQTLV SGRILDKVTS QPVPFVSVAL YRQPDSVAIG 
GAITDSSGQF RIVEAKAGVY TLRTFFVGYK PVSMPLTVVR NQPLELGALL LEGDSRLLNE
VRVAGQRTDV VMQADRQTYR AAQFQSAAGG TATDILRNLP GLSINAEGDV SLRGANGFLV
LLNGKPVQAN LGTLLNQMPA NSIESIEVIT TPNARYDPDG KAGIIAIVTK KGVDTGWSAQ
VNGLIGLPST NAFGNAYNPV RYSPDVTLNY RSARWDVTLS SAYIRNDIAG RREGDVNTTI
GNRLTRFPSI GERSFDRYTF TNRIAVAFVP NKNTTWTLGL YQSQRTEDRL ADIAYTNSTT
NLLTGQTYNQ RAYFNSNLVR KGGRFYTANL DYAHTFANKA TLSAGALYEY DLIDGFTSNR
NVNRNNYRDT LQYSYSTTNR PIQNFRANVD GSLPFLGGKL EGGYQYRYQE DTGDYRYRQQ
DGNGSPLLVV PAFTGRTAIS SRIHSWYTQY GAVAKKLEYT LGLRYEYAVR EVLSLPANQT
YVLTLNNLFP SFNMLYKPKG GLALRAGFSR RVQRNSNFAL NPLPEREHSE TLEQGDPNLL
PEFVNLAELG VSKDVGRSTL LATVYYQGVQ NTINRVNRVF ADTILSRIFT NAGLTQRLGV
ELATDLKLTK SWKLYVGGTV YRFTQQGQLF QNEVIFDRAA WVYSVNANST VQISPTLIWQ
ANVNYLSRRI TAQGEDSRFL IPNMSVKKSL MGSRLTIMAQ WQNIGLGFLP TNEQRITTYG
KDFYTTTNYI QEKDIFLINL SYSFRQLSKR AKLPGNDFGE KEF
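
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns sense codons and stop codons the same way as the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative and not taken from any library.

```python
from itertools import product

# Standard codon assignments, listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so block-formatted sequences can be pasted directly.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    sequence = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCCGGAAA TTTATATAAT GCGAAAAACT"))  # MPEIYIMRKT
# The final codons of the gene sequence, ending in the TGA stop codon.
print(translate("AAAGAATTTT GA"))  # KEF
```
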