Gene Slin_3993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3993 
Symbol 
ID8727751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4801958 
End bp4805203 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table11 
GC content53% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003388782 
Protein GI284038852 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.356607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00078443 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGAAC GATTATTTGC ACTTCTCTTC TTACTTCCTT TGCTTTCTTT TGCCCAGCAG 
CGCGTTGTGC GTGGTAAGGT AAGCGATGCT AATGGGAAAG AATCTCTACC CGGTACCACG
GTAACGGTGA AAGGTGGTAC AGCGGGTACA GTAACCGACG CACAGGGAAC TTATCAGATA
AATGTACCCG ATAAAGCAGC AACGCTGGTT TTCTCGTCTG TCGGCTTTCG ACTACAAGAG
ATAGTTGTTG GTAATCAGCA GGTTATCGAT GTGTCGCTCA TTGCCGATAC CAAGCAGCTC
AGTGAAGTGG TGGTTACCGC AGCGGGTATC AAACGGGATA AAAATGCGCT CGGTTATTCG
GTATCAACGC TGGATGCCAA CAAGCTGGCT CAGCGGTCAG AACCGGATCC ACTCCGGGCA
CTTACCGGTA AAGTCGCGGG TGTTAATGTG CAGGGTTCAG GGGGGGCGGC CGGTGGTGCA
ACGAATATTA CTATTCGGGG AAATTCGTCG CTGGGCAATA ACAACCAGCC ACTGTTTGTG
GTCGATGGGG TTCCCTTTGA TAATTCCAGT TTCGGCAGTA CGGACGGATT TGTTGGCGGG
TCGACCGTTA CCAACCGGGC TTTTGACCTG GACCCGAACA ATATTCTTAC CATGACGGTG
CTGAAAGGAG CCGCTGCGGC TGCCCTTTAC GGCTCCAGAG CGGCTAACGG AGCCATCATC
GTGACCACAA AAGCCGGTAA AAGCACCAGC CGGAAAGGGC TTGAAATCAC GTATAGTTCC
GGCTACTCGA CAGAAACGGT GGCCGGTTTG CCGGATTACC AAACCAAGTA TGGACAGGGA
ACGAACTTCG ATTATCGGAG TGGTGTATTC GGCTCGTGGG GACCGCCGTA CGCAGGCTAT
ACCAGCCTGA TTCCCACCCG GGCTACCATT CCGCACCCGC TCACGACGAA TAACCGCTAT
CCATCAACGG TATTTCCGCA GTTTTATCAG GCCGATGGCA CAACGCCCAT TCAGGTACCT
TACCAGTCGT ATTCGCAGGG CAACGCCAAA AATTTCTTTC GGACAGGAAA CGTCTTTGAA
AACGCCCTCT CGGTATCGAC AGGTGGCTCG AAGGGGAATT TTACGGTGGG TCTTTCCCGC
ACGGGTAATC AGGGTGTAGT GCCCGAGAAC CAGATCACCC GTACCGGTAT CAACATTGGT
GGTAACGCCC AGCTGGATAA TAAGTTCTAC GTGAGCGGTG CCCTGAACTA CGTCAATACC
GAGCAGGTAT CTCCGCAGGT GACGGCGGCC AATGGCAGCG GCAGTTCCAT TATGGACATC
CTGATGTTTG TGCCTACCAG CTTCGACCTG ACGGGTTACC CGAACACCAA TCCGCTGAAC
GGCAATAATG TATACGACCG CGTCGGTACC GATAACCCTT ACTGGTCGGT CAAAAACAGT
CCGACAATCA GTAAGGTTGA TCGCTATTAC GGTAATGTCG TGCTGGGGGT CGATCCGTTG
CCCTGGCTGA ACGTGCAGAA TACCGTCGGC TTCAATGCGT ATACCGACCG GCGTGTGGTC
GTCAATGGAA AAGGCGGGGA CTACTTCCCG AACGGCAACA TCACGAACGA TAATATCTAT
CGGCAGGAAC TGGACAACAC CCTGCTCGTA ACGGCAACTA AACCGCTTTC GGAAAACATT
GGCCTGAAAG TAATTCTGGG AAACAACGTT AACCAGCGCG TAACGGAGCG GCAGGTTGTG
TTTGGCGACG GGATCATTTT TCGGGGTATC AACTCGCTGA ACAACACCAG CGTGACCATT
CCCCGCGTGT TGCCCAATAA CCGCAATAAT TTCAAACAGC GATACTTTGC ATTCTTCACC
GATATTTCGC TGGACTACAA AAACTACGCG TTTTTAAACC TCGTAGCCCG TAACGACGTA
TCGTCTACAC TGCCCGCCAG CAACCGAAGC TACCTGTATG GCGGGGCCAG TGCCTCCCTG
ATCTTTACCG AAGCGTTGAA ACTGCCTAAA AACGTTCTTT CGTTTGGTAA GCTTCGGGCT
GGCTACACCC GCGTAGGTAA CGAAGCAACG CCTTACCAGA CGCAAACGGT GTACATCGCC
AACCCTGTTC TGGGCGTTGG GTCGGGCACG GGATCTATCT CATCGCCGTT CAGCGGCCAA
AGTACGCTGT CCGAATCAGA CTTGCTGGCC AATTCAGAGT TGAAGCCCGA GTTCATTACC
GAGCTTGAGG TGGGTACGGA GTTACAGTTT TTCAATAACC GGATAGGCCT GGACATCACG
TACTATAACA AGATCAGTAC GTCGCAGATT TTTACGGTCA ACGCCACACC CTCGTCGGGG
TACACGCAGC GGGTAATTAA TCTGGGGCGT TCGTCTAACG AAGGCATCGA GATTGGTCTG
ACGGCAACGC CCGTAAAACT CAAGAACGGG TTTAGCTGGG ATATCTCATC GGCCTTCACC
ATGAACCGCA ACATTGTACT GGACATAGGT TCCCTGAAAG AACTGCCCTA CGGTGGTTTC
TCGGACCTAG GTAGTGTACA CATTGCCGGT CAGCCCTACG GACAGATTCG CGGGTCGACG
TATGCCAGAG ATGAGGTGGG CAATATCCTG GTCAATCCCA ACACGGGAAA GCCGATCCTG
AGCGGTAAAA CGGCCGCTAT CGGTAATCCA AACCCCGATT TTATTCTGGG CGTAACCAAT
ACCTTTAGTT ATAAAGGGCT TACGCTGAGC GCGTTGTTCG ACTGGAAGAA AGGGGGCGAC
ATGTACTCCT TTACGGCTTA CGAACTGCTA AGCCGGGGTG CAACCAAAGA CACCGAAGAG
CGGGAGGCCA TTCTGGTGGG ACCGGGTGTG CTGGGCGATG TGAACACGCT GAAACCCCTG
CTGGATGGCG AAGGCAAAAA GATTCCGAAC AACATTGGTA TTGCCGTAGC TGATTACTAT
TTCACGGGCG GCTTTGGACC GGGTGGCGCG GGCGAAACGA ATATTTTCGA CGCCACCATT
TTCCGGCTTC GTGAAGTATC GCTGGGCTAC CAGTTTCCGA AAAAATGGCT GGCGAAAACA
CCCTTTGGCG GGGCGTTCCT ATCCGTCAGT GGGCGCAACC TATGGTACCT GGCTCCCAAC
TTTCCCAAGT ACCTGAATTT TGATCCCGAG GTTAGCTCAT TAGGTGCCGG TAATTCACAG
GGCTTCGATT TCATCGGTAT ACCGACAACC CGCCGACTGG GTGTCAACCT GCGGTTCAGC
TTCTGA
 
Protein sequence
MKERLFALLF LLPLLSFAQQ RVVRGKVSDA NGKESLPGTT VTVKGGTAGT VTDAQGTYQI 
NVPDKAATLV FSSVGFRLQE IVVGNQQVID VSLIADTKQL SEVVVTAAGI KRDKNALGYS
VSTLDANKLA QRSEPDPLRA LTGKVAGVNV QGSGGAAGGA TNITIRGNSS LGNNNQPLFV
VDGVPFDNSS FGSTDGFVGG STVTNRAFDL DPNNILTMTV LKGAAAAALY GSRAANGAII
VTTKAGKSTS RKGLEITYSS GYSTETVAGL PDYQTKYGQG TNFDYRSGVF GSWGPPYAGY
TSLIPTRATI PHPLTTNNRY PSTVFPQFYQ ADGTTPIQVP YQSYSQGNAK NFFRTGNVFE
NALSVSTGGS KGNFTVGLSR TGNQGVVPEN QITRTGINIG GNAQLDNKFY VSGALNYVNT
EQVSPQVTAA NGSGSSIMDI LMFVPTSFDL TGYPNTNPLN GNNVYDRVGT DNPYWSVKNS
PTISKVDRYY GNVVLGVDPL PWLNVQNTVG FNAYTDRRVV VNGKGGDYFP NGNITNDNIY
RQELDNTLLV TATKPLSENI GLKVILGNNV NQRVTERQVV FGDGIIFRGI NSLNNTSVTI
PRVLPNNRNN FKQRYFAFFT DISLDYKNYA FLNLVARNDV SSTLPASNRS YLYGGASASL
IFTEALKLPK NVLSFGKLRA GYTRVGNEAT PYQTQTVYIA NPVLGVGSGT GSISSPFSGQ
STLSESDLLA NSELKPEFIT ELEVGTELQF FNNRIGLDIT YYNKISTSQI FTVNATPSSG
YTQRVINLGR SSNEGIEIGL TATPVKLKNG FSWDISSAFT MNRNIVLDIG SLKELPYGGF
SDLGSVHIAG QPYGQIRGST YARDEVGNIL VNPNTGKPIL SGKTAAIGNP NPDFILGVTN
TFSYKGLTLS ALFDWKKGGD MYSFTAYELL SRGATKDTEE REAILVGPGV LGDVNTLKPL
LDGEGKKIPN NIGIAVADYY FTGGFGPGGA GETNIFDATI FRLREVSLGY QFPKKWLAKT
PFGGAFLSVS GRNLWYLAPN FPKYLNFDPE VSSLGAGNSQ GFDFIGIPTT RRLGVNLRFS
F