Gene Slin_4734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4734 
Symbol 
ID8728498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5766728 
End bp5768845 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content52% 
IMG OID 
ProductOligopeptidase B 
Protein accessionYP_003389511 
Protein GI284039581 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.979421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGA CTTGTCAACT GTTCATTATT TCTCTCACGC TCTGTAGCAT GACCCAGGCA 
CAACCGATAA CCCCGCCCAA AGCGGCCGTT AAACCCAAAG AACTGATCAC AAACGGTCAT
AAGCGAACCG ACAATTACTA TTACCTCAAC GAACGCGAAA ATCCGGAGGT GATCAAGTAC
CTGAACCAGG AAAATGCCTA TGTCGAGCAG GTGCTGGCGC CCGTAAAAGA CCTGCAAACC
AAGTTGTTTG AGGAGATGAA AGGGCGCATT AAGCAGCAGG ACGAATCGGT GCCCTACAAA
GAAGGCAACT ATTATTATTA CACCCGCTTC ATAACGGGGG GCGAATACCC GATCTACTGC
CGCAAGAAAG GCTCCTTGCA GGGCACCGAA GAAGTTATGT TCGATGGCAA CGCCATGGCA
AAAGGCCACA ACTATTACCA GTTCGGGGGC TTTGAAGTAT CGGACAACGA TGAACTGGCC
ATTTTTGCCG AAGATACCGT CAGCCGTCGG CTCTATACCC TGCGGGTGAA AAACCTGAAA
ACGGGTAAAC TCTACCCCGA AGCTATTCCC AACACCGAAG GAGGCAGCTT TGCCTGGGCT
ACGGATAACA AGACGCTGTT CTACATCAAA AAAGACCCGC AAACCCTCCT TGGCTATCAG
GTTTACCGGC ACGTGCTGGG TACGGACGCA AAAAACGATG TGCTGGTCTA CAAAGAAAAA
GACAACCAGT TTTACATGGG GCTGGGCCGG TCGAAATCAA AGAAATACAT CACCATCGGC
TCCGATCACA ACGGCGTCGC CACCGAATAC CGGTTACTGG AAGCCAGCAA ACCACTGGGT
GAGTTTGTTC CGTTCCTGCC CCGCCAGAAA GGCCACGAGT ATGACATAGT TCATTACAAA
GACAAGTTTT ACGTCCGCAC GAACTGGAAA GCCGAGAACT TCCGCCTGAT GGAAGTACCG
GAAGGTAAAA CGGCTGATCG TGCTGCCTGG AAGGAAGTGA TTCCCCACCG GGCCGACGTG
TATCTGGAGA ATATGGACAT CTTCGCCAAT CACCTCGTAT TGGGCGAACG CAAAGCCGGG
CTGACCAACA TCCGGGTCAT CAATCAAAAA ACGAAGGCCG ATGAGTACCT CGATTTCGGT
GAAGCGGCCT ACGTAGCGGG TATCAGCTAC AACCCGGATT TCAACACCAA CGTGCTACGT
TACGGTTACT CGTCGCTTAC AACGCCCAGT TCTACCTTCG ACTACAACAT GGATACGAAG
GAGAAAACGC TCAAAAAGCA GCAGGAGGTA CTGGGTGGAT TTGATAAGAA CAACTATACA
TCGGAGCGGT TTTTTGCTAC CGCCCGCGAT GGCGTGAAAG TGCCCGTTTC GCTGGTGTAC
CGCAAGGGCA CGAAGAAAGA TGGCTCGGCT CCTTTGCTCC AGTATTCCTA CGGCTCGTAC
GGCTACTCCA CCGATCCCGG CTTTAGTTCA ACCCGCCTGA GCCTGCTCGA CCGGGGCTTT
ATCTTCGCCA TTGCCCATAT TCGGGGCGGG CAGGAGATGG GCCGGCATTG GTACGAAGAT
GGCAAGATGC TCAAGAAGAA AAACACCTTC AATGATTTTG TCGACGTTTC GGAATACCTC
ATCAAGAACA AGTACACCAG CGCCGATAAG CTCTTTGCTA TGGGCGGCAG CGCGGGTGGT
TTGCTGATGG GTGCCGTTAT TAACCAGGCT CCGCAACTGT ACCGGGGGGT AGTGGCCGCC
GTGCCCTTCG TGGATGTGGT AACGACCATG CTCGACGAAA GCATTCCGCT CACTACGGGC
GAGTTTGAAG AGTGGGGCAA TCCGAAAAAC AAGGAATATT ATGATTACAT GCTATCGTAC
TCGCCCTACG ATAACGTGGA GAAAAAAGCG TACCCAAACC TGCTCGTGAC TACGGGCTTG
CACGATTCAC AAGTGCAGTA TTGGGAACCC GCCAAATGGG TGGCCAAGCT TCGGGAAATG
AAAACGGATA ATAACCAATT ATTACTTCAC ACCAACATGG AAGCGGGTCA CGGCGGTGCC
TCGGGCCGTT TTCAGGCGCT CAAGGAAATC GCGCTGGAAT ACGCGTTCAT GCTCAATTTG
GTTGGGGAGC GGCAGTAA
 
Protein sequence
MKRTCQLFII SLTLCSMTQA QPITPPKAAV KPKELITNGH KRTDNYYYLN ERENPEVIKY 
LNQENAYVEQ VLAPVKDLQT KLFEEMKGRI KQQDESVPYK EGNYYYYTRF ITGGEYPIYC
RKKGSLQGTE EVMFDGNAMA KGHNYYQFGG FEVSDNDELA IFAEDTVSRR LYTLRVKNLK
TGKLYPEAIP NTEGGSFAWA TDNKTLFYIK KDPQTLLGYQ VYRHVLGTDA KNDVLVYKEK
DNQFYMGLGR SKSKKYITIG SDHNGVATEY RLLEASKPLG EFVPFLPRQK GHEYDIVHYK
DKFYVRTNWK AENFRLMEVP EGKTADRAAW KEVIPHRADV YLENMDIFAN HLVLGERKAG
LTNIRVINQK TKADEYLDFG EAAYVAGISY NPDFNTNVLR YGYSSLTTPS STFDYNMDTK
EKTLKKQQEV LGGFDKNNYT SERFFATARD GVKVPVSLVY RKGTKKDGSA PLLQYSYGSY
GYSTDPGFSS TRLSLLDRGF IFAIAHIRGG QEMGRHWYED GKMLKKKNTF NDFVDVSEYL
IKNKYTSADK LFAMGGSAGG LLMGAVINQA PQLYRGVVAA VPFVDVVTTM LDESIPLTTG
EFEEWGNPKN KEYYDYMLSY SPYDNVEKKA YPNLLVTTGL HDSQVQYWEP AKWVAKLREM
KTDNNQLLLH TNMEAGHGGA SGRFQALKEI ALEYAFMLNL VGERQ