Gene Slin_3920 details


Gene Information

Locus tag: Slin_3920
Symbol:
ID: 8727678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4703227
End bp: 4704813
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: peptidase M61 domain protein
Protein accession: YP_003388709
Protein GI: 284038779
COG category:
COG ID:
TIGRFAM ID:
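
The coordinate and length fields in this record are related by simple arithmetic, which the minimal sketch below checks. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Both are assumptions, not statements made by the record, although they are consistent with its values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4703227
end_bp = 4704813

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which is not counted as an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1587
print(protein_length)  # 528
```

Under these assumptions the computed values match the listed Gene Length (1587 bp) and Protein Length (528 aa).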


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.144473
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.231177
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTATC GCCTGTCTGC TGATTCCTCC AGCCCCCATT ATATTGCCGT TGATGCTCAT 
TTAGCCAACA TTTCGACCTC GGAAGTTGAA CTGCAATTGC CCGCCTGGCG TCCCGGCCGC
TATGAACTAC AGCAGTTTGC GAAAAATATT CAACGATTTG AGATCGTTGA CCAGGCCGGT
AAACCGCTTT CGTTTCGAAA AATTACCAAA GATCGCTGGC TAGTACAAAC CGACGGCGTT
AGTGAGCTTA CCGTTCGGTA TACTTATTAC GCCCTCCTGC CAACACCCAA CCAGCTTAAT
GCGGGCAGCA GTTTTATCAG CGAGTCGCTC CTGTATGTTA ACCCGGTAAA CCTGTGCCTG
TATGCCGAAG GACGCATTTC GGAGCCCTGT ACGCTGGAAC TGGCCATTCC CGATAGCTGG
ACGGTTGCCT GTGGCCTGAC CGAAGCCCGG TCCGAACAAC CCAATATACG TACGTTACTG
GCCGCTGATT TTTATGAACT GGTCGATTGC CCGCTCATAG CTGCTCCGGT TATTCAGGAT
ATACAGTACA CCGTGGGCGA TACGGATTTT CACGTCTGGA TTCAGGGCGG TCGGCGGACG
GATGGTAATC CCACCTTCGA TGCCGACCGG ATTGTGGCCG ACTTCCGGCG TTTTTCGGTG
AAGCAGATCG AACTTTACGG CGAGTTTCCC GAAAAGGCGT ATCATTTCCT GACGCTCATT
CTACCTGTTC CCTACTATCA CGGTGTCGAA CACCGCAACT CGACCGTACT GACGCTCGGC
CCGAACGATG AGGGAGAGGG GCTGTATCAG GATTTGTTGG GGGTTTCGTC GCACGAGTTG
TTTCATGCCT GGAACATTAT CCGCATTCGC CCTACCGAAC TGCTGCCGTA CGATTTTACG
AAGGAGAATT ATTTTACGAC CTGCTTTGTC GCCGAGGGCG TAACGACCTA TTACGGCGAT
TTAATGCTGC GGCAATCGGG CGTGTTTACC GACGAAGCGT ATTTAAAAGA ATTACAGGTT
TTGCTGAAGC GTCATTTCGA GAACAACGGG CGGGCCTTCC AGTCGCTTAC CGAATCGTCC
TGGGATTTGT GGCTCGACGG TTACGACAAG GGCGTTCCCG ACCGCAAAGT GTCGGTTTAC
CACAAAGGAG CCATTGCCGC TCTGATTCTT GACCTGCACA TCCGACAGGT AACCGACCAC
GCCCGCTCGC TGGACGACGT TATGCGCCAG ATGTGGCAGC GTTTCGGTAA ACCATTCATT
GGCTATACCC TGGACGATTA CCGCGCCGTA ACCGAAGCCG TTGCGGGCGA GCCGCTTGAC
TGGTATTATG CCGTGTGTAT CTTTGGCAAT CAGCCACTTG AACCCTTGCT GAACAAGTAT
CTGGCGTGGG TTGGCCTGCT GGTCGCCTAT GAAGAGCCAA CGCCCGACCA GCCGGGTGGC
ATACGCTTAC TGGAGATCGA CAGCCAGGAA GGTCGTCAGC ATCGAGCTCG GTGGTTTGGG
CAAGTAAAAG TTGACGAGCC TGTTTCAGAA GGGAGTGTAC ATCCTGTACC TCAAGAGAAA
CTGGGTAAAA ACGTAGTTGC GAAATGA
 
Protein sequence
MRYRLSADSS SPHYIAVDAH LANISTSEVE LQLPAWRPGR YELQQFAKNI QRFEIVDQAG 
KPLSFRKITK DRWLVQTDGV SELTVRYTYY ALLPTPNQLN AGSSFISESL LYVNPVNLCL
YAEGRISEPC TLELAIPDSW TVACGLTEAR SEQPNIRTLL AADFYELVDC PLIAAPVIQD
IQYTVGDTDF HVWIQGGRRT DGNPTFDADR IVADFRRFSV KQIELYGEFP EKAYHFLTLI
LPVPYYHGVE HRNSTVLTLG PNDEGEGLYQ DLLGVSSHEL FHAWNIIRIR PTELLPYDFT
KENYFTTCFV AEGVTTYYGD LMLRQSGVFT DEAYLKELQV LLKRHFENNG RAFQSLTESS
WDLWLDGYDK GVPDRKVSVY HKGAIAALIL DLHIRQVTDH ARSLDDVMRQ MWQRFGKPFI
GYTLDDYRAV TEAVAGEPLD WYYAVCIFGN QPLEPLLNKY LAWVGLLVAY EEPTPDQPGG
IRLLEIDSQE GRQHRARWFG QVKVDEPVSE GSVHPVPQEK LGKNVVAK
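
Both sequences are displayed in space-separated blocks of 10 characters, so the grouping has to be removed before the text can be used as a sequence. The sketch below shows one way to do that, using only the first and last lines of the gene sequence as sample input. The helper name `join_blocks` is hypothetical and is not part of any tool associated with this record.

```python
def join_blocks(lines):
    """Remove the spaces and line breaks used only for display grouping."""
    return "".join("".join(line.split()) for line in lines)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCGCTATC GCCTGTCTGC TGATTCCTCC AGCCCCCATT ATATTGCCGT TGATGCTCAT"
last_line = "CTGGGTAAAA ACGTAGTTGC GAAATGA"

head = join_blocks([first_line])
tail = join_blocks([last_line])

print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 27 TGA
```

The sample starts with the start codon ATG and ends with the stop codon TGA. If each of the 26 full lines of the gene sequence holds 60 bases, the total is 26 × 60 + 27 = 1587, which agrees with the listed gene length.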