Gene Slin_3086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3086 
Symbol 
ID8726839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3742064 
End bp3744505 
Gene Length2442 bp 
Protein Length813 aa 
Translation table11 
GC content47% 
IMG OID 
ProductType I site-specific deoxyribonuclease 
Protein accessionYP_003387896 
Protein GI284037966 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.461628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.442211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA AAACCCTTTC TGAGCGAGAT ATATGCACCA AGTTCATTAC ACCGGCTTTA 
GAACTGGCTG GCTGGAAAGA CAAATTTTTA GAAGAGGTTT CGTTTACTGA TGGCCGTATA
CGCGTGGTAG GCAAGATGAC TACGCGGGGT GTATCGAAGC GGGCGGATTA CATTCTGTAT
TATAAGCCGA ACATTCCGGT GGCTATTGTT GAAGCGAAAG ACAACAAACA TACCGTATCG
GCGGGATTGC AGCAGGCCTT AGAGTATGCG CGCATTCTGG ATATTCCCTC CGTATTCAGT
AGCAACGGTG ACGGTTTCAT TTTTCATGAC CGTACTGCCA CAGACGATAC CATAGAGCAA
GAACTTACGC TGAATGAGTT TCCCACCCCT GCCCAACTCT GGGAGCGTTA CAAGGCTTAT
AAAGGCCTTG AATCTACCGA AGAGGATGCA ATTGCTCAAC AGGAATACTA TACTGATGGA
TCGGGACGCC AGCCACGCTA TTATCAACAA ATAGCCATCA ATAGAACTGT GGAGGCCATT
GCGAAAGGCC AAAACCGGGT ACTGTTGGTA ATGGCAACCG GAACCGGAAA AACGTACACC
GCTTTTCAGA TGATTTATCG CCTTTGGAAA AGCGGCCGTA AAAAGCGGAT TTTGTTTCTA
GCCGACCGGA ACGCCTTGAT CGACCAGACG CGCCGAGGAG ATTTCAAGTA CTTTCGCGAT
AAGATGACGA TCATCCGTAA GAGGGTGGTG AACGTCGGCG GGAAAGAAGA ATTAGTATCG
ACTCGTAGGC GGGGCATTAG TGCGACAGAT AAAGCATATG AGATATTCCT AGGCTTGTAT
CAAGGACTCA CTGGCAACGA AGGTATAGAC GCTTATAAAG ACTTCTCCCC TGACTTTTTT
GACCTAATTG TTGTCGATGA GTGCCATCGG GGTAGTGCTT CCGATGATTC ATCCTGGCGG
GTTATTCTCG ATTATTTCAA AGGAGCCACA CAGGTAGGTC TGACGGCAAC ACCACGCGAA
ACAAACACTG TTTCTAACAG TGAATACTTC GGCGACCCTC TTTATACTTA CTCGCTCAAA
CAAGGCATTG ACGATGGGTT TCTGGCCCCG TACCGGGTTG TACGGATTGG GTTGAACGTT
GACCTGGAAG GCTGGCGACC GCCAAAAGGA AAACGAGACA AAAAGGGAAA CCCAGTAGAG
GATCGAATTT ATAACCGCTC CGATTTTGAC CGCAACATCG TTGTGGAGGA CCGCCGTAGG
CTGGTTGCCG AAAAGATTAC GGAATACCTG AAAGGCCAGA ATCGTTTCAT GAAAACCATT
GTGTTCTGCG TAGACATCGA ACACGCAGAC GGTATGCGAA ACGCTTTGGT CAAACAGAAC
GCCGATCTGG TCAAACAGAA TTATAAGTAT GTGATGAAGA TCACAGGCGA CGATGAAGAA
GGTAAACGAG AGCTGGATAA TTTCATCAAT CCCGAAGAAC GCTACCCGGT TATTGCCACG
ACATCAAAGC TGATGACAAC TGGCGTAGAT GCACAGACTT GTCAGCTAAT CGTACTGGAT
AGTAATATTC AATCAATGAC GGAGTTTAAG CAGATCATTG GCCGTGGAAC CCGTATCAAT
GAAGAGTTCG GCAAGCTATA TTTTACGATT CTGGATTTTC GTAATGTAAC AGACTTATTT
GCTGATCCTG ACTTCGACGG TGATCCTGTT CGAGTGAAAA TAGTTAGCGA AGACGAAACT
CTAGAAACAG TAGAAGCTGA GGAAGAAGCT GATGAAAGTT TAGTATCGAA TGATGAGGCG
GAGGTCGAAA TAGAAGAACC TCTACGGCCA AAAGTCCGCT ATAGCCTAGA TGATGAGCCC
GAGATCGTCA ACGACGAACG GAAGGTTTAT GTCAATGGCG TTGACGTATC GGTGCTGAAC
AGCCTTGAAC TCACCTTCGA CAATGATGGC AAACCTATTT TAGTGGGCCT GAAAGATTTT
ACCCGCGACA AAATGCGGGA GAAGTTCAGG AGCATGGACG ATTTCCTAAC CTACTGGAAT
GCTGCCCAGC GCAAAGAAGT CATTGTTCAG GAGCTCATGG AACAGGGCGT GCTACTAGAT
GCCTTCACCA ACGCCGTTGA CCGCGATGCT GACTTATTCG ATCTGATTTG CCACGTTGCC
TTCGACCAGA AGCCGTTGAC ACGAAAAGAG CGAGCCAATG AAGTAAAGAA GCGAAATTAC
TTCGGTAAAT ACGGAGAGAA ATCCCGCGCT GTTCTGGAGG CTTTATTAGA CAAATACGCT
GATGAGGGCG TGGTTAATAT TGAAACGTTG GATGTTCTTC GCGTTCAGCC CCTCAATAAA
TACGGATCGA CGGTAGAGAT TGTTAAGTTG TTCGGTGGCA AGCCGCAATA CTTAGAGGCC
GTACGCGAAT TAGAAAGCGA AATTTATAAA GCAGTAGCTT AA
 
Protein sequence
MNKKTLSERD ICTKFITPAL ELAGWKDKFL EEVSFTDGRI RVVGKMTTRG VSKRADYILY 
YKPNIPVAIV EAKDNKHTVS AGLQQALEYA RILDIPSVFS SNGDGFIFHD RTATDDTIEQ
ELTLNEFPTP AQLWERYKAY KGLESTEEDA IAQQEYYTDG SGRQPRYYQQ IAINRTVEAI
AKGQNRVLLV MATGTGKTYT AFQMIYRLWK SGRKKRILFL ADRNALIDQT RRGDFKYFRD
KMTIIRKRVV NVGGKEELVS TRRRGISATD KAYEIFLGLY QGLTGNEGID AYKDFSPDFF
DLIVVDECHR GSASDDSSWR VILDYFKGAT QVGLTATPRE TNTVSNSEYF GDPLYTYSLK
QGIDDGFLAP YRVVRIGLNV DLEGWRPPKG KRDKKGNPVE DRIYNRSDFD RNIVVEDRRR
LVAEKITEYL KGQNRFMKTI VFCVDIEHAD GMRNALVKQN ADLVKQNYKY VMKITGDDEE
GKRELDNFIN PEERYPVIAT TSKLMTTGVD AQTCQLIVLD SNIQSMTEFK QIIGRGTRIN
EEFGKLYFTI LDFRNVTDLF ADPDFDGDPV RVKIVSEDET LETVEAEEEA DESLVSNDEA
EVEIEEPLRP KVRYSLDDEP EIVNDERKVY VNGVDVSVLN SLELTFDNDG KPILVGLKDF
TRDKMREKFR SMDDFLTYWN AAQRKEVIVQ ELMEQGVLLD AFTNAVDRDA DLFDLICHVA
FDQKPLTRKE RANEVKKRNY FGKYGEKSRA VLEALLDKYA DEGVVNIETL DVLRVQPLNK
YGSTVEIVKL FGGKPQYLEA VRELESEIYK AVA