Gene Slin_0268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0268 
Symbol 
ID8723996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp351334 
End bp353751 
Gene Length2418 bp 
Protein Length805 aa 
Translation table11 
GC content51% 
IMG OID 
ProductSmr protein/MutS2 
Protein accessionYP_003385132 
Protein GI284035202 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0912246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTATC CCAATACCTT AGAAAACAAG CTAGGTTTCG ATACCATTCG CGAACGGCTC 
AAAGAAGCCT GTGTCAGTCC ACTGGGGCAG GACTACGTTG AGAAGATTCG CTTTACCGAT
AATGTCCAGC TCATTGATAA ACTGCTTCGG CAAACGACCG AATTCAAGCA GATCGTTCAG
TATGAGCCGG ATTTCCCGAC GAGCAACTAC ATCGACGTTC GCCCGCATTT GAGCCGGGCC
CGCGTGGAAG GATTAGCCCT GACGGAAGCC GAATTCTTTG ACCTGAAACT CGCCCTTCGG
ACCATTCAGG ACTGTCTGCG CTTCTTGTCC AAACGGGAAG AGGATAACAA TTTCCCCTTT
CTGCGTGAGC TGGCCGGTCC GGTCGGTGTA GACAAGAAGC TGACGGACGC GCTCGAACGG
GTTGTCGACG ACCGGGGTAA TATCCGCGAC TCGGCTTCGC CTGAACTAGC CAGCATCCGT
CGGCGTATTA TTGGCGAACA GGCCAATCTG CGCAAACGGC TGGACAGTAT TTTGCGGCAG
GCCCGTCAAA ACGGCTGGAT TCCCGACGAC CTGTCGCTGA CGGTGCGTGG GGGGCGGCTC
GTGATTCCGA TTGCCGCTGA ACACAAACGA AAAATCAAGG GATTTGTCCA CGACGAATCA
CAGACCGGGC AAACCGTATT TCTGGAACCG GCCGAAGTAT TTGACGCCAA CAACGAAATT
CGAGAGTTGG AGTACGAAGA ACGGCGCGAA ATCTACCGGA TTCTCCTTGC CCTCACCGAT
CAGATTCGGC CCCATCTGGA TGATCTCAAA AAAGCCGTAA ACTTTCTGGC CCAAATCGAC
TTCATTCGCG CCAAGGCCAA ACTGGCGGTT CAGTTGGACG CCATTATGCC AAAACTGCAC
GAACGTCCGC TTGTAAACTG GACGAACGCC CGCCACCCGT TGCTCTACCT ATCGTTTTCA
AAACAGGGTA AAACCGTGGT GCCGCTGAGC GTGAAGCTGG ACGAAAAGGC CCGAATCCTA
ATTATTTCCG GGCCGAACGC CGGGGGTAAA TCGGTGTCGT TGAAAACCAT TGGACTCATT
CAGTACATGC TTCAGTGCGG CCTGCTGGTG CCTATGGCCG ATTATGCCGA GATGGGCGTT
TTCCAGAATC TCTTCATCGA CATCGGCGAT GAGCAATCGC TGGAGAACGA CCTAAGTACG
TACTCTTCGC ACTTAACGGC GATGAAGCAG TTTCTGATCG GTGCTAACAA GCGTACCCTC
TTTCTGATTG ACGAATTCGG AACCGGTACC GAACCGGGCC TTGGGGGCGC TATTGCTGAG
TCGATTCTGG AAGACCTTAA TAAATCGGGG GCGTATGGGG TTGTTAACAC GCACTATACG
AACCTAAAAG TGATGGCCGA TAAAACGCCG GGACTCATTA ACGGAGCTAT GCGTTTCGAT
GGTGAACACC TGGAACCGCT TTACAAACTG GAGATCGGTC AGCCGGGATC GTCATTTGCT
TTTGAAATTG CCCAGAAGAT CGGCCTTCCC AAAGGTGTAA TAGATCGGGC TAAAGATAAA
CTGGGCACGC AGCAGGTCAA TTTTGAAAAG TTGCTGAAAG AACTGGATAT TGAAAAGCGC
GTTTTTTCGG AGAAGAATAT CGAAATCGGC ATAAACCAGC GTAAACTCGC CCAGCAACTT
GCCGAATACA CAGCGCTGAA AGAACGGCTG GATAACGACC AGAAAAAGAT CGTTAACGAC
GCTAAACAGA AGGCAAAAGC CCTTGTTCAG GAAGCGAATC AGCGTATTGA AAACACCATC
CGGGAAATAA AAGAGAACAA GGCAGAGCGA GAAACAACCA AACAGGTTCG TCAGGAACTG
GAACGGTTCG AGCAAAAGGA ACTGAAACCC GAGCCTGTCG TTCTGGAAAC TCCGAAACAA
GCCGAAGATG TATTCGAGAA AGATAATGGC GTGATCTCGG CAGGAAGTTA TGTACGAATT
GCCGGACAGA ATACTATTGG CGAAGTACTG GCTATACGGG GAAAAGACGC TGAAATTCGT
ATTGGCGACC TGAAGTCGAA CGTTAAGCTG AACCGGCTCG AAAAAGTAAG CAAAAAGACG
TTCGCAGCCG CTACCGAAGT GCGCGATGAT CGTCCTCGTA GCCAGGGCGT AGATATGAAC
GAGAAGATGC AGAACTTCAG CTTCAACCTC GACATTCGGG GCAAACGTGG CGAAGAAGCC
CTCGGCGAAG TGGATCGTTT TTTCGATGAT GCCCTCATGC TCGGTTACCC TGAGCTGCGC
ATCGTTCATG GTAAAGGCGA TGGCATTCTG CGAACGCTCG TCCGTAATCA CCTGCGTGGC
TACAAGCAGG TGGGCAAGAT GGAGGATGAA CATGCCGACC GGGGTGGTGC GGGCGTAACA
ATTGTAAAGA TGAAATGA
 
Protein sequence
MLYPNTLENK LGFDTIRERL KEACVSPLGQ DYVEKIRFTD NVQLIDKLLR QTTEFKQIVQ 
YEPDFPTSNY IDVRPHLSRA RVEGLALTEA EFFDLKLALR TIQDCLRFLS KREEDNNFPF
LRELAGPVGV DKKLTDALER VVDDRGNIRD SASPELASIR RRIIGEQANL RKRLDSILRQ
ARQNGWIPDD LSLTVRGGRL VIPIAAEHKR KIKGFVHDES QTGQTVFLEP AEVFDANNEI
RELEYEERRE IYRILLALTD QIRPHLDDLK KAVNFLAQID FIRAKAKLAV QLDAIMPKLH
ERPLVNWTNA RHPLLYLSFS KQGKTVVPLS VKLDEKARIL IISGPNAGGK SVSLKTIGLI
QYMLQCGLLV PMADYAEMGV FQNLFIDIGD EQSLENDLST YSSHLTAMKQ FLIGANKRTL
FLIDEFGTGT EPGLGGAIAE SILEDLNKSG AYGVVNTHYT NLKVMADKTP GLINGAMRFD
GEHLEPLYKL EIGQPGSSFA FEIAQKIGLP KGVIDRAKDK LGTQQVNFEK LLKELDIEKR
VFSEKNIEIG INQRKLAQQL AEYTALKERL DNDQKKIVND AKQKAKALVQ EANQRIENTI
REIKENKAER ETTKQVRQEL ERFEQKELKP EPVVLETPKQ AEDVFEKDNG VISAGSYVRI
AGQNTIGEVL AIRGKDAEIR IGDLKSNVKL NRLEKVSKKT FAAATEVRDD RPRSQGVDMN
EKMQNFSFNL DIRGKRGEEA LGEVDRFFDD ALMLGYPELR IVHGKGDGIL RTLVRNHLRG
YKQVGKMEDE HADRGGAGVT IVKMK