Gene Slin_5894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5894 
Symbol 
ID8729672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7136893 
End bp7139982 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content50% 
IMG OID 
ProductDNA repair ATPase-like protein 
Protein accessionYP_003390656 
Protein GI284040726 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0510725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00862936 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATACCCA TTAAACTGTC AATTCAAGGA CTCTATTCAT ATCAGGACTT ACAGGAAATT 
GATTTCCAGC AATTAATTGG TTCGAGTGTT TTTGGCATCT TTGGCAAGGT GGGCAGTGGC
AAAACGTCGC TGCTGGAAGC TATAAGCTTT GCGTTATACG GCGAAACGGA ACGGCTGAAC
AGTCGCGACA ATCGTCAGTA TAATATGATG AATTTAAAGT CGAAGCACCT GATCATCGAC
TTTCAGTTTC AGGCCGGGCC GGAGCAGCAG CTATACAAGT TTGTTTACGA AGCGAAACGT
CATCCCAAAA AGCATCATGA AATAGGGCCT GGCGAACGCC GGATGTTTAT TCGCCAGGGC
GATGAATGGC AACCCATTGG GAATGAGAAA GAAGATATAG CCGTTCTGTC GAAGCAGATT
CTGGGTCTGG ACTACGATAA TTTCAAGCGG ACCATCATCA TTCCCCAAAA CCAGTTTCGG
GAGTTTCTGG AGCTTAGTCC CACCGAACGG ACGAAGATGA TGAACCAGTT GTTCAAGCTC
GACCAGTATG ACCTGGCTGC CCGAGTGAGC AAACTGAGTA AAGCGAATGA CGACCAACTG
GCCGAACTTC GCGGGTTGTT ATCCCCTCTG GAAGCTGTAA CACCCGAAAC CATTGAGCAG
GCAAAGACCG ACATTACTGT TATTTCTGAG TCTCTGAGCC GGAAGGAAGC CGAAATCAAT
GACTTGTCTC CTGCCGAAAA ACGATTACTC GAAAACCAGC AACGTAGCCA AACACTGGTA
TCGACTCAAC AGGAACTCAC CCAGGTTCTT AACCGGGCAC CCCTCTATCA GCAACGGGAA
CAGGCACTGG CCATTTACGA AACCTGTCTG CTCGTCTTCC AGGCCGACTT TGCTAACCTC
GATAAGCTTA ATTCCCGCAA AACCAGACTG ATAGAACAGG AACAGGCGGC CCAGCGGCAA
CTCCTGTTCG TAACGAAGCG GTTGGTAGGC TTACGTAGTT TATACGAAGC GGCCAAACAG
GCTTACGAAA CGCGCGACGA ACTACAGCAG AAAATTGACG AACTGGATAC GGTACAGCAA
ATCCGAACGC TGCAACAAAC CATCGGCCAG CAAACCCGTA ATCGCGATAC GCTGGCCAGT
CAGCTCCACC ACCAAGCGAC TTTAATTGAA CGGCACAAGG CAACCCGCGC CAAACACCGG
CTCATTCTGG ATAACGGCAT CGGTCGGACG TCCGATCTAG AGCGGCTTTA TAAAGTTAAG
AACTGGTTTA CGGCTTATAA ACCTCTGAAG AAACAAGCCG ATGATCTTCA ACTGGCCGTC
GACAACTATG ACCGGGCGAT TGAAAAACTG AAACAGCAGA AGAACGATGC GCTGTTGGGT
TTTCCACCCG ATTGGACTGA CTTAACGCTC AAAACCTTAC CGGATGCTAT TGAGGAGGCT
CTTGAGAAAT TTAAAGAAGT TCGGGAGGAC CGCGAGGACA AACACCGGAA ACTACTCGTT
CAGGATGAGT TACGAAAATA TGCCGATGCA CTGAATGAAG GACAACCCTG CCCACTTTGC
GGATCAACGC ACCATCCTAA CCGACACAAG GGCGATGAGG AAAGCATTGA TGTAAAAGGG
AGCGAAGCCG GTTTACAGAA AGTAATTCAG CGAATAGAGG TCACCAGTAA GCTACAGCTT
GCCATTAAGG AGCTGACGAC AAAACTGCGG AGCGAACACG ATAACGGCAA ACGGCTCATG
CAGGAGCGTA CCGAAATCGT CCGGCAGTTG ACCGAGCACG AAGATAATTT TATGTGGCCG
GAATTCTCGA AAGAGCAGGA AGGTGAGGTG ATAAATGCCA TCCAAAAAGA GAGCGACGCG
CAGAAGCAAA TCCTGGACGC GCAAAAGGCG ATTCAGGATC TCGAAAAACT TATAGGAGAA
GCCGAAGCGG CCCATAACGA TCTAAACAAA AAAGTGGTCG ACGCGGGTAA TGCCATTGCC
GGATTGAGCG GGCAATTGAA AACAGCGGCC GAATCGCTGG AGCATTTCCG GCTGGAAGAA
GTGAAAAACT GGGGACTTGA CCAGATTGCT GACCTACGCG AATCACTCAC CAAAACCTAC
CGCCAGGCAA AAATTCATTT CGACGATACG GCAAAGCAGC AGGGTGAAGC CGAAAAAGAA
CAGGCAACGG CTGAAGAACA AATTCAGCAA TTCCATAATC AATTGGCCGA AATAGTCGCT
GAACAGGAAG GTCTGGAAGT CACCATCGCG CAGAATCTGG CCAACCAAAA CCTGACCCGC
AAGCAGGTGA AACAGATTCT ACAATCTAAC CTAAATATCG GTCAGGAACG ACAGCAGATC
AACGAATACA ACGAAAAGCG AAGCAGTCTG CAAGGACAAG TCGATACGCT ACAGACTGAG
CTAGCCGAAC AGCCGTTCGA CCCGGTCGAA TTAGCGACGG TTCAACAACA GTTAGCAACC
TTACAGACCG AGAAGGATGC GCTGAACAAA GAACATGGCC GTGCTACCAG TGTGCTAGCT
GCCCTCGAAA GTCAGTGGCA GCAAAAGCTG GAACACCAGA AGCGGCACGA TGAACTTGAC
CTCCGAAAGC AGGACCTGAA GAAGATGGAC GAGATGTTCA GAGCGCAGGG CTTCGTGAAT
TACGTGTCGT CGGTTTATCT GAAAAACCTC TGTGAATCAG CCAATGAACG ATTCTTTAAG
CTGACCAATA ATCAGCTGCG GCTCGAACTG GACGACAAAA ACAATTTCCT CGTGCGCGAC
TTCCTCAACA GCGGGGAGGT ACGGAGTGTA AAAACGCTCT CGGGCGGTCA GACCTTCCAG
GCAGCTCTGT CGCTGGCGCT GGCGTTATCG GATAACATTC AACATCTGAC GAAAGCAAAG
CAAAATCTGT TTTTCCTGGA TGAAGGCTTT GGTACGCTCG ACAAAGATTC CCTGCAAACG
GTCTTCAAAA CGCTGAAAGC CCTCCGCGCC GAAAATCGGG TCGTTGGCAT TATCTCGCAC
GTGGAAGAGC TTCAGCAGGA AGTGGAACAT TTTATTCGGG CAGAATCGAC CGAGAATGGT
AGCCGGATTA TTCGGAGCTG GGAGAGTTGA
 
Protein sequence
MIPIKLSIQG LYSYQDLQEI DFQQLIGSSV FGIFGKVGSG KTSLLEAISF ALYGETERLN 
SRDNRQYNMM NLKSKHLIID FQFQAGPEQQ LYKFVYEAKR HPKKHHEIGP GERRMFIRQG
DEWQPIGNEK EDIAVLSKQI LGLDYDNFKR TIIIPQNQFR EFLELSPTER TKMMNQLFKL
DQYDLAARVS KLSKANDDQL AELRGLLSPL EAVTPETIEQ AKTDITVISE SLSRKEAEIN
DLSPAEKRLL ENQQRSQTLV STQQELTQVL NRAPLYQQRE QALAIYETCL LVFQADFANL
DKLNSRKTRL IEQEQAAQRQ LLFVTKRLVG LRSLYEAAKQ AYETRDELQQ KIDELDTVQQ
IRTLQQTIGQ QTRNRDTLAS QLHHQATLIE RHKATRAKHR LILDNGIGRT SDLERLYKVK
NWFTAYKPLK KQADDLQLAV DNYDRAIEKL KQQKNDALLG FPPDWTDLTL KTLPDAIEEA
LEKFKEVRED REDKHRKLLV QDELRKYADA LNEGQPCPLC GSTHHPNRHK GDEESIDVKG
SEAGLQKVIQ RIEVTSKLQL AIKELTTKLR SEHDNGKRLM QERTEIVRQL TEHEDNFMWP
EFSKEQEGEV INAIQKESDA QKQILDAQKA IQDLEKLIGE AEAAHNDLNK KVVDAGNAIA
GLSGQLKTAA ESLEHFRLEE VKNWGLDQIA DLRESLTKTY RQAKIHFDDT AKQQGEAEKE
QATAEEQIQQ FHNQLAEIVA EQEGLEVTIA QNLANQNLTR KQVKQILQSN LNIGQERQQI
NEYNEKRSSL QGQVDTLQTE LAEQPFDPVE LATVQQQLAT LQTEKDALNK EHGRATSVLA
ALESQWQQKL EHQKRHDELD LRKQDLKKMD EMFRAQGFVN YVSSVYLKNL CESANERFFK
LTNNQLRLEL DDKNNFLVRD FLNSGEVRSV KTLSGGQTFQ AALSLALALS DNIQHLTKAK
QNLFFLDEGF GTLDKDSLQT VFKTLKALRA ENRVVGIISH VEELQQEVEH FIRAESTENG
SRIIRSWES