Gene Slin_0543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0543 
Symbol 
ID8724271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp664815 
End bp667889 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content54% 
IMG OID 
ProductDNA polymerase I 
Protein accessionYP_003385406 
Protein GI284035476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.72991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.283494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAC CAACCAAAAA ACTGTTTTTA TTAGACGCGC TGGCACTTAT TTACCGCGCT 
CACTTCGCCT TTAGCAAATC GCCCCGTATC TCATCGAGGG GTATCAATAC GTCGGCGGTG
TTCGGGTTTA TGAACGCCAT GATCGAGGTG TTAACGAAAG AGAAACCGAC GCACATCGGT
GTCGCTTTCG ACTCGGCGAA AAAGACATTC CGGCACGAGT CCTTTCCGAT GTACAAGGCC
AATCGGCAAT CGCAGCCCGA AGATATCAGC GTGGCTATGC CATACATCAA GCAGATTGTG
GAAGCGATGC ATATCCCCAT GCTGATTCTG GACGGATACG AAGCCGACGA TATCATTGGC
ACTATCGCAA AAAAAGCCGC TCTGGCTGAT TTTGAGGTGT ATATGATGAC GCCTGACAAA
GATTACGGGC AGTTGGTGGA AGAACACATT CATATTTACA AGCCCGCCTT CATGGGCAAA
CCCGCCGAAA AGCTGGGCGT AACGGAAGTG CTCGAACGCT GGCAGATCGA GCGTATCGAG
CAGGTAACGG ATATGCTCGG GCTGATGGGC GATTCGGTCG ATAACATCCC CGGTATTCCG
GGCATCGGCG AAAAAACGGC GCAGAAGCTC ATCGCTGACT TCGGTTCGGT CGAGAACCTC
ATTGCCCGCG CCGATGAACT GAAAGGGAAA CTGAAAGAAA ACGTTGTCAA TTTTGCCCAG
CAGGGACTTA TGTCGAAAGA ACTGGCCACC ATTCACCTGG ATGTGCCGGT GCCATTCGAT
GAAGAACACC TGCGCCATAC CGAATACGAT AAACCCCGGT TGGCCGCCTT GCTGGACGAA
CTCGAATTTC GGCAGATGAA GACCCGCCTG CTGGGTGGCA GTTACGACGA GAAACCGTTG
CCAACGGCTT TCCAGGCTCC CGGTTCAGCA CAGATGAACC TGTTCGACTC ACCTGGTGGC
GACAGCCCTG CCTTCCTGCC GTTTCCCAAT ATGGGGTCAA ACAATCCATC GGGGGCAAGT
GATCTGCCAT TTGATTTTGG TAGCGAGACG ACTCCAACTG CTGCACCCGT TGAGAAGCCC
AAAGGCAAGC GCACCGCTGT TAAAGTACCC GTTGCGTCGG GTTCTAAAGC GACTCCCAAA
GGCGTGACGG ATACCATTAC GGCGGACGAT AAACTGGGCG CGGAGGTGAG TGAAACGGAC
GCCCCCGCTT ACCTCGACGT GTATCCCGAT TACGAAATCG ACGAAAATCA GCCCGAACGA
CGCAAGACTA TTTTGTCGGT CAAGCACGAC TACCGGCTGG TCGATACTGC CGAACTGCGG
GCTAGTCTGG TACACTACTT AAGCCAGCAG GAGAGCCTTT GCTTCGACTC CGAAACGACT
GCTATCGACC CCGTTGAAGC CGATCTTGTT GGGTTGTCCT TCGCGTATCG TGCGGGCGAA
GCCTTCTACG TACCCGTTCC CGCCGACCGG GCCGAAGCGC AGGCGATTGT TGATCAGTTC
AAACCCGTTT TCGAGAATCC GACCATTGAG AAGGTTGGAC AGAACCTGAA GTACGACCTG
CTGATGCTGA AAAAGTATGG CGTGGAAGTA CAGGGTAAGC TGTTCGATAC CATGATTGCC
CATTACCTGA TTGAGCCCGA AATGCGGCAC AACATGGACA TGATGGCCAT GACCTACCTG
AACTATAGCC CGGTTGAGAT TGAAGCCTTG ATCGGCAAGA AAGGGAAAGG GCAGTTGACC
ATGCGCGACG TGGACATTCA GAAAGTGGTG GATTATGCGG GCGAGGATGC CGATATTACC
CTGCAACTGA AACACGCATT CGCCCCCCGG CTCGAAAAAG ATAACCTGCA CAAACTCTTC
GATCAGGTCG AAATGCCGCT CGTTCAAGTG CTCACGGATC TGGAACTGGA GGGTATTAAA
ATTGACACCA ACGCGCTATC TGAATTGTCG GCCACGCTGG AGGTCGATAT GCGGCAGGTG
CAGCAGGAAA TTTTTGAGAT TGCCGGTGAG TCGTTCAACA TCGGCTCGCC GAAGCAATTG
GGCGAAGTGC TGTTCGATAA ACTCAAGCTC GACAAGAACG CCAAAAAGAC CAAAACCGGG
CAGTACGCCA CGGGCGAGGA AATCCTGTCG AAGCTCGAAG CCGAGCACGA AATAGCCCGC
AAAATTCTCG ATTACCGCGA GTTGATCAAA CTTAAGAACA CCTACGTCGA TGCGCTGCCG
TTGTTGATCA GCAAGCGTAC CGGTCGGATT CATACGTCGT TCAATCAGGC GGTAGCGGCC
ACTGGTCGGC TGTCATCGAC CAATCCTAAC TTGCAAAACA TCCCGATTCG GACGCCACGC
GGGCAGGAAA TCCGGAAAGC GTTCGTACCG CGCGGACCGG AGTTTGTGAT CATGTCAGCC
GACTATTCGC AGATCGAACT ACGAATTATG GCCGCTTTCA GTGGTGATCA GACTATGCTC
GAAGCCTTCA ACAACGGCGT CGATATTCAT ACCCAAACAG CCAGCAAGGT ATTCCATGTG
GGGCTCGACG AAGTAACCAG CGACATGCGT CGGAAGGCCA AAACCATCAA TTTTGGTATC
ATTTACGGCA TATCGTCCTT TGGCCTGGCG CAACGGCTCA AGATTCCGCG CAAAGAGGCA
GGGCAGATCA TTGAAGAGTA TTTCGCGGGT TTCCCGGCGG TAAAAGACTA CATCGACCAG
TGCATCGAAA AAGCACGCGG CTTTGGCTAT GCCGAAACCA TACTGGGTCG TCGGCGGTAC
CTGCGCGACA TCAACTCCCG CAACCAGACC GACCGTATGT TTGCCGAGCG TAACGCCGTG
AACGCTCCCA TTCAGGGCAG TGCTGCCGAC ATGCTCAAGA TTGCCATGAT CCAGATTCAC
GAGTTCATGC AGGCCGAGCG GTTGAAGTCC AAAATGATCC TGACCGTACA CGACGAACTC
GTCTTCGACG CCCACCGCGA CGAAATCGAC TTGTTGCGCG TGCGTGTAGA CGAGATCATG
AAGAACGCCA TCCCGATGGG TGTAAAGATG GAAACTGGCA TCGGCACGGG CGAGAACTGG
TTGTTGGCGC ACTAA
 
Protein sequence
MAKPTKKLFL LDALALIYRA HFAFSKSPRI SSRGINTSAV FGFMNAMIEV LTKEKPTHIG 
VAFDSAKKTF RHESFPMYKA NRQSQPEDIS VAMPYIKQIV EAMHIPMLIL DGYEADDIIG
TIAKKAALAD FEVYMMTPDK DYGQLVEEHI HIYKPAFMGK PAEKLGVTEV LERWQIERIE
QVTDMLGLMG DSVDNIPGIP GIGEKTAQKL IADFGSVENL IARADELKGK LKENVVNFAQ
QGLMSKELAT IHLDVPVPFD EEHLRHTEYD KPRLAALLDE LEFRQMKTRL LGGSYDEKPL
PTAFQAPGSA QMNLFDSPGG DSPAFLPFPN MGSNNPSGAS DLPFDFGSET TPTAAPVEKP
KGKRTAVKVP VASGSKATPK GVTDTITADD KLGAEVSETD APAYLDVYPD YEIDENQPER
RKTILSVKHD YRLVDTAELR ASLVHYLSQQ ESLCFDSETT AIDPVEADLV GLSFAYRAGE
AFYVPVPADR AEAQAIVDQF KPVFENPTIE KVGQNLKYDL LMLKKYGVEV QGKLFDTMIA
HYLIEPEMRH NMDMMAMTYL NYSPVEIEAL IGKKGKGQLT MRDVDIQKVV DYAGEDADIT
LQLKHAFAPR LEKDNLHKLF DQVEMPLVQV LTDLELEGIK IDTNALSELS ATLEVDMRQV
QQEIFEIAGE SFNIGSPKQL GEVLFDKLKL DKNAKKTKTG QYATGEEILS KLEAEHEIAR
KILDYRELIK LKNTYVDALP LLISKRTGRI HTSFNQAVAA TGRLSSTNPN LQNIPIRTPR
GQEIRKAFVP RGPEFVIMSA DYSQIELRIM AAFSGDQTML EAFNNGVDIH TQTASKVFHV
GLDEVTSDMR RKAKTINFGI IYGISSFGLA QRLKIPRKEA GQIIEEYFAG FPAVKDYIDQ
CIEKARGFGY AETILGRRRY LRDINSRNQT DRMFAERNAV NAPIQGSAAD MLKIAMIQIH
EFMQAERLKS KMILTVHDEL VFDAHRDEID LLRVRVDEIM KNAIPMGVKM ETGIGTGENW
LLAH