Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_6007 |
Symbol | |
ID | 8729788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 7283481 |
End bp | 7286528 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | Endonuclease/exonuclease/phosphatase |
Protein accession | YP_003390768 |
Protein GI | 284040838 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00134837 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACACG TTTTACAGGT GCTTTTAGCC GCTTCTTTGT GGCTAATGAC AGCTTCTGGC TTTGCCCAGA CAACGCTGGC CCAATGGAAT TTTGAAGGGG CACCCGGCTC GGTGACCGCT GCTGCCGCAT CGACAATCAC GGCCGATAAT GCCGCTTTTG GATCGGGGGT AACCAGCGTC ACGTTTGTAG CCGGTAATGG GGGAGGCCGG GCCTATACCG GAACGGGCTG GAATGTAACA AGCCCGGACG TCAACAAGTA TATCCAGATT AGCCTCGGGC CGGTGGCCAG TTATACCATG GCCCTGTCGC AGCTGAGTTT CGACGAGCAG CGGTCCGGTA CAGGACCAAC AACGTGGGTG TTACGGTCGA GTCTGGACAA TTTTACTGCC GATATCAACA CGACGCCCAC ATCGACCACC TCGCCTACGT CAAATGGCTC ATTTACATCG CGGGTGGTGT CGTTGAGTGC CAATACTGCC TTTCAGAATT TAAGTACCCC CATCACCTTT CGGATTTATG GTTACGGAGC CAGCGGGTCG GGCGGAACCT GGCGATTCGA TAACATTCAG GTGAGCGGCA CCACCGGCCC CAGCGATCCG ACAGCACCGC TTCTGTCGGT TATACCGGGT TCGCTGCCCG GCTTCGCTAC GGTGCCCGGT ACGCCTTCAA CGGCAAAATC GTATACGGTG TCCGGGGTCA ATCTTACGAG TGATCTGACC ATAACTGCCC CAACGGGCTA TGAAGTCAGC AAAGACAACG GTACCAGTTA TGCTACTGTG CAGACGTTGA CGCAGTCGGG CGGAATCATT GCGCCTACAG CCATCAGTGT CCGGCTTACG GGGGCGGGCA ATGGTACTGT AAACGGCACA ATCACGAATG TGGCCGGTGC TGTTTCTAAA AATGTGACGG TGAGTGGAAG CGTCGATGTG TCGAATGGAC CAGTGCCTAT CGCTACAGCC CGGGGGCAGG TCGGTACAAC GGTTATCATT CAGGGGCGCG TAACGGTGAG CAGCCAGTTT GGCGGCAAGC TGTTTTACAT ACAGGATGCT ACCGGCAGCA TTGCCGTTTA CGACCCTACC ACAAGCTACG GAAATCAGGT ACAACTCGGC GATCTGGTGC AGGTGAGCGG ACCGGTGGCG CTTTTTCAGG GGAAAAAAGA AATCAACGGG GTTGCGACCT TTCTGAAGGT CAATGATACA AATCAGCCCG TTACCCCTCA GGTTATTACG GCTACGCAGC TCACCTCGGG TGCCTTCGAA GGGCAGTTGG TAACGGTTCA GAATGCAACC ATTGGCGGCT CGGGGGCAAC GTTTCAGGGC GGAGCCGCCG GGACGTATCC GCTCACGACC AGCGATGGGA CCGCCGAGTT GTTCGTAACG AGTGCCTCCG ATCTGGTGGG GGCTACCAAA CCTACGGGTG CACTGACCGT TACGGGCATT GCCGATCGTT ACATACCGAC CGATGCGTCG AAGAATGTCG TTCAGCTTAA CCCACGCGCC ATCTTCGATA TTCCCGGTTC GGAAGTGCCC CCGCCACCAC CAACCATAAC GACCTGCCCG GCCAATCGGA CCATCACCGA TAATGACCAG ACATTGAGTG TAGTAAGCTG GAACCTCGAA TGGTACGGTT TTGATGGCGG TTCGTACACT TGTACCAATG GCTCCAGAAC GTACGCCGAT AATGGCCCGA CAAATGAGAC GCTTCAGGCC CAGAATGTTC GCTCTGTAAT GGACGCGTTC AACGCTGACA TTTATATTTT GCAGGAAGTG AGCGATAAGA ACCTGCTTGT GACCAATACC CCGGCCGGCT ACGCGCTGAG CTGTTCCGAT CAATACACGT CGTACTTCTT TCAGGACTTG TGCGATGCCA ACGGCAATCC GCAGGGGTTC AACCCAACGA GCCTCAACCA GAAAGTCTGC GTAATGTACA AAACGAGCGT TGTTTCGATG ATTCCCGCCG AAAGCAAACC GTTGCTCACT GACAAGTACA GCTATACCAC AACACCACGC AGCGATGCCT GGGCGTCGGG TCGGTTACCA TACCTGTTTG TGGCGAACGT AACCGTCGAT GGGCAAACCC GGAAACTATA CATCGTTGAT ATTCATGCCA AATCCGGATC AGCACAAGCC GATTACAACC GCCGGAAACA GGACATCATT GACCTGAAAG CTGAACTGGA TGCCAATTAT GGTAATGTAA ACCTCATCAT GGCTGGTGAC TACAACGACG ATGTCGATCA GTCCATTGCC GCCGGTAATC CTTCGTCGTA CGCCAACTTC GTGAGCGACC CCAACTATAC CGTTATTTCC AGTGAGTTGA GCAGCAGCAA CTGTAACACC GATGCGAACT TTACCGACGC CATCGACCAC ATTACGGTAT CCAACGAGCT GGCGTCGTCT TACGTAGCCG GTTCGGTGGC ATCGGTTCGG CCAGCCGTTG TCAATTACGC CCTGACAACC TCCGACCACT ATCCAACCTT CGCCCGTTTC ACACTAGCCA GTCCTTTACC GGTGCGCCTG ACTTCCTTTG CTGCAAAGCC AGTTGGCGAG ACGGTGGGTA TATCCTGGAC GACGGCCAAC GAAACGAACA GTGCTTATTT CGAGGTAGAG CGTAGTGTCG ATGCACGTGA GTTCGCGTCT ATTGGCCGGG TAGCGGCTGC GGGAGATGCC CAATCAATAA AAACCTATGG CCTTGTCGAT CAACACCCCC TGAGCGGTAC CAATTACTAC CGGTTGAAAC AGGTTGACCT GGATGGCAAG ACGGCTTACT CGACAATTGT ATCTGTCGTG ATGGATAATC TAACGCCCGC TATGGAACTG CTGGGGAATC CGGTCGACAA TCAGGCGATT CGGGTGGCTG TTCGTAACCT GCCGAACGCT GTCTATCGTT TAACGACCCT TACCGGACGT GAGCTTCCTG TGCAGGGCCA AACTCAGGCG GATGGGTCGA TGCTGCTGAC AACCGCACAG GCTCTCAGCC CCGGTGTGTA CCTGCTCCGG GCTGATTCGG GAACTACCCG CATTATGCGG AAAGTGGTAA TACGGTAA
|
Protein sequence | MKHVLQVLLA ASLWLMTASG FAQTTLAQWN FEGAPGSVTA AAASTITADN AAFGSGVTSV TFVAGNGGGR AYTGTGWNVT SPDVNKYIQI SLGPVASYTM ALSQLSFDEQ RSGTGPTTWV LRSSLDNFTA DINTTPTSTT SPTSNGSFTS RVVSLSANTA FQNLSTPITF RIYGYGASGS GGTWRFDNIQ VSGTTGPSDP TAPLLSVIPG SLPGFATVPG TPSTAKSYTV SGVNLTSDLT ITAPTGYEVS KDNGTSYATV QTLTQSGGII APTAISVRLT GAGNGTVNGT ITNVAGAVSK NVTVSGSVDV SNGPVPIATA RGQVGTTVII QGRVTVSSQF GGKLFYIQDA TGSIAVYDPT TSYGNQVQLG DLVQVSGPVA LFQGKKEING VATFLKVNDT NQPVTPQVIT ATQLTSGAFE GQLVTVQNAT IGGSGATFQG GAAGTYPLTT SDGTAELFVT SASDLVGATK PTGALTVTGI ADRYIPTDAS KNVVQLNPRA IFDIPGSEVP PPPPTITTCP ANRTITDNDQ TLSVVSWNLE WYGFDGGSYT CTNGSRTYAD NGPTNETLQA QNVRSVMDAF NADIYILQEV SDKNLLVTNT PAGYALSCSD QYTSYFFQDL CDANGNPQGF NPTSLNQKVC VMYKTSVVSM IPAESKPLLT DKYSYTTTPR SDAWASGRLP YLFVANVTVD GQTRKLYIVD IHAKSGSAQA DYNRRKQDII DLKAELDANY GNVNLIMAGD YNDDVDQSIA AGNPSSYANF VSDPNYTVIS SELSSSNCNT DANFTDAIDH ITVSNELASS YVAGSVASVR PAVVNYALTT SDHYPTFARF TLASPLPVRL TSFAAKPVGE TVGISWTTAN ETNSAYFEVE RSVDAREFAS IGRVAAAGDA QSIKTYGLVD QHPLSGTNYY RLKQVDLDGK TAYSTIVSVV MDNLTPAMEL LGNPVDNQAI RVAVRNLPNA VYRLTTLTGR ELPVQGQTQA DGSMLLTTAQ ALSPGVYLLR ADSGTTRIMR KVVIR
|
| |