Gene Information |
Locus tag | Sterm_2205 |
Symbol | |
ID | 8597670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | + |
Start bp | 2342796 |
End bp | 2345777 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003308990 |
Protein GI | 269120813 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
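The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed values) and that the final codon of the CDS is a stop codon. The function names are hypothetical, chosen only for illustration:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, ASSUMING the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2342796, 2345777)
print(length)                     # 2982
print(protein_length_aa(length))  # 993
```

Both results agree with the Gene Length (2982 bp) and Protein Length (993 aa) rows of the table.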
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTAT TAGCAAAAAA AGGAGAAAAA ACTGGAATCA CAGTACCTTA TGATCAATAT TCCAGTATGC TAAAGTACGA AATTCTAGCA TTTTTCTCCC AAGTTCTTGA TGAAAAAATT TCTTTGAATG ATTATGATAA TGCTGTAAAT ATGACTGAAT CAGTTTTAAA TACAAAGTTT GAACTGCCTT TGCAGTTTAA CCGGAGTAAT TGTAATAATA GCAATTTTCT TATTGTTAAT CAAAAAACAA AATTTAAAAA CTTTTTTACA TATTTGAAGG AGGAGCTGAA CAGCTGTGAT TCTTTTTGTT TTATCGTGAG TTTTATAAAA TTTTCAGGTA TACAGCTTCT TATAAATACA CTTGACGAAT TAAAAAATAG AGGAATAAAA GGGAAAATAG TCACTTCGGT ATATTTGAAC ATAACAGATC CAAAAGCTCT GAGAAAGCTT GCAGAATATG ATAATTTGGA AATAAAGATT TATAACAATA CACGGGAGAG TTTTCATACA AAAGCTTATT TATTTCACAG AAAAGAATAC AGCTCATGTA TTATAGGCTC TTCAAATCTA AGTCAGTCTG CTTTATATTC CGGTGAGGAA TGGAATGTAC GACTGGTAAA AGATAACTAT CTGGAAATTT TTGATCAGTC GTATGAGCAG TTTGAAAAAA TATGGGATAG TAATGAAGCA ATAGAATTGA ATTCAAAATT TATTGATATG TATGAAAATT TCAGAAACAA AAGCGGAAAT ATTGAAACAT TCGATTTTAA AAAAGAAGAA ACTGAAAAAG AAATATTTAT GCCTAATAAA ATGCAGTCAG AGCTTCTTGA GAAATTAAAA CTCACAAGAG AATTCGGAAA TAAAAAAGGA TTGATTGTAG CTGCTACGGG GACTGGTAAA ACTTATCTCG CTGCTATGGA TATTCTAAAA ATGAATCCCA AGTCATTTTT ATTTATAGCA CACAGGGAAG AGCTTATAAA TAATGCTTTT AATGTTTTCT CGAAAATTCT TCCTTATGAT AAAAATGAAT ATGGCTTTCT TACAGGCAGT GAAAAAAATT ATAATAAAAG ATTTATGTTT TCTACAATTC AATCTTTGTA TAAAAATACG GAATATTTTT CAAAAGATGC TTTCGATTAC ATTATTATAG ATGAGTTTCA TCATTCAAAA GCTTCCACTT ATGAAGCTGT AATCAATTAT TTTAATCCTT TTTTTATGTT AGGGCTTACT GCAACACCTG AAAGAATGGA CGGGAAAGAT ATTTTGGAAC TGTGTGATTA TAATCTGATT GGAGAGATGG GTCTTCGTGA AGCTCTTGAA TATGACTTAC TCTCTCCTTT TCATTATTTT GGAGTAATTG ACGATACTGT GGATTATGAG CAGATCCCTT TTAGTAACGG AAAATATGAT GATAAAATAC TTGGTGAAAA ATTAAGCATA CCAAAAAGGG TAGACTTTAT TCATAATAAA ATAGAGAAAA TAGGTTTTGA CGGAGACAGG GTAAAATCTA TTGCTTTTTG TGCGAATATA AAACACGCAG AGTTCATGAA AGAGCAGTTC AGACTGAAAG GTTATACGAC AGAATCAATC ACTGCGAAAG ACAGTCAGCT GAAAAGAAAA GAAGTAATAA ATGCTTTTCA AACAGGGGAA ATCGAAATTT TGTGTGTTGT GGATATTTTT AACGAGGGTA TTGATATACC TGATGTTAAT CTTTTATTAT TTCTAAGGCC TACTATGTCA TCTACAATAT TTATACAGCA GCTGGGACGC GGTCTCAGAA AAGTTAAGAA TAAAGATTTT 
GTAACTATTT TGGATTTTAT AGGGAATCAT AAAAAGGACT ACATAATTAC TCAGGCTTTT TCAGAGAATA CTCTTAATGA AAAAGACAGA CTCCTGAATG AAGTCAGAAA TCAATTTTCT GATATTCCCG GAGCATCTTA TATTGAGCTT GACAGAATAT GCCAGGATAG AATAATATCT AAGATAGAGA ATTATAATAG TTTGAGCAGA GACAATATTG TTTCTGAATA TCTGGATTTT AAAAATGAAA TTGGAAGAGA AATAGACATT ATTGATTTTA AAGATAATAC AGAATTATTC CTAAGACTGA AAAATAAGTT CGGCTCTTTT GTTAAAACAC AAAAATTAAT CGAAAAACTG GATTATTATT TTAGTGATGA AGAGGAGAAA ATCTTTGAAA TTTTAGAAAA AAATCTTAGT ATAAATTATC CTTATGAAAC ACTTATTATC TGGCTTTTGC ACAGTCTTGA CAGAGTTTCT GTCAGTGATG TCATAGAAAA ATTTGAATCT GTATTTTTTG TCAAAATAAA TGAAAGAATA CAGTATAATC TTATTATAAG AGCAATGAAA GAGCTAAGTG AAAATGATTT ATTTATTTTT AATCCGGAAA CTAACATAAT TTCTTTAAAA AATATTGATT TTACTGCATT TTATAAAAAA CGACTGATTG GTCTAATTGA GCTTTTTATT TTGAAATTTA AAAAAGAAAT TGATATAAAT GAATTCAATA ATAACATCCT AGTAAAATAC AAAGAGTATT CCAGAGTAGA ACTGCAGATA CTGCTTGATT CTAATGCACA AAAAGGATCA TGGCGTGCCG GTTACTCAGT GTCTAGAGAA CATGTATGTC TGTTTATTAC ACTAAATAAA TCTCTTGTAC AGAAGGAAGA GCTGAAGTAT GATAATTATT TTCACAGACA GGACATTGTT CAGTGGATAA GTCAGAGCAA AACAAGGCAT GATTCTAAAA TAGGACAGAT GTATGTAAAA CATAAAGAGA TGAATATGAA AGTTCATATT TTTATACGAA AGGAACCTGT TTTGGAAAAC GGGACAGCGG CACCTTTTAC ATATCTGGGA GAGTCTGAAT ATTTCAGCAG TCATGGTGAC AAGCCTATGT ATATGTTATG GAAGCTTCAT TATCCCGTTC CAAACGAGCT GTTTATTGAT TTTACAGTTT GA
|
Protein sequence | MSVLAKKGEK TGITVPYDQY SSMLKYEILA FFSQVLDEKI SLNDYDNAVN MTESVLNTKF ELPLQFNRSN CNNSNFLIVN QKTKFKNFFT YLKEELNSCD SFCFIVSFIK FSGIQLLINT LDELKNRGIK GKIVTSVYLN ITDPKALRKL AEYDNLEIKI YNNTRESFHT KAYLFHRKEY SSCIIGSSNL SQSALYSGEE WNVRLVKDNY LEIFDQSYEQ FEKIWDSNEA IELNSKFIDM YENFRNKSGN IETFDFKKEE TEKEIFMPNK MQSELLEKLK LTREFGNKKG LIVAATGTGK TYLAAMDILK MNPKSFLFIA HREELINNAF NVFSKILPYD KNEYGFLTGS EKNYNKRFMF STIQSLYKNT EYFSKDAFDY IIIDEFHHSK ASTYEAVINY FNPFFMLGLT ATPERMDGKD ILELCDYNLI GEMGLREALE YDLLSPFHYF GVIDDTVDYE QIPFSNGKYD DKILGEKLSI PKRVDFIHNK IEKIGFDGDR VKSIAFCANI KHAEFMKEQF RLKGYTTESI TAKDSQLKRK EVINAFQTGE IEILCVVDIF NEGIDIPDVN LLLFLRPTMS STIFIQQLGR GLRKVKNKDF VTILDFIGNH KKDYIITQAF SENTLNEKDR LLNEVRNQFS DIPGASYIEL DRICQDRIIS KIENYNSLSR DNIVSEYLDF KNEIGREIDI IDFKDNTELF LRLKNKFGSF VKTQKLIEKL DYYFSDEEEK IFEILEKNLS INYPYETLII WLLHSLDRVS VSDVIEKFES VFFVKINERI QYNLIIRAMK ELSENDLFIF NPETNIISLK NIDFTAFYKK RLIGLIELFI LKFKKEIDIN EFNNNILVKY KEYSRVELQI LLDSNAQKGS WRAGYSVSRE HVCLFITLNK SLVQKEELKY DNYFHRQDIV QWISQSKTRH DSKIGQMYVK HKEMNMKVHI FIRKEPVLEN GTAAPFTYLG ESEYFSSHGD KPMYMLWKLH YPVPNELFID FTV
|
| |
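The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a sketch under stated assumptions, not the method the source database uses: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the sequence is assumed to be given 5' to 3' on the coding strand in 10-base blocks as shown above, and alternative start codons are not handled (this gene begins with ATG, so that does not matter here). Translation table 11 assigns the same amino acids to codons as the standard code, so the standard table is used.

```python
# Standard codon table in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments (it differs in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGTCTGTAT TAGCAAAAAA AGGAGAAAAA"))  # MSVLAKKGEK
print(translate("TTTACAGTTT GA"))                     # FTV
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content row.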