Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0197 |
Symbol | |
ID | 8413045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 233616 |
End bp | 236993 |
Gene Length | 3378 bp |
Protein Length | 1125 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 645021766 |
Product | type III restriction protein res subunit |
Protein accession | YP_003179221 |
Protein GI | 257784004 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000621753 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAATTTCA GCTTCCTTAA GGACATTAAA GAGTACGAGA TGTTCTCTGG CGCTTGCGAA GATGCAGAAC GCACCTTCGC ATCAAGTCCC GCTATGTGCG CACTTGCATG CCGTAAGGCG ATGGAACTTG CCGTCAAATG GGTTTATGCG GCCGACGAAA CAATGAACGA GCCATGGCGT AGCAACCTCG CCTCCCTCAT TCATGAGGCA AGCTTCAAGA ATGCCATGGA CACTACTACC TGGGGCCAGC TGCGCTACAT CTGGAAGCTC GGCAACAACG CGGCTCACAC CCAACGGCAA ATTGATGCAC GCGATGCCCT CGCTGCATTA GCAGGACTTT TTAACTTCGT TGAGTGGATT GACTACTGCT ATGGGCCAAG TTACGAGGAG CGCCATTTTG ATCCCAAGGC GATTCCCGTC CTGAAACAGG CGCTTGACGA GCAGAAACTG CGAAAGACCC TGCAGGAGAA TGCAGAAAAA GATGCTCGCA TAGCCCAACT GGAAGAGCAG GTGAGGGCAC AGCGGGCACA GTTCAGCGAA ACCAAGGAGC AGCGCCCCGA GGAAACGCAT TTCAATCCGG ATGAGATTCC CGAATATGAG ACGCGCAAGC GCTATATCGA TGTTGATCTT GAATATGCAG GTTGGAACCT CGATGACAGT GTAAAGACCG AGGCTGAAGT TAGCGGTATG CCGAACCCCA CGGGTACTGG ATTTATCGAT TACGTACTTC TGGGCAGGAA TGGCAAGCCC TTGGCAATCG TCGAGGCCAA GCGCACCATG TACGATCCCT TGAAAGGTGA GCAACAGGCT CGTCTCTATG CAAACTGCCT GGAAAAGAAG TACGGGTACC GACCCTTCGT CTTCCTCTCC AATGGCTTTG AGACGCGCTT CATGGATGAC GGAACTGCTG CACCTCGTGC CTGTTCGGGC GTCTTTGGCA GAGAAGATCT CGAGCGCCTT ATGAACAGGC GCGGTAAGGT AAAGAATCTT TCCACCGTGC CCGTCAATCG CGACATTGCT GGCGGCGGGC GAAACCGCTA CTACCAGGTA GAGGCCATTG AAGCCGTGTG CCAGAATATG GAAGAAGGGC ATCGGAGAAG CCTTCTCGTT ATGGCCACGG GAACTGGAAA GACGCGAACC GCAGCAGGTT TGGTGGACGT GCTCTCCCGT GCGGGAGCGG TGACTAACGT TCTTTTCCTT GCAGACCGTG TTGCATTGGT TCGACAGGCA AAGAATGCCT TCCAAAACTA CCTAACCACA ATGACCGTGT GCAACCTTTG TGAGACCGAT GACAAGGCGA AGAATGTCGA TGCACGCATT GTATTCTCCA CGTACCCCAC CATATTGAAT GCGGTAGATG ACGTGAAGAG AGATGGGGAT GCGCGCCTGT TTACGCCGGC TCATTTCGAC CTCATCATCG TTGACGAAGC CCATCGTTCT ATTTTCAAAA AATACCGCGC CATTTTCGAT TACTTCGATT CACCGGTGGT TGGGCTCACG GCAACCCCCA AAGATGAAGT CGATCGCAAT ACTTATGACT TCTTCCAAGT CGAACGCGGT ATCCCAACGT ACCTCTACGA GTATGCGACC GCGGTTGAGA AAGACAAGGT TCTCGTCCCC TACTACAACA TCGAGGTCCA GACGAACTTT CTCTCCAAGG GCATCACATA TGACGAGCTA TCGGAGGAAG ATCGCCAGCG CTACGACGAT GACTGGGAGG AAGCGCAGGG AACTTCCGCA CCTGATTACG TCGAGGCAAG CGCGCTAGAC CGCTTTGTAT TCAACGAGCA CACAATCGAT CTCGTCCTTA CGACCCTCAT GGATGAGGGA ATCAAAATCA AGGGCGGCGA GCACATAGGA AAGACGATCA TCTTCGCCCA GAACCGCAAG CATGCAGAGA TAATCGTCAA ACGCTTCAAC GAGCTCTATC CGAAGTTTGG CGCACAAGGT TTTTGCAAGC GCGTTGTGCA CACGGACGAT TACGCCCCAA CTGTTATTAC TGACTTCGAG ACGAAGGAAA TGCCCACCAT CACCGTGTCT GTCGACATGA TGGACACGGG TATCGACGTG CCCGAAGTCG TTAACCTCGT GTTCTTCAAG CAAGTGAAGT CGAAGGTCAA GTTCTGGCAG ATGATAGGAC GTGGAACCAG GCTTTGTGAG GGAATACATG CCCAAGACAA AGTGTCTGGA GAATACGAAG ACAAGAAGCA CTTCTTCATT TTCGACTGGT GCGGTAACTT CGAGTTTTTC CGCCAGGAGC AAAAGTTAGC AGAGGGTGCT AACCCGGAAA GCATGCAGGA GAAGGTTTTC AAGCGTCAGG CTCTGCTTGC CCAAGCCCTG CAGGGAGCAG ATTTCGCATC TGATGATTAC CAGGAGTGGC GCGTCCGAGT AGTCCAGGAA ATGGCGGTTA AAGTCCAAAG CGTTAAGGAA CCGCTTACTG CTGCGGTCAA ATTGCACATC CGTGAAGTTG ACAAGTTCTC GCAAGTTCCT TCATATCAAG TCCTCGAGGA CGTAGATATA GCAGACCTCA ACAAGGTCGC CCCTCTAGTG CGTGCAGATG GTGAAGAAGA GCTCTCATTG CGCTTCGATG CGCTCATGTA TGCCTATATG GTGGTGCTCA TAAGTGGAAC CAACACAGAA AACCATCGCA CACGCGTGGT GGGCATTGCC GTGAGGCTGC AAAATAAGGC AAGCATTCCC CAGGTGCGGG AGAAAATGGA TCTCCTGAAA CGCGTTACAT CCGAGGGCTT CCTTGAGAGT GCAAGTCTAA TCACGCTTGA AGAGGTACGC GTAGAGCTGC GTGACCTTAT GAAGTTCCTC ATCGGCGACA GCAGGCCTCT GATGGTGGAA ACGCGAGTCA CCGACTCCGT TGTTGGACGC TGCGAGGGCG ATGCTATTAG CCCTAAAGAG GATTTCGAGG ACTATAAGCT AAAGGTAAGT CGCTATATCG CTCAGAATGC GAACCACACG GTAATCGCCA AGATTCATCA CAACCAACAG ATGACTTCCT TCGAGCTCAA TGAGCTGGAG CGTATTTTCA CGGTCGAACT AGGTAATGAA GCGGACTATC GAGCCGCTTA TGGTGACACT CCATTCGGGA AGCTTGTACG GCAAGTTGCA GGTCTTTCGC ATGAAGCCGC GATGGATGCT TTTGCGGAAT TTCTAGGAGA CGAGTCTCTT TCGCGCCAGC AGATGGATTT CGTTCACAAG ATCGTCGACT ACGTAGAGAC CAATGGCTTT ATGGATTTGG CCGATTTGGG CAAGCCTCCA TTTGATCAAC CGCAGAGCTT CGTAAGGCTC TTCGACGGAA GAAGGCAACG ACGCCTCGTG CAGATTATTC AATCGGTAAA CGACAACGCT ACCACGCCTG CCGCTTAA
|
Protein sequence | MNFSFLKDIK EYEMFSGACE DAERTFASSP AMCALACRKA MELAVKWVYA ADETMNEPWR SNLASLIHEA SFKNAMDTTT WGQLRYIWKL GNNAAHTQRQ IDARDALAAL AGLFNFVEWI DYCYGPSYEE RHFDPKAIPV LKQALDEQKL RKTLQENAEK DARIAQLEEQ VRAQRAQFSE TKEQRPEETH FNPDEIPEYE TRKRYIDVDL EYAGWNLDDS VKTEAEVSGM PNPTGTGFID YVLLGRNGKP LAIVEAKRTM YDPLKGEQQA RLYANCLEKK YGYRPFVFLS NGFETRFMDD GTAAPRACSG VFGREDLERL MNRRGKVKNL STVPVNRDIA GGGRNRYYQV EAIEAVCQNM EEGHRRSLLV MATGTGKTRT AAGLVDVLSR AGAVTNVLFL ADRVALVRQA KNAFQNYLTT MTVCNLCETD DKAKNVDARI VFSTYPTILN AVDDVKRDGD ARLFTPAHFD LIIVDEAHRS IFKKYRAIFD YFDSPVVGLT ATPKDEVDRN TYDFFQVERG IPTYLYEYAT AVEKDKVLVP YYNIEVQTNF LSKGITYDEL SEEDRQRYDD DWEEAQGTSA PDYVEASALD RFVFNEHTID LVLTTLMDEG IKIKGGEHIG KTIIFAQNRK HAEIIVKRFN ELYPKFGAQG FCKRVVHTDD YAPTVITDFE TKEMPTITVS VDMMDTGIDV PEVVNLVFFK QVKSKVKFWQ MIGRGTRLCE GIHAQDKVSG EYEDKKHFFI FDWCGNFEFF RQEQKLAEGA NPESMQEKVF KRQALLAQAL QGADFASDDY QEWRVRVVQE MAVKVQSVKE PLTAAVKLHI REVDKFSQVP SYQVLEDVDI ADLNKVAPLV RADGEEELSL RFDALMYAYM VVLISGTNTE NHRTRVVGIA VRLQNKASIP QVREKMDLLK RVTSEGFLES ASLITLEEVR VELRDLMKFL IGDSRPLMVE TRVTDSVVGR CEGDAISPKE DFEDYKLKVS RYIAQNANHT VIAKIHHNQQ MTSFELNELE RIFTVELGNE ADYRAAYGDT PFGKLVRQVA GLSHEAAMDA FAEFLGDESL SRQQMDFVHK IVDYVETNGF MDLADLGKPP FDQPQSFVRL FDGRRQRRLV QIIQSVNDNA TTPAA
|
| |