Gene Apar_0197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0197 
Symbol 
ID8413045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp233616 
End bp236993 
Gene Length3378 bp 
Protein Length1125 aa 
Translation table11 
GC content52% 
IMG OID645021766 
Producttype III restriction protein res subunit 
Protein accessionYP_003179221 
Protein GI257784004 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000621753 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAATTTCA GCTTCCTTAA GGACATTAAA GAGTACGAGA TGTTCTCTGG CGCTTGCGAA 
GATGCAGAAC GCACCTTCGC ATCAAGTCCC GCTATGTGCG CACTTGCATG CCGTAAGGCG
ATGGAACTTG CCGTCAAATG GGTTTATGCG GCCGACGAAA CAATGAACGA GCCATGGCGT
AGCAACCTCG CCTCCCTCAT TCATGAGGCA AGCTTCAAGA ATGCCATGGA CACTACTACC
TGGGGCCAGC TGCGCTACAT CTGGAAGCTC GGCAACAACG CGGCTCACAC CCAACGGCAA
ATTGATGCAC GCGATGCCCT CGCTGCATTA GCAGGACTTT TTAACTTCGT TGAGTGGATT
GACTACTGCT ATGGGCCAAG TTACGAGGAG CGCCATTTTG ATCCCAAGGC GATTCCCGTC
CTGAAACAGG CGCTTGACGA GCAGAAACTG CGAAAGACCC TGCAGGAGAA TGCAGAAAAA
GATGCTCGCA TAGCCCAACT GGAAGAGCAG GTGAGGGCAC AGCGGGCACA GTTCAGCGAA
ACCAAGGAGC AGCGCCCCGA GGAAACGCAT TTCAATCCGG ATGAGATTCC CGAATATGAG
ACGCGCAAGC GCTATATCGA TGTTGATCTT GAATATGCAG GTTGGAACCT CGATGACAGT
GTAAAGACCG AGGCTGAAGT TAGCGGTATG CCGAACCCCA CGGGTACTGG ATTTATCGAT
TACGTACTTC TGGGCAGGAA TGGCAAGCCC TTGGCAATCG TCGAGGCCAA GCGCACCATG
TACGATCCCT TGAAAGGTGA GCAACAGGCT CGTCTCTATG CAAACTGCCT GGAAAAGAAG
TACGGGTACC GACCCTTCGT CTTCCTCTCC AATGGCTTTG AGACGCGCTT CATGGATGAC
GGAACTGCTG CACCTCGTGC CTGTTCGGGC GTCTTTGGCA GAGAAGATCT CGAGCGCCTT
ATGAACAGGC GCGGTAAGGT AAAGAATCTT TCCACCGTGC CCGTCAATCG CGACATTGCT
GGCGGCGGGC GAAACCGCTA CTACCAGGTA GAGGCCATTG AAGCCGTGTG CCAGAATATG
GAAGAAGGGC ATCGGAGAAG CCTTCTCGTT ATGGCCACGG GAACTGGAAA GACGCGAACC
GCAGCAGGTT TGGTGGACGT GCTCTCCCGT GCGGGAGCGG TGACTAACGT TCTTTTCCTT
GCAGACCGTG TTGCATTGGT TCGACAGGCA AAGAATGCCT TCCAAAACTA CCTAACCACA
ATGACCGTGT GCAACCTTTG TGAGACCGAT GACAAGGCGA AGAATGTCGA TGCACGCATT
GTATTCTCCA CGTACCCCAC CATATTGAAT GCGGTAGATG ACGTGAAGAG AGATGGGGAT
GCGCGCCTGT TTACGCCGGC TCATTTCGAC CTCATCATCG TTGACGAAGC CCATCGTTCT
ATTTTCAAAA AATACCGCGC CATTTTCGAT TACTTCGATT CACCGGTGGT TGGGCTCACG
GCAACCCCCA AAGATGAAGT CGATCGCAAT ACTTATGACT TCTTCCAAGT CGAACGCGGT
ATCCCAACGT ACCTCTACGA GTATGCGACC GCGGTTGAGA AAGACAAGGT TCTCGTCCCC
TACTACAACA TCGAGGTCCA GACGAACTTT CTCTCCAAGG GCATCACATA TGACGAGCTA
TCGGAGGAAG ATCGCCAGCG CTACGACGAT GACTGGGAGG AAGCGCAGGG AACTTCCGCA
CCTGATTACG TCGAGGCAAG CGCGCTAGAC CGCTTTGTAT TCAACGAGCA CACAATCGAT
CTCGTCCTTA CGACCCTCAT GGATGAGGGA ATCAAAATCA AGGGCGGCGA GCACATAGGA
AAGACGATCA TCTTCGCCCA GAACCGCAAG CATGCAGAGA TAATCGTCAA ACGCTTCAAC
GAGCTCTATC CGAAGTTTGG CGCACAAGGT TTTTGCAAGC GCGTTGTGCA CACGGACGAT
TACGCCCCAA CTGTTATTAC TGACTTCGAG ACGAAGGAAA TGCCCACCAT CACCGTGTCT
GTCGACATGA TGGACACGGG TATCGACGTG CCCGAAGTCG TTAACCTCGT GTTCTTCAAG
CAAGTGAAGT CGAAGGTCAA GTTCTGGCAG ATGATAGGAC GTGGAACCAG GCTTTGTGAG
GGAATACATG CCCAAGACAA AGTGTCTGGA GAATACGAAG ACAAGAAGCA CTTCTTCATT
TTCGACTGGT GCGGTAACTT CGAGTTTTTC CGCCAGGAGC AAAAGTTAGC AGAGGGTGCT
AACCCGGAAA GCATGCAGGA GAAGGTTTTC AAGCGTCAGG CTCTGCTTGC CCAAGCCCTG
CAGGGAGCAG ATTTCGCATC TGATGATTAC CAGGAGTGGC GCGTCCGAGT AGTCCAGGAA
ATGGCGGTTA AAGTCCAAAG CGTTAAGGAA CCGCTTACTG CTGCGGTCAA ATTGCACATC
CGTGAAGTTG ACAAGTTCTC GCAAGTTCCT TCATATCAAG TCCTCGAGGA CGTAGATATA
GCAGACCTCA ACAAGGTCGC CCCTCTAGTG CGTGCAGATG GTGAAGAAGA GCTCTCATTG
CGCTTCGATG CGCTCATGTA TGCCTATATG GTGGTGCTCA TAAGTGGAAC CAACACAGAA
AACCATCGCA CACGCGTGGT GGGCATTGCC GTGAGGCTGC AAAATAAGGC AAGCATTCCC
CAGGTGCGGG AGAAAATGGA TCTCCTGAAA CGCGTTACAT CCGAGGGCTT CCTTGAGAGT
GCAAGTCTAA TCACGCTTGA AGAGGTACGC GTAGAGCTGC GTGACCTTAT GAAGTTCCTC
ATCGGCGACA GCAGGCCTCT GATGGTGGAA ACGCGAGTCA CCGACTCCGT TGTTGGACGC
TGCGAGGGCG ATGCTATTAG CCCTAAAGAG GATTTCGAGG ACTATAAGCT AAAGGTAAGT
CGCTATATCG CTCAGAATGC GAACCACACG GTAATCGCCA AGATTCATCA CAACCAACAG
ATGACTTCCT TCGAGCTCAA TGAGCTGGAG CGTATTTTCA CGGTCGAACT AGGTAATGAA
GCGGACTATC GAGCCGCTTA TGGTGACACT CCATTCGGGA AGCTTGTACG GCAAGTTGCA
GGTCTTTCGC ATGAAGCCGC GATGGATGCT TTTGCGGAAT TTCTAGGAGA CGAGTCTCTT
TCGCGCCAGC AGATGGATTT CGTTCACAAG ATCGTCGACT ACGTAGAGAC CAATGGCTTT
ATGGATTTGG CCGATTTGGG CAAGCCTCCA TTTGATCAAC CGCAGAGCTT CGTAAGGCTC
TTCGACGGAA GAAGGCAACG ACGCCTCGTG CAGATTATTC AATCGGTAAA CGACAACGCT
ACCACGCCTG CCGCTTAA
 
Protein sequence
MNFSFLKDIK EYEMFSGACE DAERTFASSP AMCALACRKA MELAVKWVYA ADETMNEPWR 
SNLASLIHEA SFKNAMDTTT WGQLRYIWKL GNNAAHTQRQ IDARDALAAL AGLFNFVEWI
DYCYGPSYEE RHFDPKAIPV LKQALDEQKL RKTLQENAEK DARIAQLEEQ VRAQRAQFSE
TKEQRPEETH FNPDEIPEYE TRKRYIDVDL EYAGWNLDDS VKTEAEVSGM PNPTGTGFID
YVLLGRNGKP LAIVEAKRTM YDPLKGEQQA RLYANCLEKK YGYRPFVFLS NGFETRFMDD
GTAAPRACSG VFGREDLERL MNRRGKVKNL STVPVNRDIA GGGRNRYYQV EAIEAVCQNM
EEGHRRSLLV MATGTGKTRT AAGLVDVLSR AGAVTNVLFL ADRVALVRQA KNAFQNYLTT
MTVCNLCETD DKAKNVDARI VFSTYPTILN AVDDVKRDGD ARLFTPAHFD LIIVDEAHRS
IFKKYRAIFD YFDSPVVGLT ATPKDEVDRN TYDFFQVERG IPTYLYEYAT AVEKDKVLVP
YYNIEVQTNF LSKGITYDEL SEEDRQRYDD DWEEAQGTSA PDYVEASALD RFVFNEHTID
LVLTTLMDEG IKIKGGEHIG KTIIFAQNRK HAEIIVKRFN ELYPKFGAQG FCKRVVHTDD
YAPTVITDFE TKEMPTITVS VDMMDTGIDV PEVVNLVFFK QVKSKVKFWQ MIGRGTRLCE
GIHAQDKVSG EYEDKKHFFI FDWCGNFEFF RQEQKLAEGA NPESMQEKVF KRQALLAQAL
QGADFASDDY QEWRVRVVQE MAVKVQSVKE PLTAAVKLHI REVDKFSQVP SYQVLEDVDI
ADLNKVAPLV RADGEEELSL RFDALMYAYM VVLISGTNTE NHRTRVVGIA VRLQNKASIP
QVREKMDLLK RVTSEGFLES ASLITLEEVR VELRDLMKFL IGDSRPLMVE TRVTDSVVGR
CEGDAISPKE DFEDYKLKVS RYIAQNANHT VIAKIHHNQQ MTSFELNELE RIFTVELGNE
ADYRAAYGDT PFGKLVRQVA GLSHEAAMDA FAEFLGDESL SRQQMDFVHK IVDYVETNGF
MDLADLGKPP FDQPQSFVRL FDGRRQRRLV QIIQSVNDNA TTPAA