Gene Apar_0198 details


Gene Information

Locus tag: Apar_0198
Symbol:
ID: 8413046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 237003
End bp: 238163
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 46%
IMG OID: 645021767
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_003179222
Protein GI: 257784005
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
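
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 237003
END_BP = 238163
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 238163 - 237003 + 1 = 1161 bp, and 1161 / 3 - 1 = 386 aa, which agrees with the values listed.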


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.719886
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000218103
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACTAGCA AACCGCAATT TGTCGGCGAG CGAGTGAGAC TTACATCTTT TGTTAGCGCT 
GCGGGCAAAA GAAATAAGGG TGCAAAGTGC ACGGATGTAT ATAGTGTTAC GAACTCACAT
GGATTCGTAC CATCAACTGA ATACTTTTCC AAAGAGGTCT TTAGCAAGGA ACTGGAAGCA
TATCGCCTTG TCGAAAGAGG AATGCTTGCC TATAACCCGT CTCGTATTAA CGTAGGTTCA
ATTGCACTAC AAGAGTCTGC CGATAGAGTG GTTGTAAGCC CGCTCTATGT AGTCTTTTCG
GTTGATACGA GACATCTGGC GCCGGGTTAC CTGCTGCGAT TTCTCAAGAG TAAGCCTGGA
CTCAACCAGA TAGCGTTTAG GTCGTCTGGT ACCGTGCGCA GCAATCTCAA ATTCGATGCA
CTTAGCTTGC TTGAGATGCC CTTGCCTAGT ATTGATGTAC AGGAAAAAAG ACTTGTTGTC
TTATCGCGAC TTGAAAAGCA GATTGAGGCA AGAGGAGAAT TTATTGCTTC GCTCGACACC
CTCGTCAAAT CCCGGTTTAT CGAGATGTTC GGGGACCCAA TCGCTCTAAA TTCGAATAAG
AAGTCGCGCC TAGATAGCTT TGCCAAAATA ATCACAGGGA ATACCCCATC AAGAAAAAAA
CCTGAGTACT ACGGCGACTA TATTGAATGG ATTAAAACAG ACAACATCAC ATCAACCCCA
GTGCTTACAA AGGCAGCTGA ATCGCTTTCT GAGGACGGGG CATCTGCGGG CAGGATTGCT
CCATCAGGAA GCGTGCTGAT GTCATGCATT GCAGGAAGCG TTAAATCCAT AGGCAAAGTG
GCGATTGCCG ATAGACCCGT AGCATTCAAC CAACAAATAA ATGCGATTAT TCCTGCAGAT
GGAATCTTGA CTGAGTATCT TTACTGGATG CTGTCGCTTT CAAAAGACTA TCTTTGCTCT
GATATCAACA TGCAGCTCAA AGGCATCCTT AACAAAACAG CTCTATCAAG AAAAATGTTC
TGCGTGCCCC CTCCATCTCT TCAGCAAGAG TTCGCGACCT TTGTCCGGCA GGTCGACAAA
CTGCGAGTTG TAGCAGAAGA GCAGAAGAAA AAACTCCAGA CGTTGTACGA CAGTCTCGCT
CAGGAATACT TCGCGATATA G
 
Protein sequence
MTSKPQFVGE RVRLTSFVSA AGKRNKGAKC TDVYSVTNSH GFVPSTEYFS KEVFSKELEA 
YRLVERGMLA YNPSRINVGS IALQESADRV VVSPLYVVFS VDTRHLAPGY LLRFLKSKPG
LNQIAFRSSG TVRSNLKFDA LSLLEMPLPS IDVQEKRLVV LSRLEKQIEA RGEFIASLDT
LVKSRFIEMF GDPIALNSNK KSRLDSFAKI ITGNTPSRKK PEYYGDYIEW IKTDNITSTP
VLTKAAESLS EDGASAGRIA PSGSVLMSCI AGSVKSIGKV AIADRPVAFN QQINAIIPAD
GILTEYLYWM LSLSKDYLCS DINMQLKGIL NKTALSRKMF CVPPPSLQQE FATFVRQVDK
LRVVAEEQKK KLQTLYDSLA QEYFAI
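
The gene and protein sequences above are related through translation table 11. As a minimal sketch (not the database's own method), the code below strips the 10-base spacing, translates a DNA string, and computes GC content. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons; since this gene starts with ATG, no special handling of alternative starts is included. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First row of the gene sequence, exactly as displayed.
    first_row = "ATGACTAGCA AACCGCAATT TGTCGGCGAG CGAGTGAGAC TTACATCTTT TGTTAGCGCT"
    print(translate(first_row))  # MTSKPQFVGERVRLTSFVSA
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content.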