Gene Apar_1225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1225 
Symbol 
ID8414104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1375039 
End bp1376418 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content47% 
IMG OID645022819 
Producthypothetical protein 
Protein accessionYP_003180243 
Protein GI257785026 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.438073 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATT CTGCATCTCT CCTCTCCACA GTGCTCTTTT CAAACATTGG GCAGTGGGAT 
GTCAAGCAAT TCTTTCTGAA CATAATTCAG TCCTCGTATA CCGTGGAGGA GCTTGGCAAA
CACCTCGTCC ACCAAACCGA AAAAGTTCAA CTTTCAGATT ATCCAGAAGA CAAGTTTGTC
ATCTTGGGTG TGTCAAACAA AATTGGAATG TTTGATGCCA GCATTAAAAA AGGCAAGAAG
ATCAAGCAAA AGTACCATGT TGTCAAAGAT GGTTGGCTAG CCTACAACCC ATACCGAATT
AACGTTGGCT CTATCGGCAT CAAAACCCCA GAGCTTCAAG GCGGGTATAT CAGCCCTGCT
TATGTTGTCT TCAGTTGTAA AGACACTTTG CTTCCTGAAT ATCTCTGGCT GATGATGAAG
AGCGATTATT TCAATGCTCT CATCAACGAT TCGACTACAG GCTCGGTACG TCAAACATTA
CGCTTCGATA AGCTGGCAAG CATCAAAGCG CCGATACCAA CAGTAGATGA ACAAAAAGAA
ATACTCGCCC AATATCACGC CACCCTCGCC GAGGCGGAGA AAAACATATC CGACGGCAAT
AGCTTCAGCG ACGGCCTGCT CTTCGACATC CAGTCAAAGG TCTCCGACCT AGAAAAGGAC
GAGTCTGCTG CAGAAAAGCC TTCCTCGATC ATTCAGCCGG TTCCCTTTGC TGCAATGTCA
CGTTGGGAGG TCGCCTATAC GCTCAAGAAG GGAAAGCTGG AGCGGGTTTA CGGTAGCTTC
AAGTGTCCTT TCAAATCGAT TTCAGAACTT ACCAAGGAAT CACTCTTTGG TCTGTCTCTT
AAGGCGTCGC TCAAGCAGGA AAGCGGCATG ATACCCATAC TCCGTATGTC AAATATCGTC
AATGGCGAAA TCGACTGCAG CAGCCTTAAG TATCTGCCAT ACAAAAGCGC TGTTACTCCG
AGGGAGCCGG ACAAGTGGCT TCTTCGCAAA GGAGACTTTC TTATCAACAG AACAAACAGC
AAAGAGCTGG TTGGCAAGTC TGCGGTATTC AATCTCGACG GCGACTACAC CTATGCCTCG
TATATCATAC GCTACCGCTT CGACACATCG GTCGTCCTGC CGGAGTACGT GAATATCATG
TTCATGCTTC CTCTGGTGCG AATCCAAATA GACACCATGA GCAGGCAGAC CGCAGGGCAA
TGCAATATAA ACAGCGGCGA GATCGGCTCT ATTAGAATTC CCATCCCGTC AATTCCGGAG
CAACAAGCCA TCATCGATAA GTACTACTCT ACGAAAGACG GCGCTGACGC GTTCTACGCC
AAAGCGGAAG AGCTAAAACA AAAAACCGCT GAGGACTTCG AGAAGAGCAT ATTTGCATAA
 
Protein sequence
MNDSASLLST VLFSNIGQWD VKQFFLNIIQ SSYTVEELGK HLVHQTEKVQ LSDYPEDKFV 
ILGVSNKIGM FDASIKKGKK IKQKYHVVKD GWLAYNPYRI NVGSIGIKTP ELQGGYISPA
YVVFSCKDTL LPEYLWLMMK SDYFNALIND STTGSVRQTL RFDKLASIKA PIPTVDEQKE
ILAQYHATLA EAEKNISDGN SFSDGLLFDI QSKVSDLEKD ESAAEKPSSI IQPVPFAAMS
RWEVAYTLKK GKLERVYGSF KCPFKSISEL TKESLFGLSL KASLKQESGM IPILRMSNIV
NGEIDCSSLK YLPYKSAVTP REPDKWLLRK GDFLINRTNS KELVGKSAVF NLDGDYTYAS
YIIRYRFDTS VVLPEYVNIM FMLPLVRIQI DTMSRQTAGQ CNINSGEIGS IRIPIPSIPE
QQAIIDKYYS TKDGADAFYA KAEELKQKTA EDFEKSIFA