Gene Apar_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0072 
Symbol 
ID8412915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp81984 
End bp83303 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content50% 
IMG OID645021639 
ProductSAM-dependent methyltransferase 
Protein accessionYP_003179099 
Protein GI257783882 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAC TTCGCCCTTA TCCTCAGGTT ATTGTCACCA AGAAAGCAGC TCGCGCACTT 
GCGGGCGGGC ATCCTTGGGT CTTTGAGGGT GAGGTTATCC GTGTAGAGCC GTCTCCAGAA
AGTGGCGAGC ATGCACAAAA TGGTTGCATT GTTGATGTCT TTGAGGAGAA CGGCACCTAC
CAGGGAACCG GTCTTCTGTC AGAAATGAGC AAGATTCGCG TCAGAATTGT GACAAGAAAC
GCCAACGATA GACTCAACGA AGCGTTTTGG AGCCGAAAAA TTAGCTGGGC GTGGAAGCAC
CGAAAAACCA TTATGGGAAA CCGCGCTCTA CCAGGTACTG AGTCCGATAC CAATTGCTGC
CGCGTTATCT TCTCTGAGGC CGATGGTTTT CCTGGCTTAA TCGTTGACCG CTACGAGAAT
GTACTTGTTG CGCAGGTAGG AACCGTTGGC ATGGAGCGTC TGCGCAACGT TATCTATCCT
CTGCTGCTTG AGGTTTTATC TTCAGACGGC CAGGTAATTG ATGGCATCTA CGAAAGAAAT
GACTCTCCTT CACGCCTTAA AGAGGGACTT CCGCAATATA AGGGCTGGTG GACAGGTACC
TCTCCTGACG AGAATGGCTT GATTGCAGAA AGTTCCCTTA CCTCGCCTTC AACTCACGTT
CTTGCAACCG AAAATGGACT CAAATTCAAC CTTGACCTAG AGAACAGCCA GAAAACCGGC
TTTTTCTTAG ATCAAAAATA CAACCGCCGA GCAGTTCGTC AACTTGCGCA AGGTCATCGT
GTACTGGATT GCTTCTGTCA CGTTGGTCCG TTTGGTCTAA ACGCTACAGC AGGAGGAGCA
GATTTTGTCC GCTGCGTAGA TGTCAGCCAA ACCGCCATTG ACCTTGCACG CCAAAATGCG
GAGCTCAACG GCTTGGCAGA CCGCATAGAC TTTACCTGTG CAAACGTTCT TGAATACCTG
CCAGAACTGG CTCGCGACCG TACCCAGCTC AAAGCTGAGG GAGGTCCATT TGACCTTATC
ATTTTGGATC CGCCTGCATT CACCAAAACG CGCGACAAAG TCCGCAGCGC CATGCGTGGC
TATAAAGAAA TCAATGCAAC AGCTATGAAG CTGCTACCTC GAGGCGGCTA CCTAGCAACT
TGTTCATGCT CTCACTTCAT GACTAGAGAT CTTCTCGCCC AGGCAATTGC CGAGGCAGCA
CACCATACCA ACGTGCAGCT CAAACAAATT GAAGAGCGAC AGCAAGCTCC TGACCACCCA
ATTCTTTGGG GTGTTCCTGA GAGTCACTAT CTTGATTTCT TCATTTTCCA GGTGATTTAA
 
Protein sequence
MKQLRPYPQV IVTKKAARAL AGGHPWVFEG EVIRVEPSPE SGEHAQNGCI VDVFEENGTY 
QGTGLLSEMS KIRVRIVTRN ANDRLNEAFW SRKISWAWKH RKTIMGNRAL PGTESDTNCC
RVIFSEADGF PGLIVDRYEN VLVAQVGTVG MERLRNVIYP LLLEVLSSDG QVIDGIYERN
DSPSRLKEGL PQYKGWWTGT SPDENGLIAE SSLTSPSTHV LATENGLKFN LDLENSQKTG
FFLDQKYNRR AVRQLAQGHR VLDCFCHVGP FGLNATAGGA DFVRCVDVSQ TAIDLARQNA
ELNGLADRID FTCANVLEYL PELARDRTQL KAEGGPFDLI ILDPPAFTKT RDKVRSAMRG
YKEINATAMK LLPRGGYLAT CSCSHFMTRD LLAQAIAEAA HHTNVQLKQI EERQQAPDHP
ILWGVPESHY LDFFIFQVI