Gene Apar_1213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1213 
Symbol 
ID8414091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1358518 
End bp1359573 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content58% 
IMG OID645022807 
Product5-methylcytosine-specific restriction enzyme subunit McrC 
Protein accessionYP_003180232 
Protein GI257785015 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.816052 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTCGGA TACAGAACAT CTACCACATG CTCGCCTACG CGTTCCAGAC GCTGCAGGGG 
CAGGGCTACC GCGACATAGC CGCCGAGGAG TTTGGAAACA CCACCGAGCT CCTCGCTGAG
ATACTGGCGC GGGGTGTGAG CTTGCAGCTA AAGCGAGGCC TCGGTCAAGA GTATATCGAC
CGCGAGGAGG CGCTCTCCTC CCCGAGGGGA AAGATAGAGC TGTCCGAGTC TCTGAAGACA
CGCTCGATCC TGCGCAGGCA GCTGGTCTGC AGCTACGACG AGTTCAGCAC GGACACGCGC
ATGAACCGCA TCCTCAAGGC GACGATTGCG CTCCTGGTCC GCTCGGACAT CGACAAGGTA
CGCAAGAAGG CGCTCAGGCG GCTGCTACCG TACTTCGTGG ACGTGGGCGA CGTAGACCTT
GAACATGAGG ACTGGCACAT GCGCTTCGAC CGGAACAATC AGGCCTACCG CATGCTCATG
AATGTGTGCT GGCTGGTCGT GAAGGGCCTC CTCCAGACGC AGGAAGACGG AAGCATCCGC
ATGATGGACC TCCTCGACGA GCAGCGCATG AGCCACCTGT ACGAGAAGTT CATCCTCGAG
TACTACAGGC GCGAGCACCC GAAACTCTCC GCAGGGGCTC CATACATCGA TTGGGCTCTC
GACGACGGCT TCGATGACAT GCTCCCCGCC ATGCACACTG ACATAATGCT CGAGCAGGGC
AGGACTGTCC TCATCATCGA CGCGAAGTAC TACAGCCGCA CAATGCAACA GCAGTTTGAC
AAGCGAAGCG TCCATTCGAG TAACTTGTAC CAGATCTTCA CCTACGTGAA GAACAAGGAA
GTGGAGCTTT CCAGTACCCT CAAAGCCCAC AGTGTATCGG GCATGCTGCT CTACGCAAAG
ACCGACGAAG AAATCCAGCC TGATGGCGTG TACCAGATGA GCGGCAACCA GATAAGCGTG
AGGACGCTCG ATCTCAACCA GCCTTTCGAG GAGATACGCT CGCAGCTCGA TGGAATTGCC
AAGGCACATT TCTCAAAGGA GGCAGCCTGT GTTTGA
 
Protein sequence
MIRIQNIYHM LAYAFQTLQG QGYRDIAAEE FGNTTELLAE ILARGVSLQL KRGLGQEYID 
REEALSSPRG KIELSESLKT RSILRRQLVC SYDEFSTDTR MNRILKATIA LLVRSDIDKV
RKKALRRLLP YFVDVGDVDL EHEDWHMRFD RNNQAYRMLM NVCWLVVKGL LQTQEDGSIR
MMDLLDEQRM SHLYEKFILE YYRREHPKLS AGAPYIDWAL DDGFDDMLPA MHTDIMLEQG
RTVLIIDAKY YSRTMQQQFD KRSVHSSNLY QIFTYVKNKE VELSSTLKAH SVSGMLLYAK
TDEEIQPDGV YQMSGNQISV RTLDLNQPFE EIRSQLDGIA KAHFSKEAAC V