Gene Pars_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1839 
Symbol 
ID5056228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1645438 
End bp1646394 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content53% 
IMG OID640469385 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_001154042 
Protein GI145592040 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.414211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.341933 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATAACG TTATTGATAT ATTTTCAGGT GGCGGCGGAT TCGGGCTGGG TTTTAGACAG 
GCGGGTTTTA AAATAAGGGT GGCGCTTGAT GTGGACAGAG ACGCGGTCAG GACGTACAGC
GCCAACCACG TAAACACGGT AGTGTTGCAG AGGGACATTA GGGAGGTGAG CTACGAGGAT
TTGGTCAAAT ACGGAGAGGC TGATGTGCTA ATTGGGAGCC CTCCCTGCGA GCCGTTCACA
TCGGCAAATC CTAACAGAAT GGAAGACCCC GCCGACAGGC TTTACCTCGA TCCTGCCGGT
CAGCTGACAC TTGAATTTAT CAGGATTGTA GGCGAACTCA GACCGAAGAT CTTCGTCATG
GAGAACGTAG CGGCGTTGGC AGAGGAGCCA CTGAGGTCGT ACATTGAAAG GGAATTCAGA
AGGGTGGGCT ACGAGGTGTA CTTCAATGTA CTCCACGCAG AGGACTACGG AGTCCCCAGC
AGGAGGCGGA GGGTCTTCGT CTCAAACGTA GAAATTAGGC CACCTAAAAC ACGCATCATC
ACGGTCCGAG AGGCGTTGCG CGACCTGCCT CCCCCCGACA GCGGCCTAGT GCCTAACCAC
GACACGGTGA CGATAAGCAT GAAAAAACAG TATCAAATTG CCCGGCTGAG GCCTGGCGAG
GCTTTAATGA AATACAGAGG AGCTACCGGT TTCTATGAAA ACTACATCCG GCTACGCTGG
GACGAGGTGG CACCCACCGT AATGGGTACC CGGAGATTTG TCCACCCGGA GGAACACAGA
GTCCTCACAG TACGCGAGCA GGCTAGACTA ATGGGCTACC CAGACTCATA CACCTTCTTC
GGCTCTAAAG ACTCACAGTA TAACCAAGTT GGAGAAAGCG TGCCGCCCCC GCTGGCTTAT
GCAATTGCGC TTGAGATACG AAAATATATA GACGAGAAGG TTTATCGACG TGGCTAG
 
Protein sequence
MYNVIDIFSG GGGFGLGFRQ AGFKIRVALD VDRDAVRTYS ANHVNTVVLQ RDIREVSYED 
LVKYGEADVL IGSPPCEPFT SANPNRMEDP ADRLYLDPAG QLTLEFIRIV GELRPKIFVM
ENVAALAEEP LRSYIEREFR RVGYEVYFNV LHAEDYGVPS RRRRVFVSNV EIRPPKTRII
TVREALRDLP PPDSGLVPNH DTVTISMKKQ YQIARLRPGE ALMKYRGATG FYENYIRLRW
DEVAPTVMGT RRFVHPEEHR VLTVREQARL MGYPDSYTFF GSKDSQYNQV GESVPPPLAY
AIALEIRKYI DEKVYRRG