Gene Pars_1514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1514 
Symbol 
ID5055202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1372013 
End bp1373413 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content63% 
IMG OID640469056 
Productmethyltransferase small 
Protein accessionYP_001153722 
Protein GI145591720 
COG category[R] General function prediction only 
COG ID[COG4123] Predicted O-methyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.271049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGAGGA GGGGTTTGGT TGCCGATAGG GGGAGGGTGG CGACGCCGCC TGATTTGGCT 
TTTTACATGG TGGAGAAGCT TTTTAGGGGG GCGCCGCCGG GTGGCGGTAG CAGGGTGTTG
GATGCCGGAT GTGGCCTGGG GGTGTTCATA GACGCGGTGT TGAGGTGGTG TAGGGGGCGT
TGCGCCGAGC TTCCTGAGGT GGTGGGGGTG GAGGTGGACC CGGCGCTTGC CGAGGCGGCG
AGGCGGAGGT TTGCCGGGGA GCGGGTGAGG ATTGTGCGGG GTGACTTCTT GCTGATGTCG
GCGGGGGAGC TCGGCGGCTT GTTCGACTAT GTGATCGGCA ACCCGCCCTA CGTCTCTTAC
GAATACATCG ACCCGCCGAA GAGGGAGCTG TACAAGAGGC TGTTCACCAC GGCGGTGGGG
CGGTTTGATT TGTACATGTT GTTTTTCGAA AAGGCGCTGT CGTTGCTGAA GCCGGGGGGT
AGGCTCGTAT TCGTCACGCC GGAGAAGTAC CTCTACGTGC TGTCGGCTGT TGCGCTGAGG
AGGTTGCTGG CCAGCTACAG GGTGGAGGAG GTGGAGCTTA TCCGGGAGGA CGCCTTCGGG
GGTGTGTTGG CCTACCCGGC GATCACCGTG GTGGTGAAGG AGGCGCCTTC CTTGACGACT
ATAAGGCTTA GGGATGGGCG GGCGGCGAGG GTGGCGTTGC CGAGGGACGG CTCTCCGTGG
CTCTCCGCCA TAGCCACGGC CAAGTTAAGG ACGCCCTACA GCCTCGGCGA CTTGGTTTTG
AGGATAAGCC CGGGGGTCGC CACTGGCCGG GATGACGTTT TCGTGATCCC AAAACGCGCC
TTGTCAAAGG AGCTTGAGCC GTTTGCCTAC CCAACGGTGG GTGGGAGGGA GCTCTCCGCC
TTTGCCCCCG GCTCCGTTGT GGACTATGAC AAGTTGGCCC ACGTCATCCT CATCCCATAC
GACAGAGGCG GCCGGCTCCT GGACGAGGGG GAGGCAAAGC CGCTTTTGGA CTACTTGTCT
AGGTGGCGGC GGGTGCTGGA GTCGAGATAT GCGGTTAGGG CGGAGGGTAA GAGGTGGTAC
GCCTTTCACG AAGACCCGCC TATGGGCGAT CTGCTCCGGC CTAAGATACT CTGGAGGGAC
ATAGCTAAGG AGCCCGCCTT CTACATAGAC GCGAAGGGCC TCCTCATCCC AAAGCACACC
GTTTACTACC TAGTCCCCAA GGACCCCGGC ATGTTGCCCA GGCTGGCCGA GTACCTCAAC
AGCGCCGAGG CCAAGAGGTG GCTGATGGAG CATTGCCAGA GGGCGGCCAA CGGCTACTTG
AGGCTCCAGA CCCACGTGCT TAGGCAACTC CCAGTGCCTC CGGAGGTGGT GGGGGAGGGG
CATGGCCTTG GGAGAGTGTA G
 
Protein sequence
MGRRGLVADR GRVATPPDLA FYMVEKLFRG APPGGGSRVL DAGCGLGVFI DAVLRWCRGR 
CAELPEVVGV EVDPALAEAA RRRFAGERVR IVRGDFLLMS AGELGGLFDY VIGNPPYVSY
EYIDPPKREL YKRLFTTAVG RFDLYMLFFE KALSLLKPGG RLVFVTPEKY LYVLSAVALR
RLLASYRVEE VELIREDAFG GVLAYPAITV VVKEAPSLTT IRLRDGRAAR VALPRDGSPW
LSAIATAKLR TPYSLGDLVL RISPGVATGR DDVFVIPKRA LSKELEPFAY PTVGGRELSA
FAPGSVVDYD KLAHVILIPY DRGGRLLDEG EAKPLLDYLS RWRRVLESRY AVRAEGKRWY
AFHEDPPMGD LLRPKILWRD IAKEPAFYID AKGLLIPKHT VYYLVPKDPG MLPRLAEYLN
SAEAKRWLME HCQRAANGYL RLQTHVLRQL PVPPEVVGEG HGLGRV