Gene Pars_1858 details

Gene Information

Locus tag: Pars_1858
Symbol:
ID: 5056024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1660457
End bp: 1661704
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 57%
IMG OID: 640469404
Product: molybdenum cofactor synthesis domain-containing protein
Protein accession: YP_001154061
Protein GI: 145592059
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
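
The length fields in this record agree with its coordinates if the coordinates are 1-based and inclusive and the CDS ends in a single stop codon. Both are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
# Coordinates copied from the record.
START_BP = 1660457
END_BP = 1661704


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1248 415
```

Both results match the Gene Length and Protein Length fields.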


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.269883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.565686
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGGGCT TTAAGACACT GATGCCGATA GCGGAGGCAC AGAGGGCGGT CATCAGCGCC 
ATTGCCCACA AGCCCTCTGT AGTCACAGTG CCGACGCCCC AGTCGGTGGG GCTGTACGTA
GCGCAGGACA TATTTGCGCC TGTAGACGTG CCGCCATTCG ATAGGGCTGC CTTCGACGGT
TTTGCGGTGA GGTCTGTTGA CACTATCGGC GCATCAAGGA CAAATCCCAT AATGCTAAAG
GTGGTCGGCA AGTCGCTACC GGGCCTCGGC TACCGCGGCG CCATTGGGCC TGGGGAGGCG
GTGGAAATAG CAACAGGCGC GCCTCTGCCC GATGGCGCAG ATGCGGTCGT GCCTTATGAA
GAGGCGGCGC ACAGGGGGGA GTACATTGAG GTGTATAAGC CAGTACCCCA GTACTACTAC
GTCTCGCGCA GGGGAGAGGA CGTATCGGCA GGAGAAGTTG TTTTAAAGCG GGGAAGGCGG
ATTAAGCCGT GGGACGTCGG CGTATTGGCC TCCCTAGGCA TTAAAGAGGT GGCTGTTTAC
AAAGTGACGG CAGGCCTAGT ATCCACAGGA AATGAGCTCG TTGAGCTAGA AGATGCGCCT
CCGCCCCCCG GCAAGATTAT AAACAGCACA CGACATATAA TAACGGCGCT TCTACTTGAA
CTTGGAGTAA AGACGACCTA CCTAGGGATA GTCCCCGACG ACGTTGATGC AATACACGGC
GTTTTGAAAG AGGCACTAGC CAAGTTCGAT ATCGTGATAA CAACTGGCGG CGTCTCTGTC
GGCGAGCCCG ACCACGTAGT GGAGGCGGTA AGGCGCCTTA AGCCGGAGGT GCTGGTCCAC
GGCATCGCCG CTAGGCCTGG GAGACCTAAT AGCGCAGCGG TGGTGGGGGG AAAGCCGGTG
ATTATGCTCT CGGGCTTCCC AGTCGCCTCT ATTGTCGGCT TTGAGGTATT CGTCAAGCCG
GTCATTCTCC ACATGGTCGG CGCCAGAGAG GAGCCTCTGC CCGTGGCCGT GGCCACTTTG
ACGAGGAGAG TCACCACACC AATTAACGTG AGGAGTTTAG TGAGGGTCAG GGTCTTCCGC
CAAGGCAGAG AGCTATACGC AGAGCCGCTT GCCGTCACGG GGAGCGGCGT TTTGTCAACG
CTGACAAGGG GCAACGGCCT TTTGATCATA CCGGAAAACA GAGAAGGCTA CGACGAGGGT
GACAAGGTTG AGATCGTACT GCTCGGCCCC ATAGAAGAGG AAAAATAA
 
Protein sequence
MKGFKTLMPI AEAQRAVISA IAHKPSVVTV PTPQSVGLYV AQDIFAPVDV PPFDRAAFDG 
FAVRSVDTIG ASRTNPIMLK VVGKSLPGLG YRGAIGPGEA VEIATGAPLP DGADAVVPYE
EAAHRGEYIE VYKPVPQYYY VSRRGEDVSA GEVVLKRGRR IKPWDVGVLA SLGIKEVAVY
KVTAGLVSTG NELVELEDAP PPPGKIINST RHIITALLLE LGVKTTYLGI VPDDVDAIHG
VLKEALAKFD IVITTGGVSV GEPDHVVEAV RRLKPEVLVH GIAARPGRPN SAAVVGGKPV
IMLSGFPVAS IVGFEVFVKP VILHMVGARE EPLPVAVATL TRRVTTPINV RSLVRVRVFR
QGRELYAEPL AVTGSGVLST LTRGNGLLII PENREGYDEG DKVEIVLLGP IEEEK
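
The gene and protein sequences can be cross-checked by translating the CDS. The sketch below is a hand-rolled illustration in Python, not the database's own method. It relies on translation table 11 using the standard codon assignments, with alternative start codons such as GTG read as Met when they initiate the protein. The names `translate` and `gc_percent` are hypothetical helpers, and only a subset of the table 11 start codons is listed.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G). Translation
# table 11 uses these same assignments and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(cds)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        # An initiating codon is translated as Met regardless of its usual meaning.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


# First line of the gene sequence from the record.
first_line = "GTGAAGGGCT TTAAGACACT GATGCCGATA GCGGAGGCAC AGAGGGCGGT CATCAGCGCC"
print(translate(first_line))  # MKGFKTLMPIAEAQRAVISA
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.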