Gene Pars_1859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1859 
Symbol 
ID5056008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1661857 
End bp1663776 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content56% 
IMG OID640469405 
Productputative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein 
Protein accessionYP_001154062 
Protein GI145592060 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme
[COG1910] Periplasmic molybdate-binding protein/domain 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.693636 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGA GAGTTATATT CCACGACCTG GTTACGCTGG AGCAAGCTTC GGAGATTTTG 
CTAAAGTTTG CAAAGCCGCT GGGGGAGGAG GAGGTGGACA TTGTTGCGTC GTATGGCCGG
GTGCTGGCCC GTGATGTAGT TGCGCCTATT GACGTGCCGC CTTTCGACCG CTCTACCGTA
GATGGGTTTG CAGTGGTGGC CGCGTCCACA TATGGGGCTT CTGAACTTAC GCCAGTGGAG
CTTAGGCTAG TCGGCAGGGT GGAAGCCGGC GGTTGGCCTC AGGGAGAGGT GAAGGCTGGT
GAGGCCTACG AGGTGGCAAC CGGCGCGCCG ATACCCAGGG GTGCAGACTC TGTTGTAATG
GTTGAGTACA CCCAGGAGAG GGATGGTGTA GTAAGGATTT TCCGACCGGT GGCGCCTGGG
GAGAACTTAA TGAGCGCGGG GTCGGACATT TCAGCTGGGG AGGTGGTGCT GAGACGTTGC
ACAAGACTCA CGGCCAGGGA AATAGGCGTA TTGGCCGCGC TGGGCATGAG GAAGGTAAGA
GTCATAAAAA GGCCTAAGGT TGGGATAATC TCGACGGGCG ACGAGCTGAC ACCGCCGGGG
AAGCCGCTTG GCCCGGGCAA ACTGTACGAC GTAAACACTT ACACCCTAAT AGCGGCTGTT
GCAGAAGCCG GCGGAGAGCC GATTCCATAC GGTATTGTGG AAGATGTAGA AGAGAGCTAC
CGTGCCGCGA TCGCCAAGGC TCTTTCTGAA ACAGACGTGG TTCTCATAAG CGGGGGGACG
TCGGCGGGCG TCGCAGACCT CACATACAGA GTACTCGGCG AATTGGGCGA CGTGCTCTTC
CACGGCGTGA TGGTCAAGCC AGGAAAGCCC ACTCTGGCCG CAGTTGTCAA CGGGAAAATA
GTCGTAGGCC TGCCGGGGTA TCCCTCCTCT GCCTTGATGA TCTTCCACAC AATAGTAAGA
CCCTTCCTTC TAAGACTACA GTGCCTAGAA CCTATGCCCC CCGCCGTGTA TAAGGCGAGG
TTGGCGTACG GCATAGAGGG GGCAAAGGGA AGGCGTGCTT TATACCCAGT AGTCCTCATC
GCGAGGAGGT CTGAGTATAG GGCCTATCCC CTCTACGCGG AGTCGGGGGC AATATCGGTG
CTGGCGAGGG CCGACGGCTA CATAATAGTG CCGGAAAACG TCGAGTTTAT GTCAGAGGGG
GAGGAGGTGT ATGTTTACCT TTTCGAGAAG TATAAGCCCT CTGACCTCTA CTTCATCGGT
AGCCACGACC CCCACCTAGA CGCAGTGCTC GCCAGACACA ATGTCAAGAC GGTATACGTC
GGATCTTTGG GCGGCCTAAT GGCGTTAAAG AGGGGCGAGG CCGACATGGC GGGAGCACAC
ATATACGATC CCGAGACCAA CGCCTATAAC GTCCCCTACG TTAAGAAGTT GAGGATTACA
AACGTCGCCG TGGTAGGGCT ATACAAGAGG GAGCAGGGGC TAATCGTGAA GAGGGGTAAC
CCCAAGGGGA TAAGGGGGGT TGAAGACCTT TTGAGAGGCG ACGTGGTGTA TGTAAATAGA
CCAAGAGGCA CAGGTACGAG GGCCCTCCTA GACTTGCTTC TTTCCGAGCT GGCGGAGAAG
ATGGGCACCA CGCTGGAGTC GTTGGCTAAA AAAATTAGGG GCTACACCTA TGAGGTGAAG
ACACACACAG CCGTCGCTGC CGCCGTAGCC CAGGGCAGAG CCGACGTGGG CCTCGGGGTG
AGATACGCCG CCGAGCTCTA CGGGCTTGAC TTCATACCCA TAGGCTGGGA GGAGTACGAC
ATAGTCGTGA GAAAATCCGT CTTAGACAAG GCTATGGAAA TTGTAGAAGA GGCTCTTGAG
AACCTTCCGC CAGGGTACCA GCCATATGAA CACTCAAGAA AAATAAAATT CGAGAATTAG
 
Protein sequence
MSKRVIFHDL VTLEQASEIL LKFAKPLGEE EVDIVASYGR VLARDVVAPI DVPPFDRSTV 
DGFAVVAAST YGASELTPVE LRLVGRVEAG GWPQGEVKAG EAYEVATGAP IPRGADSVVM
VEYTQERDGV VRIFRPVAPG ENLMSAGSDI SAGEVVLRRC TRLTAREIGV LAALGMRKVR
VIKRPKVGII STGDELTPPG KPLGPGKLYD VNTYTLIAAV AEAGGEPIPY GIVEDVEESY
RAAIAKALSE TDVVLISGGT SAGVADLTYR VLGELGDVLF HGVMVKPGKP TLAAVVNGKI
VVGLPGYPSS ALMIFHTIVR PFLLRLQCLE PMPPAVYKAR LAYGIEGAKG RRALYPVVLI
ARRSEYRAYP LYAESGAISV LARADGYIIV PENVEFMSEG EEVYVYLFEK YKPSDLYFIG
SHDPHLDAVL ARHNVKTVYV GSLGGLMALK RGEADMAGAH IYDPETNAYN VPYVKKLRIT
NVAVVGLYKR EQGLIVKRGN PKGIRGVEDL LRGDVVYVNR PRGTGTRALL DLLLSELAEK
MGTTLESLAK KIRGYTYEVK THTAVAAAVA QGRADVGLGV RYAAELYGLD FIPIGWEEYD
IVVRKSVLDK AMEIVEEALE NLPPGYQPYE HSRKIKFEN