Gene Pars_1795 details

Gene Information

Locus tag: Pars_1795
Symbol:
ID: 5055723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1611632
End bp: 1612852
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 54%
IMG OID: 640469340
Product: MoeA domain-containing protein
Protein accession: YP_001153998
Protein GI: 145591996
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
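
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The Python sketch below assumes that the coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1611632, 1612852)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.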


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.213804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0671685
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTATC TCAGCGGCCT TAATCCTGTG GCGAAGGTTG CAGAGGTGTT GCCTCTGGTG 
AAAAGAGTAG ACCAAGTAGA AAAGGTGGCC ACGTGGGACG CCGTCGGCAG AGTTGTTGCA
AAGGACGTAA CAGCTCCTCA CGACTATCCC CCGCTTCCCA GGGCTTCCTA CGACGGATAC
GCTGTTAACT CAGAGGCGAC GCCTGGCAGA TTCAGGGTAG TGGGCACAGT CCTGGTAGGT
CAGTATCGGA GAGATATCGA AGTTAAGCCT GGGGAGGCTG TGTATGTAAC AGTCGGCGCA
TTTCTCCCAG AGGGGGCCGA CGCCGTGGTG CCCGAGGAGG CTGTGGAACG TGAAGGAGAC
TTCGTGGTAG TAAAGTCAAG ATTTGAGAAG TACGCCAATG TAGACCCACC GGGGTCCTAC
GTGCGGAAGG GAACAGTCAT GGCGAGTCAA GGTACCGTGC TGACCCCCTT TGACGTGGTC
GGCCTTCTAG ATGTTGGGAT AACCGCAGTT TACGCGTACA GGAGGTTGAG AGTGGGCATA
ATCGCCACGG GGGACGAGCT TATAGTTCCG CCAATTGACC CAGAAGTGGC AACTGAGCTG
GTTTTGAAAG GGAAGGTAAT TGAGTCGACA GCATCTCTAG TGGCGTGGTA CATTGATACA
TACATGCCAT ATGTGAAGGT AGAGGAAAGG GTGGTGACAG GCGATAAGCA CGAAGAGGTG
CGCTTCTATG TAGACAAATT CCTAGAAAAT TACGATGCTG TGATAATCAC CGGCGGCGCA
GGGCCAAGTG AAATTGACCA CTTCTACAAG CTGGGGTTCA GCGGTCTGAG AGGGTTTAGA
ATGAAGCCGG GTAGGCCGAC CAGCGTTGCC GTGATCAACG GGAAGCCTGT CTTCGGCCTC
TCTGGCTATC CCATTAGCGC TCTACACGGC GTCATAAGAA TAGTAGAGCC CGTGTTGCGC
CACATGGCTA ACGTGACGAG ACCGCCTGGT AGCGGATGGG TATACGCCAC GATGGCTCAA
GACGTCCAGG GAGAGATGGC CCAGATAGTC AGAGTGAAGC TGGAGATAAG TGAGGGGGAG
TTATTAGCCA GGCCGATTAA GGCGAAACAC CACTCATTCA CAGAGCCTGA GACGTGTGGT
GTGGCGCTAA TACCGCCTGG AGGAACGAAA AAGGGCGACG TGGTGCCGGT GTTGGTATTT
CGCGACGTCA GGAAGCTCTA A
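
The gene sequence is printed in space-separated blocks of ten bases, so the blocks have to be joined before anything is computed from them. As a minimal sketch with hypothetical helper names, the Python below strips the whitespace and computes the GC fraction, the quantity behind the GC content field. Only the first line of the sequence is used for brevity; a whole-gene value would need all lines of the sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGGTATC TCAGCGGCCT TAATCCTGTG GCGAAGGTTG CAGAGGTGTT GCCTCTGGTG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # GC fraction of this first line only
```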
 
Protein sequence
MRYLSGLNPV AKVAEVLPLV KRVDQVEKVA TWDAVGRVVA KDVTAPHDYP PLPRASYDGY 
AVNSEATPGR FRVVGTVLVG QYRRDIEVKP GEAVYVTVGA FLPEGADAVV PEEAVEREGD
FVVVKSRFEK YANVDPPGSY VRKGTVMASQ GTVLTPFDVV GLLDVGITAV YAYRRLRVGI
IATGDELIVP PIDPEVATEL VLKGKVIEST ASLVAWYIDT YMPYVKVEER VVTGDKHEEV
RFYVDKFLEN YDAVIITGGA GPSEIDHFYK LGFSGLRGFR MKPGRPTSVA VINGKPVFGL
SGYPISALHG VIRIVEPVLR HMANVTRPPG SGWVYATMAQ DVQGEMAQIV RVKLEISEGE
LLARPIKAKH HSFTEPETCG VALIPPGGTK KGDVVPVLVF RDVRKL
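
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce it, assuming the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ in their permitted start codons). The `translate` helper is illustrative, and only the first and last few codons of the gene are used.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA up to the first stop codon, ignoring whitespace.

    Raises KeyError on ambiguous bases such as N; trailing bases that
    do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGGTATC TCAGCGGCCT"))  # MRYLSG (start of the protein)
print(translate("AAGCTCTAA"))              # KL (end of the protein, then stop)
```

Feeding the full gene sequence to the same function should allow a residue-by-residue comparison with the protein sequence listed in this record.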