Gene Pars_1108 details

Gene Information

Locus tag: Pars_1108
Symbol:
ID: 5055970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 994887
End bp: 996491
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 65%
IMG OID: 640468664
Product: AAA ATPase
Protein accession: YP_001153338
Protein GI: 145591336
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
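
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the database record.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 994887
end_bp = 996491
gene_length_bp = 1605
protein_length_aa = 534

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span // 3
coding_codons = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", span % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

If any of the three checks printed False, that would usually point to a different coordinate convention or a partial gene rather than an error in the sequence itself.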


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00107728
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATGTG GGTTCGTATC GCGGCAGTTC CCGGCTGAGG TTTCTTTCGA CGAGGCTAAG 
ACCTACGCCG TCTTGACTGG TTGCGGCTTG GAGGTGGGGT CCTACGTGGT GGTTGAGGGG
GGTGGGAGGA GGTACCTGGC CTCGGTGTCT AAGCTCCAGG TGGCGGACAT ATACGCCGTG
GCGAAGACGC CGGTGCTGAC GCCGGAGCAG GAGAAGGCGG TTTCCCTGAG GCTGGGCCCC
TTGATTGCCG AGCTGGAGAT CCTCTCGGAG TGTTTCGGCA ACTCGTGCGG CCCGCCCGCG
ACGCCGGTGC CCATCCACTC CTCTGTGAGG CCTCCTGCCC CTGGGGAGGT GGGGGAGATG
CTGGGCCTGC CGCGGGAGGG GGTTGTCCTC GGCGCGTTGG CCCTGCCCTC GGGTCGGGCG
GTGGAGGGGG AGGTGGTTAG GCTCCCGCTG GAGGCGCTTA GGCACCACGT CTTGGTGGTG
GGGACTACTG GGAGTGGGAA GACGGTGTTG GTGAAGGAGA TTGCAAAGCA GCTGGCGGGT
CAGCAGGTGG TTGCCCTAGA CGCCGTGGGC CACTTCTACC ACTTGGCGTA CAGCGGGGTG
AGGGTGAGGG TGGTGCTCCC CGTCACTAGG CAGATGGCGC GTCGCGGGGC GAGGGCGGTG
GTGAGGAGGG TGGTGAAGAC GGCGGCGTGG GGGAGCAGGG CCCGCTTCAA GGCGAGGGTT
AGGTTCAAGA AGCAACGGGC TACTGGGGAG GTGTACCTGG CGGGGGCGGT GGTGGAGGTT
GAGTCGCCCC GCGGCCGGGG GGTTTTCGAG GTGGTTCCCT GGGCGCTGGA GAGCAGGCGT
ATTTTGAGGG ACCTCCCCCG GGCTATCCCC ATCCTCTCGC AACAGGCTAG GATTTTCTAC
CAGAGGGTTT TGAAGAGGGC GGTGGAGCTC AGTGGGCAGA AGACGGCGGA GGGGCTTTAC
GAGTTTTTGA CCTCGCCGGC TGAGGGGGAG AGGAGGCCGG TGGTGATGTA CGAGAAGCTG
GGCATGGAGC TGGGGCTTCA CTCCAGCACT ATGGAGAACA TCGTGAGGGC TCTCCTGGCG
CTGGTGGAGA CGGGGCTTGT GGACGTAGTG GGGGGCGGGT TCAGGGTGGT GGAGCCGCGG
TACGACTTCT CGGGGTACAC GGTGGTGGAC ATATCTCGCC TCAACGTGCA CCAGCAGAGG
CTTGTGGTGT ACCGCATCTT AGACGCGGTG TATAGGAGGG CGCGGCCGAC CACCGCCGTC
CTCATCGACG AGGCCCACCT CTTCTTCCCC CAGACCCGGA GCGAGGACGA GCAGGCCTTT
ATAGAGGGGC ACTTGACGCG GCTTACGAGG CTTGGGAGGG CGAGGGGCAT CGCAGTGGTC
TTCGCCACCC ACATGCCCAC CGACCTAAAC GACGTGGTGG TGCAGCTGGC TAACACCAAG
GTTATCCTCC GTAGCGACCT CAAGATTTTG GACAGGCTGG ACGTCCCCGC GAAGGACAGG
CGGTTCCTCG CCGTGGCGGA CAAGGGCATC GCCTACGTCA GGAGCTACGC CTATAGGCAC
CCTATATACG TCAAGGTGCA GAAGACCGTT GCCCACTTCG GCTGA
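
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be cleaned before any composition statistic such as GC content can be computed. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here to keep the example short, so the result is not expected to equal the 65% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as formatted in the record.
first_line = "ATGACATGTG GGTTCGTATC GCGGCAGTTC CCGGCTGAGG TTTCTTTCGA CGAGGCTAAG"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full multi-line gene sequence to `clean_sequence` works the same way, because `str.split()` with no arguments splits on any run of whitespace, including newlines.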
 
Protein sequence
MTCGFVSRQF PAEVSFDEAK TYAVLTGCGL EVGSYVVVEG GGRRYLASVS KLQVADIYAV 
AKTPVLTPEQ EKAVSLRLGP LIAELEILSE CFGNSCGPPA TPVPIHSSVR PPAPGEVGEM
LGLPREGVVL GALALPSGRA VEGEVVRLPL EALRHHVLVV GTTGSGKTVL VKEIAKQLAG
QQVVALDAVG HFYHLAYSGV RVRVVLPVTR QMARRGARAV VRRVVKTAAW GSRARFKARV
RFKKQRATGE VYLAGAVVEV ESPRGRGVFE VVPWALESRR ILRDLPRAIP ILSQQARIFY
QRVLKRAVEL SGQKTAEGLY EFLTSPAEGE RRPVVMYEKL GMELGLHSST MENIVRALLA
LVETGLVDVV GGGFRVVEPR YDFSGYTVVD ISRLNVHQQR LVVYRILDAV YRRARPTTAV
LIDEAHLFFP QTRSEDEQAF IEGHLTRLTR LGRARGIAVV FATHMPTDLN DVVVQLANTK
VILRSDLKIL DRLDVPAKDR RFLAVADKGI AYVRSYAYRH PIYVKVQKTV AHFG
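
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this on the first and last lines of the gene sequence. It builds the codon table from the standard NCBI ordering of bases (TCAG); translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs only in the permitted start codons. The function name `translate` and the variable names are hypothetical, and the code assumes the input starts in frame.

```python
from itertools import product

# Standard codon assignments, listed in NCBI order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence from the record.
# Each full line holds 60 bases, so every line begins in frame.
gene_start = "ATGACATGTG GGTTCGTATC GCGGCAGTTC"
gene_end = "CCTATATACG TCAAGGTGCA GAAGACCGTT GCCCACTTCG GCTGA"

print(translate(gene_start))
print(translate(gene_end))
```

The two printed fragments can be compared against the beginning and end of the protein sequence listed in the record.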