Gene Pars_0763 details


Gene Information

Locus tag: Pars_0763
Symbol:
ID: 5055809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 681506
End bp: 683803
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 57%
IMG OID: 640468322
Product: V-type ATPase, 116 kDa subunit
Protein accession: YP_001153001
Protein GI: 145590999
COG category: [C] Energy production and conversion
COG ID: [COG1269] Archaeal/vacuolar-type H+-ATPase subunit I
TIGRFAM ID:
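
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the record:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(681506, 683803)
print(length_bp, "bp")                   # 2298 bp
print(protein_length(length_bp), "aa")   # 765 aa
```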


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.27354
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.871767
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCTTG AACGCGTTAT CGAGTTTAGA GTTGCTGGAA ACGTGGACGC CCTTCCTGAG 
CTGATCTATT TTCTCGGAAA GGCCGGTGTG GCTATGTTTG AGGAGCGGCC CGCCAAACTT
CCGAGGCCGA GGGATCCCGC GCTTTTTCAG AAGGCAAAGA AGATCGACGA GGTCCTCAAC
CAGCTCCTCC TGTATGTACA GCCTCGCCAG TTGAGCCTGC CTCTGGAGCC TCTGGAGTCT
CAGGTAGACG CAGTGCTGGA GAAGCTGGCC GCTCTGCAGA AGGAGGTGTC TTACTACTTA
AAGCTCGTCG ACGAGCTTAA GGCTAAGCTA TCTGTATCAC GCGAGGTGGC GGCTCTGAGG
GCGGCGTCGG TCCCGAAGAC GGAGGTGTTG GAGACGCTGG TGGCCTTGCC GGGAAAGTCG
GTAAAGGAGG CGGCGGAGCT TGTAAAAACT TTCAACGCAT CGGCGATGCA GTATGGCAAC
GCGTTGATTA TAGCGGTGAG CAGAGAAAAG GCGAGGCAAC TGAGGGCCGG TTTGGAGAGA
CTGGGGGCTA GGGTCTTCTC CCTCTGGGAG ATAGCCGAGC TGGAGCCTCC AGAGGCCTTG
CAGGATAGGC TGAAGAAGGC GGAGGAGGAA CTCGCCTCTC TTGTTCAGAA ACATAGCGAT
TTGATAAACT ACGCTTACAC TCTCAGGTAT GCGGTAGGCG CCGTTATGGA CGTCTACAAC
AAGTCGGCAA TAGATGAGGG GTCCGAGGTA GGCCGCCTAT TCGCCTCCTA CGAGAAGGAG
ATAGAGAAAG TTGAGAAGCA GTTGTCTGAT CTCCGCAAGA TTAAGCTCGT GTTGGACTCC
TTGGGAGGCG GCTTCAAACT CCCGGAAGGC TTCAGGATGT ACGTAGACCC CGAGACCCCC
ATCGCGGCGC CGCATGTATT GCAGGAGGTC GGCGGCGTCA AGGTGGCGCT TGTGAGGGGG
GAGGCGAGGG GGGTAGAGGT GCCTCCTGAA TACCTCGCCG ACGTGGAGGC GGGGAGGAGG
GTTGTGGAAG ACGCAATTAG GTCGGCAGAG GCCTCTCTGC AGAGATTGAG GAGGGATCTG
GAGGCCTTAG AGAGGCAGTA CTCCGAGTAC TCGCTCTACG GCGACAAGAA GTGGGAGGAG
CACAATGACA TGGCTAGCTT GGTTTTCTAC GTGTTGGAGA AGGACGTGAA GAAGGTGGAC
GAGGCGCTGT CTGAATTTGC GGCGCGTAAC ATCGCCAAGC TGGATGTGGT GAGGAGGACT
CGCTACAAGT ACTTTGACCA AGTCCCGGCA GAGAGGCGCC CCACGTTGGA GAAGTACCCC
ACGCCGATTA GGCAGTTCAC AAAGATTGTC TACATGTACG GCGTGCCTAG GCCTTACGAG
ATCAGCCCTG TGCCGCTGGC GGCGCTCCTC TTTCCCATCT TCTTCGGCTG GATGTACGGC
GACTTGGGCC ACGGCTTTTT GCTCTTCCTG CTCGGCGTGT TGCTCATGAA ACGGCTGTAC
GGCGGCCGGT ACAAGGACTG GGGCATTATA TGGGCGCTGA CTGGGGCTGT GTCGATGTTT
TTCGGCGCCT TTGTGTACCA TGAGGCGTTT GGCTTTTCCC TGGAAAAGCT TGGAATAGAA
TTGCCTACAG CACCTCTCTT TCACATGTTT GGAGAGCACC AGCTCGTCTT GGTTGAGGGC
GTAGTTGTGG CGATAGGCGC GGCGTTTGTA CTAGGCTTCT TGTTGATCTT CTTGGCGTTT
CTCTCAAAAA TCGTCAACAC TGTGCTTAAG GGAGAGGCAG ATGTGGCGCT GGGGATAGTT
CTGCCGCAGA CTCTGCTCTT TCTCTCCTTT GCCATGGTGT TCTTCTCGCT TGTGAAGGAC
GCGTTGCACC TGACATTTCT CACGCCGGTA GTGTCGTTGC CGTGGCCCTA CGTGCTTGTG
GGGTCTCTAG TCTGGAGCGG CGTAGGCACA TTTGTGCTTA GGGCCAGGTA CAAACACCAC
GAGGAGGCCC CGCCTATAAC TGAGGAGTTT ATCGTCGGCA TAGTCGAGGG GGCGCTGGGC
GCCCTTGCCA ATATCCCCAG CTTCGCTCGT CTTGTAATAC TAATCCTGAT ACACGGCGTT
TTGACAAAAC TGGTGAACGG GCTCGCCATT GCGCTGGGAC CGGCCGGGGT ATTATTCGCC
GTGTTTGGGC ACTCCCTAAT AGCCGCGGCT GAGGGCTTGT TCTCCACGGT ACAATCGCTC
CGTCTAATAT TCTACGAGGT GTTGTCGAAG TTCTACGAGG GGAGGGGCCG CCTGTTCACC
CCGCTGGCGT TGCCTTAA
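
The gene sequence is listed in blocks of ten bases, so the whitespace has to be removed before the sequence is analysed. A minimal sketch, assuming plain A/C/G/T input, of how a GC content figure such as the one in the record could be recomputed; the helper names are illustrative and the example uses only the first line of the sequence:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only; paste the full listing to check the whole gene.
first_line = "ATGCCTCTTG AACGCGTTAT CGAGTTTAGA GTTGCTGGAA ACGTGGACGC CCTTCCTGAG"
print(f"{gc_percent(first_line):.1f}% GC over {len(clean_sequence(first_line))} bases")
```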
 
Protein sequence
MPLERVIEFR VAGNVDALPE LIYFLGKAGV AMFEERPAKL PRPRDPALFQ KAKKIDEVLN 
QLLLYVQPRQ LSLPLEPLES QVDAVLEKLA ALQKEVSYYL KLVDELKAKL SVSREVAALR
AASVPKTEVL ETLVALPGKS VKEAAELVKT FNASAMQYGN ALIIAVSREK ARQLRAGLER
LGARVFSLWE IAELEPPEAL QDRLKKAEEE LASLVQKHSD LINYAYTLRY AVGAVMDVYN
KSAIDEGSEV GRLFASYEKE IEKVEKQLSD LRKIKLVLDS LGGGFKLPEG FRMYVDPETP
IAAPHVLQEV GGVKVALVRG EARGVEVPPE YLADVEAGRR VVEDAIRSAE ASLQRLRRDL
EALERQYSEY SLYGDKKWEE HNDMASLVFY VLEKDVKKVD EALSEFAARN IAKLDVVRRT
RYKYFDQVPA ERRPTLEKYP TPIRQFTKIV YMYGVPRPYE ISPVPLAALL FPIFFGWMYG
DLGHGFLLFL LGVLLMKRLY GGRYKDWGII WALTGAVSMF FGAFVYHEAF GFSLEKLGIE
LPTAPLFHMF GEHQLVLVEG VVVAIGAAFV LGFLLIFLAF LSKIVNTVLK GEADVALGIV
LPQTLLFLSF AMVFFSLVKD ALHLTFLTPV VSLPWPYVLV GSLVWSGVGT FVLRARYKHH
EEAPPITEEF IVGIVEGALG ALANIPSFAR LVILILIHGV LTKLVNGLAI ALGPAGVLFA
VFGHSLIAAA EGLFSTVQSL RLIFYEVLSK FYEGRGRLFT PLALP
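
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code; the two differ in which codons may act as start codons, and that case is not handled here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Compact genetic code layout: codons ordered with bases T, C, A, G,
# first base varying slowest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCCTCTTG AACGCGTTAT CGAGTTTAGA"))  # MPLERVIEFR
```

Translating the last 18 bases of the gene, CCGCTGGCGT TGCCTTAA, gives PLALP followed by the stop codon, which matches the end of the protein sequence.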