Gene Information |
Locus tag | Pars_0763 |
Symbol | |
ID | 5055809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 681506 |
End bp | 683803 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640468322 |
Product | V-type ATPase, 116 kDa subunit |
Protein accession | YP_001153001 |
Protein GI | 145590999 |
COG category | [C] Energy production and conversion |
COG ID | [COG1269] Archaeal/vacuolar-type H+-ATPase subunit I |
TIGRFAM ID | |
|
|
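The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, not part of the original record: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon. The helper names gene_length and protein_length are hypothetical and introduced only for this example.

```python
# Values copied from the Gene Information table for Pars_0763.
START_BP = 681506
END_BP = 683803


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS length counts the stop codon unless told otherwise.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2298, matches "Gene Length"
print(protein_length(length_bp))  # 765, matches "Protein Length"
```

Under these assumptions, 2298 bp corresponds to 766 codons, of which the last is the stop codon, leaving the 765 residues listed in the record.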
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.27354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.871767 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCTTG AACGCGTTAT CGAGTTTAGA GTTGCTGGAA ACGTGGACGC CCTTCCTGAG CTGATCTATT TTCTCGGAAA GGCCGGTGTG GCTATGTTTG AGGAGCGGCC CGCCAAACTT CCGAGGCCGA GGGATCCCGC GCTTTTTCAG AAGGCAAAGA AGATCGACGA GGTCCTCAAC CAGCTCCTCC TGTATGTACA GCCTCGCCAG TTGAGCCTGC CTCTGGAGCC TCTGGAGTCT CAGGTAGACG CAGTGCTGGA GAAGCTGGCC GCTCTGCAGA AGGAGGTGTC TTACTACTTA AAGCTCGTCG ACGAGCTTAA GGCTAAGCTA TCTGTATCAC GCGAGGTGGC GGCTCTGAGG GCGGCGTCGG TCCCGAAGAC GGAGGTGTTG GAGACGCTGG TGGCCTTGCC GGGAAAGTCG GTAAAGGAGG CGGCGGAGCT TGTAAAAACT TTCAACGCAT CGGCGATGCA GTATGGCAAC GCGTTGATTA TAGCGGTGAG CAGAGAAAAG GCGAGGCAAC TGAGGGCCGG TTTGGAGAGA CTGGGGGCTA GGGTCTTCTC CCTCTGGGAG ATAGCCGAGC TGGAGCCTCC AGAGGCCTTG CAGGATAGGC TGAAGAAGGC GGAGGAGGAA CTCGCCTCTC TTGTTCAGAA ACATAGCGAT TTGATAAACT ACGCTTACAC TCTCAGGTAT GCGGTAGGCG CCGTTATGGA CGTCTACAAC AAGTCGGCAA TAGATGAGGG GTCCGAGGTA GGCCGCCTAT TCGCCTCCTA CGAGAAGGAG ATAGAGAAAG TTGAGAAGCA GTTGTCTGAT CTCCGCAAGA TTAAGCTCGT GTTGGACTCC TTGGGAGGCG GCTTCAAACT CCCGGAAGGC TTCAGGATGT ACGTAGACCC CGAGACCCCC ATCGCGGCGC CGCATGTATT GCAGGAGGTC GGCGGCGTCA AGGTGGCGCT TGTGAGGGGG GAGGCGAGGG GGGTAGAGGT GCCTCCTGAA TACCTCGCCG ACGTGGAGGC GGGGAGGAGG GTTGTGGAAG ACGCAATTAG GTCGGCAGAG GCCTCTCTGC AGAGATTGAG GAGGGATCTG GAGGCCTTAG AGAGGCAGTA CTCCGAGTAC TCGCTCTACG GCGACAAGAA GTGGGAGGAG CACAATGACA TGGCTAGCTT GGTTTTCTAC GTGTTGGAGA AGGACGTGAA GAAGGTGGAC GAGGCGCTGT CTGAATTTGC GGCGCGTAAC ATCGCCAAGC TGGATGTGGT GAGGAGGACT CGCTACAAGT ACTTTGACCA AGTCCCGGCA GAGAGGCGCC CCACGTTGGA GAAGTACCCC ACGCCGATTA GGCAGTTCAC AAAGATTGTC TACATGTACG GCGTGCCTAG GCCTTACGAG ATCAGCCCTG TGCCGCTGGC GGCGCTCCTC TTTCCCATCT TCTTCGGCTG GATGTACGGC GACTTGGGCC ACGGCTTTTT GCTCTTCCTG CTCGGCGTGT TGCTCATGAA ACGGCTGTAC GGCGGCCGGT ACAAGGACTG GGGCATTATA TGGGCGCTGA CTGGGGCTGT GTCGATGTTT TTCGGCGCCT TTGTGTACCA TGAGGCGTTT GGCTTTTCCC TGGAAAAGCT TGGAATAGAA TTGCCTACAG CACCTCTCTT TCACATGTTT GGAGAGCACC AGCTCGTCTT GGTTGAGGGC GTAGTTGTGG CGATAGGCGC GGCGTTTGTA CTAGGCTTCT TGTTGATCTT CTTGGCGTTT CTCTCAAAAA TCGTCAACAC TGTGCTTAAG GGAGAGGCAG ATGTGGCGCT GGGGATAGTT 
CTGCCGCAGA CTCTGCTCTT TCTCTCCTTT GCCATGGTGT TCTTCTCGCT TGTGAAGGAC GCGTTGCACC TGACATTTCT CACGCCGGTA GTGTCGTTGC CGTGGCCCTA CGTGCTTGTG GGGTCTCTAG TCTGGAGCGG CGTAGGCACA TTTGTGCTTA GGGCCAGGTA CAAACACCAC GAGGAGGCCC CGCCTATAAC TGAGGAGTTT ATCGTCGGCA TAGTCGAGGG GGCGCTGGGC GCCCTTGCCA ATATCCCCAG CTTCGCTCGT CTTGTAATAC TAATCCTGAT ACACGGCGTT TTGACAAAAC TGGTGAACGG GCTCGCCATT GCGCTGGGAC CGGCCGGGGT ATTATTCGCC GTGTTTGGGC ACTCCCTAAT AGCCGCGGCT GAGGGCTTGT TCTCCACGGT ACAATCGCTC CGTCTAATAT TCTACGAGGT GTTGTCGAAG TTCTACGAGG GGAGGGGCCG CCTGTTCACC CCGCTGGCGT TGCCTTAA
|
Protein sequence | MPLERVIEFR VAGNVDALPE LIYFLGKAGV AMFEERPAKL PRPRDPALFQ KAKKIDEVLN QLLLYVQPRQ LSLPLEPLES QVDAVLEKLA ALQKEVSYYL KLVDELKAKL SVSREVAALR AASVPKTEVL ETLVALPGKS VKEAAELVKT FNASAMQYGN ALIIAVSREK ARQLRAGLER LGARVFSLWE IAELEPPEAL QDRLKKAEEE LASLVQKHSD LINYAYTLRY AVGAVMDVYN KSAIDEGSEV GRLFASYEKE IEKVEKQLSD LRKIKLVLDS LGGGFKLPEG FRMYVDPETP IAAPHVLQEV GGVKVALVRG EARGVEVPPE YLADVEAGRR VVEDAIRSAE ASLQRLRRDL EALERQYSEY SLYGDKKWEE HNDMASLVFY VLEKDVKKVD EALSEFAARN IAKLDVVRRT RYKYFDQVPA ERRPTLEKYP TPIRQFTKIV YMYGVPRPYE ISPVPLAALL FPIFFGWMYG DLGHGFLLFL LGVLLMKRLY GGRYKDWGII WALTGAVSMF FGAFVYHEAF GFSLEKLGIE LPTAPLFHMF GEHQLVLVEG VVVAIGAAFV LGFLLIFLAF LSKIVNTVLK GEADVALGIV LPQTLLFLSF AMVFFSLVKD ALHLTFLTPV VSLPWPYVLV GSLVWSGVGT FVLRARYKHH EEAPPITEEF IVGIVEGALG ALANIPSFAR LVILILIHGV LTKLVNGLAI ALGPAGVLFA VFGHSLIAAA EGLFSTVQSL RLIFYEVLSK FYEGRGRLFT PLALP
|