Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0629 |
Symbol | hppA |
ID | 6376350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 808444 |
End bp | 810669 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642681784 |
Product | membrane-bound proton-translocating pyrophosphatase |
Protein accession | YP_001957756 |
Protein GI | 189502039 |
COG category | [C] Energy production and conversion |
COG ID | [COG3808] Inorganic pyrophosphatase |
TIGRFAM ID | [TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATAATA ATATCTTATA CTTTTTGCCC TTAGTAGGGT TGTATATTTT GATTTTTACT GTCTGGCGTT CTAAATGGAT TAAAAAGCAA GATGCAGGAG AGGAGAAGAT GGTAGAAATT GCGCAACATA TTTCAAAAGG TGCTGCATCG TTTCTTAAAG CAGCTTATAA GAGTATTATT CCATTTGCAT TGGTAACTTG CCTGCTTTTG CTGGGCATGA GCTATCTACC TCATAGCCAT ACACACCCTA TTATTGCTAT TACCTTTTTA ATAGGTGCCT TCACGTCTGC AGCTGCTGGC TGGATAGGCA TGAAAGTAGC TACTCAAGCT AATGTAAGAA CTGCTCAAGC AGCGAGGTCT AGCCTTGCTA AGGCATTTAA AATATCTTTT ACAGGTGGTA CTGTTATGGG AATGGGTGTT ACTGGATTAG CTACCTTAGG TATTGGTTTA TTATTATGTG TTCTTTTCTA TCTTTTTGCA CCACAAGATA TTAATACTAC TAGTAGAGAT ACTCTATTGT TGATTTTAGA GACCATTACT GGTTTCTCAC TAGGAGCAGA GAGTGTGGCA CTATTTGCTA GGGTAGCAGG TGGTATTTAT ACCAAAGCTG CTGACGTAGG TGCGGACCTT GTTGGAAAAA TAGAGGCTGG TATTCCTGAA GACGATCCTC GAAATCCAGC TACTATTGCT GATAATGTGG GTGATAATGT GGGTGACGTT GCAGGTATGG GGGCAGACTT ATTTGGTTCT TATGTAGCTA CTATTTTAGC AGCGATGGTG CTAGGCAATG AAGTAGATAC TCTTCAACAG GCAGGAAGGT TACTTCCTAT TTTCTTCCCA CTATTTATTG GAGTGGCAGG CACTTTAATT TCTATATTAT CTAGCTTTAT AATAAAGATT AGTGAAGGAG GCAATGTGCA AAAAGCAATT AACAAGGGAA ATTGGGTTGC TACGGTATTG ATTGGTGTTG CAGCTTACTT TTTAGCTAAA AACTTTTTGC CAGATAGCTT AAGCATGCGA TCCCAAACTT TTACTAATCT AGATGTATTT TATGCGGTTA TTACTGGTTT ATTGGTGGGT GTGCTAGTGG GCAATATCAC ACAATATTTT ACAGGTATGG GGCAAGGACC TGTAAAATTT ATTATACAAC AGTCTAGTAC TGGACATGCG ACTAATATCA TTGCTGGCCT ATATGTAGGG ATGAGCTCAG TAGCAGTCCC TATGCTACTT TTTGCTGCTG GTATTTACAC TTCTTTCAAA TTTGCAGGAT TCTATGGTGT AGCAATAGCA GCAGCTAGTA TGATGGCTAC TACTATGTTG CAACTCTCTA TTGATGCTTT TGGGCCTATT GCTGATAACG CAGGTGGTAT TGCAGAAATG AGTGGACTTC CTCAAGATGT ACGTCAACGA ACAGATGTGT TAGATACAGT AGGCAATACA ACTGCTGCCA CTGGAAAGGG ATTTGCTATT GCATCAGCTG CCTTAACGTC ATTGGCGCTT TTTGCTGCTT ATGTTAAGAC AGCCAATATA GATTCTATTG ATATTTATAA GGCACCTGTA TTAGCAGGTT TGTTTGTGGG TGGTATGGTT CCTTTCTTAT TTTCTGCATT AGCTATTAAA GCAGTTGGTA AAGCAGCTAT GGCCATGGTG CATGAAGTAA GACGACAGTT TAGGGAAATT CCGGGTATTA TGGAAGGTAC AGCAAAGCCT GAATACGAGA AATGTGTACA AATATCTACA CAAGCAGCTC TTAGAGGAAT GTTACTCCCT GGTGCTCTGG CTATAGGTAC GCCTTTAGTA GTAGGTATGT TATACGGACC AGAAGTATTA GGGGGAGTAT TAGCAGGCAT TACTGTAAGT GGTGTACTCA TGGCTATGTT CCAATCTAAT GCAGGTGGCG CTTGGGATAA TGCCAAAAAG TCTTTTGAGA AAGGTGTAGA AATTGATGGG AAAATGTATT ATAAAGGCTC AGATCCTCAT AAAGCTTCTG TAACTGGTGA TACAGTTGGC GATCCCCTTA AAGACACTTC AGGGCCTTCT ATGAATATTC TAATTAAATT GGCATCTATT GTAGCTTTGG TAATAGCTCC TATTATAGCT TTGCCTGCTT CTAACAATGC TAAAGTGTAT AAAAAGACCA AACCAGCAAC TATACAACAG TCTAATACGA TTCATGCTGA TAGCGTAGTT AAGGATAAAA AGGTAGTTAA GGATAAAAAG CATTAA
|
Protein sequence | MYNNILYFLP LVGLYILIFT VWRSKWIKKQ DAGEEKMVEI AQHISKGAAS FLKAAYKSII PFALVTCLLL LGMSYLPHSH THPIIAITFL IGAFTSAAAG WIGMKVATQA NVRTAQAARS SLAKAFKISF TGGTVMGMGV TGLATLGIGL LLCVLFYLFA PQDINTTSRD TLLLILETIT GFSLGAESVA LFARVAGGIY TKAADVGADL VGKIEAGIPE DDPRNPATIA DNVGDNVGDV AGMGADLFGS YVATILAAMV LGNEVDTLQQ AGRLLPIFFP LFIGVAGTLI SILSSFIIKI SEGGNVQKAI NKGNWVATVL IGVAAYFLAK NFLPDSLSMR SQTFTNLDVF YAVITGLLVG VLVGNITQYF TGMGQGPVKF IIQQSSTGHA TNIIAGLYVG MSSVAVPMLL FAAGIYTSFK FAGFYGVAIA AASMMATTML QLSIDAFGPI ADNAGGIAEM SGLPQDVRQR TDVLDTVGNT TAATGKGFAI ASAALTSLAL FAAYVKTANI DSIDIYKAPV LAGLFVGGMV PFLFSALAIK AVGKAAMAMV HEVRRQFREI PGIMEGTAKP EYEKCVQIST QAALRGMLLP GALAIGTPLV VGMLYGPEVL GGVLAGITVS GVLMAMFQSN AGGAWDNAKK SFEKGVEIDG KMYYKGSDPH KASVTGDTVG DPLKDTSGPS MNILIKLASI VALVIAPIIA LPASNNAKVY KKTKPATIQQ SNTIHADSVV KDKKVVKDKK H
|
| |