Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2013 |
Symbol | hppA |
ID | 6315868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2122042 |
End bp | 2124003 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 642644401 |
Product | membrane-bound proton-translocating pyrophosphatase |
Protein accession | YP_001918168 |
Protein GI | 188586623 |
COG category | [C] Energy production and conversion |
COG ID | [COG3808] Inorganic pyrophosphatase |
TIGRFAM ID | [TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000446892 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAG TAGCAGCACC AATTGCAGGA ATTATAGGGT TAGTATTTGC TTTTTATTTG ATCAATCAAG TGAACAAGCG TGAAGAAGGC TCTGAAAGGA TGCAGGAACT AGCTGCAGCC ATTCAAGAGG GTGCCAGTGC ATTCCTGCGT AGGGAATACC GGAGTTTGGC AGTTTTTGTG ATAGTATTGT TTGTTGTGAT AACAATTGCT ATTGATATAC AAACTGCCAT TTCCTTCTTA GTAGGTGCCC TGTTTTCAGG TACAGCTGGT TTTATCGGTA TGACTGTGGC TACTCGAGCT AATGTGAGAA CAGCAAATGA CGCACAGAAT GAAGGGATTG CCCAAGCCCT AAGGACAGCC TTTTCAGGCG GGTCTGTCAT GGGTATGTCT GTTGTAGGCT TAGGTATTTT GGGTGTAGGA ATCTTGTATT TGGTTTTTGA AGATCCTGAA ATTGTTAATG GTTTTGCCTT AGGTGCTAGT TCCATTGCAT TATTCGCGCG TGTGGGCGGA GGTATATATA CTAAAGCCGC CGATGTTGGA GCTGACCTTG TTGGTAAGGT AGAAGCTGGC ATTCCCGAAG ATGATCCCAG GAACCCTGCG GCAATTGCCG ACAATGTTGG TGATAATGTA GGGGACGTTG CAGGAATGGG TGCAGATCTA TTTGAATCCT TTGTAGGATC TATTATAGCA GCTATGACCA TTGGCCTTGT CCAATATGGT GTTGAAGGGG TTCTATTACC CATACTACTA GGATCAGTTG GAGTAATTGC TTCAATTATC GGCTATTTCT TTGTTAGAGT CCGAGAAGGA CAGAAACTAG CCTCTGCCTT AGAAAGAGGG ACCATAGTTA GTGCTATTAT AGTAGTGATT GCTGCCTTTA TTTTAACTAA CAATCTATTG GGAGAGTTAG GACCTTTTTA CGCCATTTTA GCTGGCTTAC TTGCTGGAAT TTTAATTGGT AGAATTACTG AATATTACAC CTCAGAACAT TATGCACCAG TAAAGGGAAT TGCAGATTCT TCAAGAACAG GAGCAGCTAC TAACATTATC AGTGGTATAG CTGTTGGAAT GAAAAGTACT TTCTTACCCA TTATCGTTAT AACAGTAGCT ATATTTATAG CACATCAAGT TGCGGGCTTA TACGGAATTG CTATAGCTGC TGTTGGAATG CTAGCAACAG TTGGTATGAC AATTGCTGTT GATGCTTACG GTCCTGTAGC TGATAACGCA GGTGGTATTG CAGAAATGGC TGATTTAGAT CCTGAAGTAC GAGAAATCAC TGACGAGCTG GATGCAGTAG GGAACACTAC TGCAGCAATT GGAAAAGGAT TTGCCATCGG ATCAGCAGCC CTAACAGCTC TAGCATTATT TAGTGCATAT ACTCAAGCAG CAGATATTGA TAATATTGAC TTGACAAGTG CACCAGTAAT TATTGGTCTA TTGCTAGGCG GAATGTTACC ATTCCTTTTC TCTGCTCTAA CTATGAATGC CGTTGGCCAA GCTGCTAACC AGATGATAGA CGAGGTTAGA AGACAAATTA AAGAAAAGCC GGGCATAATG GATGAAAAAG AAAAACCAGA CTATGCAACT TGTGTTGATA TTAGTACTGC AGCAGCTCTT AAGCAAATGG TATTACCAGG CTTACTTGCA GTTGTAGTAC CATTATTGGT TGGATTATTG CCAGGATTAG GTAAAGAAGC CCTAGGAGGT CTCCTAGCTG GAGCCTTGGC TTCTGGTGTT ATGATGGCTA TCTTTATGGC AAACTCTGGT GGAGCTTGGG ATAATGCTAA AAAGTACATT GAAGCTGGAA ATCATGGTGG AAAAGGTACT GAGACTCATG CAGCTTCAGT AGTAGGAGAT ACAGTTGGGG ATCCCTTTAA AGACACTTCA GGCCCCAGTA TCAATATTTT AATTAAGCTA ATGACAATTG TTTCCTTAGT TTTTGCTCCC CTGTTTTTCT AA
|
Protein sequence | MSAVAAPIAG IIGLVFAFYL INQVNKREEG SERMQELAAA IQEGASAFLR REYRSLAVFV IVLFVVITIA IDIQTAISFL VGALFSGTAG FIGMTVATRA NVRTANDAQN EGIAQALRTA FSGGSVMGMS VVGLGILGVG ILYLVFEDPE IVNGFALGAS SIALFARVGG GIYTKAADVG ADLVGKVEAG IPEDDPRNPA AIADNVGDNV GDVAGMGADL FESFVGSIIA AMTIGLVQYG VEGVLLPILL GSVGVIASII GYFFVRVREG QKLASALERG TIVSAIIVVI AAFILTNNLL GELGPFYAIL AGLLAGILIG RITEYYTSEH YAPVKGIADS SRTGAATNII SGIAVGMKST FLPIIVITVA IFIAHQVAGL YGIAIAAVGM LATVGMTIAV DAYGPVADNA GGIAEMADLD PEVREITDEL DAVGNTTAAI GKGFAIGSAA LTALALFSAY TQAADIDNID LTSAPVIIGL LLGGMLPFLF SALTMNAVGQ AANQMIDEVR RQIKEKPGIM DEKEKPDYAT CVDISTAAAL KQMVLPGLLA VVVPLLVGLL PGLGKEALGG LLAGALASGV MMAIFMANSG GAWDNAKKYI EAGNHGGKGT ETHAASVVGD TVGDPFKDTS GPSINILIKL MTIVSLVFAP LFF
|
| |