Gene Information |
Locus tag | Tpen_0337 |
Symbol | |
ID | 4601704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 306683 |
End bp | 309514 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639773097 |
Product | V-type ATPase, 116 kDa subunit |
Protein accession | YP_919749 |
Protein GI | 119719254 |
COG category | [C] Energy production and conversion |
COG ID | [COG1269] Archaeal/vacuolar-type H+-ATPase subunit I |
TIGRFAM ID | |
| |
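The coordinate and length fields above are related by simple arithmetic, sketched below. This is an illustrative check only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table above.
start_bp = 306683
end_bp = 309514

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (see "Is gene spliced | No"), so the
# whole span is coding and divides evenly into codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (2832 bp) and Protein Length (943 aa).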
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGGTAC CAGAGAAAGT TGCTAGGTTC AGGGTGGCGG CGCCCGCTGA ATACGAGGTA GCGTTGCTGG ATGCATTAGC CAGCATAGGC GAAGTTCACC TAGAGCCAAG CCTTGGTGGT GAGAGAAGCG CACTGCCCTC GCTGTACGTG GACGTTCTAG AGGGAAGAGT TAACTACGCG AACTTAAACG TGGAGGAGGC TCTCGAGGTT ACCAGGAGAA TTCTAAGGGA GGGAGACCCT CTTCTCTCGA AACTCGAAGG CAAAGTCGCA GAGCTGAGAA ATATAGAGGA AATGCGCGTA GTTGTGGAAA GGCTGGACGG CATGGGGATT TCTCCCGACA GACTCGGAAG GGAGTCGCTC GGAATAGTAA CGGACTACGT CTTCGTATCC GACGAAAACG TCACAGACGC TGTACATGAG TTCCTGAGAG CAGGCGCGAT CGTTAGGAGG AGTAGAGTGT CCCACGCTAA GCACATACTA GTCTTAATCT ATGGCTCTGA CAAAGCGAGC AGTGTTAACG CGTTGAAGGC TAAGTATGGC TTGCAGATAG CCTTACCTTC GTGGTTCTTC GATAAAACGG AGTCTGTTCT GAGGAGACTC GAGGAAGAGG AAACAAGGAT CAGGAAGGAG ATAATAGACA TTATAATCGA GGTCTCCAGC GTTCTTAGAG ACGCGTTCGA ATTCGAGCAC GCAACGCGCC TAGAGATAGT TTCAAACGCG CTAAGAGCAA TTGAGAACTC TGAAAAGCAG GTAGACAAGC TCGAAAAAAT ATTCTACGAC ACGTTGGGCT TGCTACTCGG GTACCGTTTC TGCTTGGAAA AGAAAATGTT GCTTGCTAAG TACGGTGTGA AGAAAGCTAT CTGCGAGCTT TCGCGTAAAA TACTGATCGA TGAACCAGTA GAGCCCGGAG AGTTCGAAAA ACTCCTCTCA TCGCTGTTCT CCCAGGAAGA GGCGGCGGCA CAGAGGATGG AGCTTGAAAA GCTCAACAGG CTTCTCGTTA TACGAAAACT ACTGAGATCC ACCATGCTCT CCGGTAGCTC TGGGAAAGCG TTCGTGGTAT CCGCCGGGGA TAAAAAGCTG GAGGAAGCTA TCGCCTCCGC CCCGGAGATC TATGGTGCTA AGGTTGTGGG AGAACTCCGT GAAAGAGGAG TCATAGCAGT TGTATTCGAG GTCGCAGAAA CCCTGGAAGA GGCTTACGCC AACTTCCTAC GCGATAAATT CAGGGCAAGC GTCGTCGAAA TCTCAGACCC AAGCAAGGAG CTGGAAGAAG CTCTGGCAAA GCTGGAAAGA AGCATCTTCT CGGCGAAGAA CAGTATCCTT TCCAGCATCA TTTCGAAAAG CCTAAGGAAG TTCAAAGTCG ACGTGAAGAA GGTACTGGAG GAACTTGGAG AGCAAGAACT CCGCGAAGTA TACAGCTATA TAGCGAAAAT GAGGAGGGGG GTCGAGCAGA AAAGCATACC GGAGGGGGAC AGATTCGTCT ACCTCAAGAC GTTAATCGGG CTAGCCGACG AGACGTCTGA AGAAATACTG CACCTTGAGG AAGAACTGAA CGTTGCGAAG AGCCTTCCCT CGGAGGCTCT GGCAGATAGT ATAAGAAAAC TGGAGACCTC GCTCGTCAGT GTCAGTAGAA GGCTTGCAGA AATCCTATCG TACAAAGCCG TAACAGAGGC AATGTACAGA GCGCACCCCT TACTCAGCGA GGTACGCATA TTTAGAAGCC GGCGCATAGT AGTGGTAGAA GGCTACGTAC CGGTTAAGTA CGTCGGTCTA CTGGAAGACT CTCTTAGAGG GAAAGTACCG 
CGGCTACTCT ACTTCAAGTA CTCGGAGGTA CCGAGGTCTG CCGGAGCTCC TACGTACATC GAGAGAAGGG GGCTAAAGAA GTATCTCTAC TCGCTGACCT CTATGAGAGG GACGCCGGCA TACTGGGAGA TAGACCCAAC ACTGATATTC ACAGCTATGT TCGTTGTCAT GTACGGAATG ATGTTCGGAG ACATAGGACA AGGACTTGTC CTATCAGCCT TCGGTGCGTG GTTACTTAAG ACCAAGTACA GGTTGCTGGG AATAACCAGC GAGGGCGCCG CCACGCTGGG AGCCCTCTCA CTGATGGCTG GTATTTCCAG CATGGTTTTC GGCGCTGTTT ATGGCTTCAT GTTCTTTTTA AAGCCGCTCG CCCACCCCAT CATCGCACCC ATACACGACA TATACGAAAT CATAGCGGTG GCCCTATGGT TCGGCGTGGC TCAACTCGTA GTCGCTATGG CGCTGAACAT GGTTAATCTG TGGAGGATGG GGGACAATAT AGGAGCTGTT TTCAGCGGCA TGGGCGGCTT AGGGCTACTC TTCTACCTTT CGGGAGTAGT GGTAGCATAC AACCTCGCAA CGACAGGCTT TAACCTGGCG GTGCTCTCCT CTCCGTCGCT AACCCCCTTC TTGCTAGCCA TTCTCGCGTC GATTCTAGGC GTGCTGGGTT ACGGCTTGTA CGAATCCATC CACGGGGGTG AAAAAGAGAA GATCATGCAT GCGGTAAGCG AGGTAATAGA GATGATCATA GCGCTACCCG CTAACTCCCT CTCGTACATA AGGCTTGCAG CATTCGCTAT GGCGCACGAG GCCTTCGGGA TACTCGCGGA AAACCTGACG CCTTCCGTTG GCGAGATCGC AAGCTACGCG GTGGCGAACC TTCTAGTACT CGGTATAGAG GGGCTCGCCG TGGGCATACA AGCGATGCGT CTAACATACT ACGAGTTCTC AACGAAGTTC TTTAAAGGAG TGGGCGTCGA GTTCAAACCT ATTTCAACGC GCATAAGGTT TGTCACCGAA AGTAGTCAGT AA
|
Protein sequence | MLVPEKVARF RVAAPAEYEV ALLDALASIG EVHLEPSLGG ERSALPSLYV DVLEGRVNYA NLNVEEALEV TRRILREGDP LLSKLEGKVA ELRNIEEMRV VVERLDGMGI SPDRLGRESL GIVTDYVFVS DENVTDAVHE FLRAGAIVRR SRVSHAKHIL VLIYGSDKAS SVNALKAKYG LQIALPSWFF DKTESVLRRL EEEETRIRKE IIDIIIEVSS VLRDAFEFEH ATRLEIVSNA LRAIENSEKQ VDKLEKIFYD TLGLLLGYRF CLEKKMLLAK YGVKKAICEL SRKILIDEPV EPGEFEKLLS SLFSQEEAAA QRMELEKLNR LLVIRKLLRS TMLSGSSGKA FVVSAGDKKL EEAIASAPEI YGAKVVGELR ERGVIAVVFE VAETLEEAYA NFLRDKFRAS VVEISDPSKE LEEALAKLER SIFSAKNSIL SSIISKSLRK FKVDVKKVLE ELGEQELREV YSYIAKMRRG VEQKSIPEGD RFVYLKTLIG LADETSEEIL HLEEELNVAK SLPSEALADS IRKLETSLVS VSRRLAEILS YKAVTEAMYR AHPLLSEVRI FRSRRIVVVE GYVPVKYVGL LEDSLRGKVP RLLYFKYSEV PRSAGAPTYI ERRGLKKYLY SLTSMRGTPA YWEIDPTLIF TAMFVVMYGM MFGDIGQGLV LSAFGAWLLK TKYRLLGITS EGAATLGALS LMAGISSMVF GAVYGFMFFL KPLAHPIIAP IHDIYEIIAV ALWFGVAQLV VAMALNMVNL WRMGDNIGAV FSGMGGLGLL FYLSGVVVAY NLATTGFNLA VLSSPSLTPF LLAILASILG VLGYGLYESI HGGEKEKIMH AVSEVIEMII ALPANSLSYI RLAAFAMAHE AFGILAENLT PSVGEIASYA VANLLVLGIE GLAVGIQAMR LTYYEFSTKF FKGVGVEFKP ISTRIRFVTE SSQ
|
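The gene sequence begins with GTG while the protein sequence begins with M. A minimal sketch of how that can arise under translation table 11 follows. The function name `translate` and the constant names are hypothetical, the start-codon set lists only a subset of the initiation codons allowed by table 11, and only the first and last few codons of the record are used as input.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 initiation codons. When one of these
# is the first codon of a CDS it is read as methionine.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, first_is_start=True):
    """Translate a DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
head = translate("GTGTTGGTAC CAGAGAAAGT TGCTAGGTTC")
# Last 15 bases of the gene sequence above (not a start, so no M substitution).
tail = translate("GAAAGTAGTCAGTAA", first_is_start=False)
print(head, tail)
```

The first ten residues and the final residues produced this way can be compared directly with the beginning and end of the protein sequence listed above.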