Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0686 |
Symbol | |
ID | 4601886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 634859 |
End bp | 637264 |
Gene Length | 2406 bp |
Protein Length | 801 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639773459 |
Product | protein of unknown function DUF699, ATPase putative |
Protein accession | YP_920091 |
Protein GI | 119719596 |
COG category | [R] General function prediction only |
COG ID | [COG1444] Predicted P-loop ATPase fused to an acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTTAG TCCCCTTGGA GCACCTAGAC GAGGTGCGCG AAGAGCTAGT CAAGGCTAGG AAGAGTAGGC ATAGAAGGCT CTTAGTGATA ACCGGGGACG ACGACTCGAG GCTCGTCACC ACAGCTCTGG ACTTCATATA CAATGTTAAA GACCTGTTGT CAGGAGAGAA GGTGCTGTAC ACCTACCACG CCTTCTACTC GGACGGCGCT ATGCGCAAAG AACTTTTTGA GAAAGGTGTT CCTCGCGAGC TTTCCGTGGA CTACGTGTCG TACCACAAGT TGGACGAGGT TCTAGGGAGG ACTTACGCCG CCGCTGTGGC GGACCTCGTG AACAACCTTG AGCCGAACGA CCTTGGCAGG GTAATGGGGG TCGTTGAAGG AGGCGGGCTT TACATCTTCC TGCTTCCATC CTTTACGCGT CTCTTAGAAA CGGTCACAAG GTTTCAGAGC AACCTGATAG TTCCTGGCTA CACGGATAAG GATCTTAAGA GGTACTTCGA GAAGCGCTTC ATAAAAAAGG TGATGGAACA CCAGGGAGTA GCCGTCTACG ACGCGGATAA CAGGTACTGG GTCAAGAAGT TCGGGAAGAC CCCCTCCACA CCGTACGCTA GACCTAAGCC AGTACTACCG CAGAAGAGCA AAATACCGGT GAAGGTCTTC AACCTTGCTC TAACGCAGGA CCAGGTAGAG GTACTTAAGA TCTTCGAACA CTTCTACGCA AAGGCGGAGA AGGAGAAGCT CGTATTCGTT CTAACGGCGG ATAGGGGCAG GGGGAAGTCC TCAGCCGTGG GTCTAGGGGT TGGCTGGCTG GCCCACAGGC TTAGAAGGGC GAAGGGTAAA TGCAAGGTTG TCGTAACGGC GCCCGCGGTG ACGAACGTCC AGGAAGTGTT CCGCTTCTCG GCAGCCGTCC TGGACCTCTT CAAGCACAAG GTGGAGGTGC TAGAGGACGA GTCCGGTATG ATAACCAAGT TGTTGTCTAA AGGCATCGAA ATAGAGTACG TGACGCCCCT AGACGTTTTG AAGGCTAAGG GCGACTTGCT CGTAGTGGAC GAAGCCGCCT CGATACCCGT ACCTCTCCTC TTCAAAATGC TGAAGCGGTT CAACAAGGTT GTCTACTCCT CTACGATACA CGGCTACGAG GGAGCCGGCA GGGGTTTCTC GCTGAGATTC CTCAAGCGCT TGAAGAACGA GGAAGGTGTA AAGCTTTACG AGTACGAAAT GTCGGAGCCC ATACGCTACG CTCCGGAGGA CCCAATAGAG AAGTGGACTT TCGACTTACT GTTGCTGGAC GCGGAACCGT GCGAGATAAC GGAGGACGAT CTCTCACTGG TAAGCGCCGG AGAAGTTTAC TACGACGCGC CAAACGAGGA GGAGCTTTTC CTAAAGAACG AGGAGGAGCT TAGGCAGTTC TTCGGGATAT ACATCATGGC GCACTACAGG AACAACCCCA ACGACCTAGG CATAATGATG GACGCACCGC ACCACTTCCT GCGCATGGTT AGGCTGAAGA ACGGTAAGAT TGTAGTCTCG TTGGAGCTGG CGTCCGAAGG GAACCTCGGA GAAGACCTCT CCAAGGAGTC GGCGAAGGGG GCGTGGCTAA TGGGTAACAT CATACCCGAC CGGCTCATAA AACACTACAA GATACTCGAC TTCGGCAATC TAAGAGGGAT ACGCGTCGTG AGGATAGCGA CGCACCCCTC CGTTATGGGG AAAGGTCTCG GTAGCTTTGC CCTTAGCAGG CTAGAGGAGG AGGCACGCAG AAACGGTTAC GACTGGGTAG GCGCGGGCTT CGGGGTTACC TACGAGCTCC TTAAGTTCTG GCTTAAGAAC GGCTATATAC CAGTTCACAT GAGCCCTGAA AAGAACCCTG TAAGCGGAGA ATATACGGTC ATAGTTGTAA AGCCTTTAAG CGAGAAGGCT AAAAGAATAG TCGACGTGAT AGCCAAAGAG TTCAAGCAAA AGCTTCTAGG CTCGCTGGCA TCGCCTTACT TCGACTTAGA ACCTGAGGTA GCGCTTCTTC TTCTAAAGTC TACCCCGAGT TTCGAGGTAA AAGTCAACCT AACCAAGCTA CAGCTTGCAC GCTTCCTGAC GTATGCGTGG AGCGACATGA CGCTTGAGAA CTGCATCGAC GTGGTAGGTA TAATGACGCG CCTATACTTC CTATCCAAGA AAAAGCCCTC TCTAAGCGAG CTACAGGAGC TTCTACTGGT CTCCAAGATC TTGCAGGCGA AGAGCTGGCA TCTGACGTGC CAAGAGCTAA ATCTCAGCCT CGCAGAGGCG ACGAGCAACA TGAAACAGAT TGCGCAAATA TTTTCGAAGG AGTTTCTGGG AGTGAACAGC GAGGAGGAGG CTCTAAGGTA CTTTTTCCTC AGAATGGACG ACCTCAACGA GGGGGTTAGT GCCTGA
|
Protein sequence | MPLVPLEHLD EVREELVKAR KSRHRRLLVI TGDDDSRLVT TALDFIYNVK DLLSGEKVLY TYHAFYSDGA MRKELFEKGV PRELSVDYVS YHKLDEVLGR TYAAAVADLV NNLEPNDLGR VMGVVEGGGL YIFLLPSFTR LLETVTRFQS NLIVPGYTDK DLKRYFEKRF IKKVMEHQGV AVYDADNRYW VKKFGKTPST PYARPKPVLP QKSKIPVKVF NLALTQDQVE VLKIFEHFYA KAEKEKLVFV LTADRGRGKS SAVGLGVGWL AHRLRRAKGK CKVVVTAPAV TNVQEVFRFS AAVLDLFKHK VEVLEDESGM ITKLLSKGIE IEYVTPLDVL KAKGDLLVVD EAASIPVPLL FKMLKRFNKV VYSSTIHGYE GAGRGFSLRF LKRLKNEEGV KLYEYEMSEP IRYAPEDPIE KWTFDLLLLD AEPCEITEDD LSLVSAGEVY YDAPNEEELF LKNEEELRQF FGIYIMAHYR NNPNDLGIMM DAPHHFLRMV RLKNGKIVVS LELASEGNLG EDLSKESAKG AWLMGNIIPD RLIKHYKILD FGNLRGIRVV RIATHPSVMG KGLGSFALSR LEEEARRNGY DWVGAGFGVT YELLKFWLKN GYIPVHMSPE KNPVSGEYTV IVVKPLSEKA KRIVDVIAKE FKQKLLGSLA SPYFDLEPEV ALLLLKSTPS FEVKVNLTKL QLARFLTYAW SDMTLENCID VVGIMTRLYF LSKKKPSLSE LQELLLVSKI LQAKSWHLTC QELNLSLAEA TSNMKQIAQI FSKEFLGVNS EEEALRYFFL RMDDLNEGVS A
|
| |