Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0665 |
Symbol | |
ID | 4601623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 614229 |
End bp | 616073 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639773438 |
Product | hypothetical protein |
Protein accession | YP_920070 |
Protein GI | 119719575 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.545372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAGTT TCGAGGTGCT TAAACGCTCC CAGGTTCATG TATGGTGGGA TTTAAAAGAG AACCTCCCAC GGCTTACCGC TTCGAATAGT GAGCCGTGCT TCAGAGTCCG CCTCTCTCCT CCAGGCGACG CAAGGCCCCT CGTCGGCCAC TATTACAAGT TTGTCTTCAG GGTCATCGAG ACATACTTCG GTGGAGACCC TCCCTTTCCC AGGCGTAAGA TAGCCCTGGT GAACAAGGTC CCGTATCCCG ACCTCGCGGA AGAAGTAGTT ATCGACGGTC AGGTAGCCGG GCATCTACTC TACGACCTGA GGTCGGAGCG ATGGCGGTTC AAGCCGCTGT ATGCTGGCGC TGAGGAAGCC CTGAGGAATA GGAAAGGGTA CTACGCTATT GTAGATTTGC CAAGGTTGTC GAGAGGCTAC GTTGTGAAGA AGGACTCGGT AGTAGAGGCT AACCTTCCTG AGGGGGACGA GTACGTAACC ATAGGCACGA AGAGCGGGGA CTTCTACGGC GTAGGAGCTA TGCTCAGGAA CAGAAGGATC TACGTGCTGA AGGCCTGGAG GGCGAGGCCC AGGGCGTGGC TCTCTAGAGA CCCTGACTGG AGTACCGCGG TGCTACGCAA CAGGGAATAC CTGCTCGCAA AGGAGGCTGA AGCCGTAAGC TTCATCAGAG AGGTTGCCGA GAAGTACCGG CTACCCGTCT TCGTATCTCT CTCGGGAGGC AAAGACAGCC TCGTAACTCT CCACCTCGCA GTCAAGGCCC TTGGAAACGA GAAGGTGAAA GCCCTCTTCA ATAACACGGG CCTCGAGTTC GAGGAGACGG TAGAGTACGC TAGGAGAATC GCGGACTACT ACGGCGTTGA ACTCATAGAG GCGGATGCCG GCGACAACTT TTGGAGAGCA CTCCCAGTCA TGGGGCCCCC TGCGAGAGAC TACAGGTGGT GCTGCAAGGT GACTAAGTTC TCGACGATCT CTAGGGCCGT GAAGAAATTC TTCCCAGAGG GAGCCCTGAG CCTCGTAGGG CAGAGAAAGT ACGAGTCGAG CGCTAGGGCC CTCTCTCCAA GAATCTGGCG GAACTACTGG CTTCCAGGCG TGGTGGCGGC AAGCCCGGTG CACGATTGGA GCGCGATGGA TATTTGGCTG TACATCTTCA TGGAGAGGTT GCCGGTGAAC AAACTCTACT ACTACGGATT CGACAGGCTC GGCTGCTGGC TCTGCCCGGC AAGCGAGATG GGAGAACTCG ACCTTTTAAG GATTGTTAAG CCTGGTCTCT ATGATAAGTG GAAGTCCTAC CTGGAGAGCT ACGCCCAGAT GAACGGTCTA GGCGAGGAAT GGGTTAAATT TGGACTCTGG AGGTGGGTTA GACCCCCGAA GGATATACAG AGGATCTGTG TCTCCCAGGT CCAGGCGAGG AGAGGAGCGC GCTACGACAG ACACGCATCA GGCAATACCG TAGAGTACAG GCTTCATAAC CCCTCGGTGC GCATTATCGA GGAGAAAGTT CGAAACCTTT GGCACACCCT TAACAAACCG CTCGAAATAG AAAGCGTACA GGTAACAGGA AATGCCGTAG CGCTGAGATT TAAGCGTGAA GCTACGGCTA GCGAGCTAGA ACTTGTCGAC AGGGTTTTGA TAAGGTCGTA TTTCTGCGTA GAATGTTTAG AGTGCTCCAA CTGGTGCCCC ACGAAGTCTA TAAGCATAGA TTCAGAGAAC GGAGGGATAA AGGTCAACGA ATCCACGTGC ATCCACTGCG GGACTTGTAA CTACAAGTGC CCCGTAGTCG AGTACACGTT TAAGCATCTC GAAATGCTTA AACCGCAACC ACAAACACGC ACTACGGCTT CTTGA
|
Protein sequence | MSSFEVLKRS QVHVWWDLKE NLPRLTASNS EPCFRVRLSP PGDARPLVGH YYKFVFRVIE TYFGGDPPFP RRKIALVNKV PYPDLAEEVV IDGQVAGHLL YDLRSERWRF KPLYAGAEEA LRNRKGYYAI VDLPRLSRGY VVKKDSVVEA NLPEGDEYVT IGTKSGDFYG VGAMLRNRRI YVLKAWRARP RAWLSRDPDW STAVLRNREY LLAKEAEAVS FIREVAEKYR LPVFVSLSGG KDSLVTLHLA VKALGNEKVK ALFNNTGLEF EETVEYARRI ADYYGVELIE ADAGDNFWRA LPVMGPPARD YRWCCKVTKF STISRAVKKF FPEGALSLVG QRKYESSARA LSPRIWRNYW LPGVVAASPV HDWSAMDIWL YIFMERLPVN KLYYYGFDRL GCWLCPASEM GELDLLRIVK PGLYDKWKSY LESYAQMNGL GEEWVKFGLW RWVRPPKDIQ RICVSQVQAR RGARYDRHAS GNTVEYRLHN PSVRIIEEKV RNLWHTLNKP LEIESVQVTG NAVALRFKRE ATASELELVD RVLIRSYFCV ECLECSNWCP TKSISIDSEN GGIKVNESTC IHCGTCNYKC PVVEYTFKHL EMLKPQPQTR TTAS
|
| |