Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0720 |
Symbol | |
ID | 4601485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 669256 |
End bp | 670311 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773496 |
Product | hypothetical protein |
Protein accession | YP_920125 |
Protein GI | 119719630 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTCGG GCTTGGCGTA CAGGTTACTG CGAGCCGCCG CCGGTAGGCG AATCTACGCG CTAACGGGAG ACCTCGCAAT CCTAAGCCCC TTCGCAGTCC TGCCACCCTC GGAGGAAAGC GGGGGGCTCT TCCTGGGCAG GGACGATAGA GGAAGAAACG TGTACTTGAA CCCCGAAAAG CTACCGAATA TGCACGGGGT CATCCTGGGG ACCACTGGGA GCGGGAAGTC GAGCCTAGCG AGGCACATTA TGCTCGAAGC GAGGAGGCTG GGCGTCACGT CGTGGGTCAT CGACCCTCAC TCCGAGGCTA CTTACAGGAA GCTCTTCGAG AGGTCTTTCG GGCTCGGCGA CTTCAGGGTG AACGTTCTCG AAGCGCCGGG GTGGGGGGCC AGCGAGCTGG CCTCGGAGCT GAGCAGGTAC ATCGAGGCTA TCTACGGCTT TCCCGGCTAT AGGAGCATCC TCAGGGAGAT CCTAAAGAGG TGCTTCGAGG AGGGAAGCCT AGAGTTCTTC GAGAAAGTGT CCAAGGAGGA TCCGAACCTC CTCAGGATAT ACGACGACCT GTCGAGAATC CACTCCAATG AGGGGGCATC GATCGCGGAG CTCACTAGGG ACGTTTACTT CTACTACCCG GCGCTGGTCT CGAGGGAGTT CCTGGCCCTC AGCTCGCAGA TACTCTTACT GCTCCTCGAA GGCTACATGA GGACTAAAGG TGCTAGACAC AGACTGGAGC ACCTGGTAGT ACTCGAAGAG GCCCACTTAG TCAAAGACTA CATCCTCTCC CTATTCAAGC AAGTTAGGAA GTACGGGTGG GGGTTACTAG CAGTAACGCA GCTCCCCCGG GAGATGGACC CAAGAGTGTA CCAGCTCGCG GGCTTCCTCG TAGTCCTCTC CGGCCCCGAG AGCTTCGTCC TAGACGTTGC GCGCGTGGTG CAGTTGACGA GGCTCGACTA CGACCACCTA CTCTACAGCG CCCGGGGCAA CGCGCTCCTC GTGAGGCAGG GAGACCCCAG GCCCCGCAGG ATCCACCTCG AGCTACACTC TTCGGCGTTA TCGTAG
|
Protein sequence | MTSGLAYRLL RAAAGRRIYA LTGDLAILSP FAVLPPSEES GGLFLGRDDR GRNVYLNPEK LPNMHGVILG TTGSGKSSLA RHIMLEARRL GVTSWVIDPH SEATYRKLFE RSFGLGDFRV NVLEAPGWGA SELASELSRY IEAIYGFPGY RSILREILKR CFEEGSLEFF EKVSKEDPNL LRIYDDLSRI HSNEGASIAE LTRDVYFYYP ALVSREFLAL SSQILLLLLE GYMRTKGARH RLEHLVVLEE AHLVKDYILS LFKQVRKYGW GLLAVTQLPR EMDPRVYQLA GFLVVLSGPE SFVLDVARVV QLTRLDYDHL LYSARGNALL VRQGDPRPRR IHLELHSSAL S
|
| |