Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0738 |
Symbol | |
ID | 4601145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 685436 |
End bp | 687424 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639773514 |
Product | hypothetical protein |
Protein accession | YP_920143 |
Protein GI | 119719648 |
COG category | [S] Function unknown |
COG ID | [COG2433] Uncharacterized conserved protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000149054 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGTTCA GGCGGGTGCT AGGGCTGGAT ATACTGCCGG GTAGCTCTCC CCTCGGGAGG CAACACCTCT TCGCCGCCGT GCTCCTGGTG GACGGCAGGG TGGAGCAGAG GGTTCGGGAA GCCTCCCTAG AGGACGTCGT GAGGCTCGCC ACCTCGGGCG GGGTCGAGGC GCTGGCGCTG GACAACGTGT TCGAGCTGGC GCCCACCGTG GAGGGGCTCG CGGAGTTCTT GAGGCTGTTC CCGGGGAGGC CGCCGCGCCT GATCCAGGTG ACAGTCGTCA ACGGCGAGGA GGTTAGCGTC GAGACCCTCT GCGCCGTTAC GGGGCTGTGT AGGGGTAGGC TCGACCCCCT CGGGACGGCG GAGGCGTGCG CCCTACTCGC GTACGCCGGC GTCGGTAGCG AGGTCCTCGT CTTTCACGAC GAGACGGTGG TGCACGTTGG CCGCGGCAGG GTGCCCGGGC AGGGAGGGAT GAGCAGGGAG AGGTTCAAGA GGGGCATCGA GGTCCTCGTG AAGCGGAAGG TTAGGGAGAT AGCGGAGGCG CTACAGAGAA AGGGGCTCGA CTTCGACGTT TTCCCCAGGA AGAGCGGGGA GGGGCTCGTC GGCGCGACGT TCATCGTGTA CGCGCCGCGC GAGGAACTCA ACGGCGTCGT TAAGAGCGAG GAGGGGCACG ACCTCTTCGT CAGGGTGGAG CCTGCGAGGA GGGATAGGGT GGAGTTCAGG CCCCTGGGCT CCAGGCTCCA CAGGGCGCTC TCCCAGGAGC GCCTCCTCAT AGTGGGCGTC GACCCGGGGA TGGCCACGGG CTTCGCCGTG CTGGACTTCT CCGGGAGGGT GCTCGCGGTC GACAGCAGGA GGCTCCTCGG CAGGGGGCAG CTCGTCAGGG AGCTCTACGG CTTCGGGAGG CCGGCTATAG TCGCGACGGA CGTTAACCCT CCACCCGCCT ACGTGAAGAA GCTGGCGTCG ACGCTCGGCG CCGTGCTGTA CGTCCCCAGC CGCTCGCTGA GCGTGGAGGA GAAGCGGAGG CTCGCGCTGG AGGCCGCGGG GCAGCAGGGG GTGAGGCTTA GGACTTCCCA CGAGAGGGAC TCCCTAGCCG CCGCCTACAA GGCTTTCCTG TCCTACAGGG AGCTCTTCGA GGAGGTGGAG AGGGAGGCTG CTAGGTACGG GGTGCCGTTC TCCCTGGACG AGGCGAAGCT CCTCGCGGTA AAGGGTAAGC CCGTGGCCCT CGCCGTCGAG GAGGCTCTGA GGAGGCAGGT GGGGGTGAAG ATCCCGAGGA TCGAGCTCCA GAAGGAGGCG GAGGAGCCCC GGAGGGTCGA GGAGGAGCTG GAGGAGGTAC GGCGCGCCCT CTCGGAGCTC CTCCGGGAGA ACGTCGAGCT TAGGCGGAGG CTTGAGGAGG CGGAGGAGAG GGCTAGGAGG AGCGAGGAGG CGCTTAGGGG GCTGCTGAGG GCTAGGGAGG TCGCGAGGGG CCTCGAGTCG GAGTACGCGA AGCTCCGGGC GAGGATAGAG CTACTCCAGT CGGAGCTAGA CTCCTTCAGG AGGGAGCTAG CAGAAAAGGA GAGGGCGCTG GAGTCCCTGG GCGACGCGTT GCTCTCCTAC CTCTCCGGGG AGGCGGTAGT CGCGGTGAGG CTTTCCTACG TCCTCGAACG TGGCGCCGCG AGGGTTCCGG CGGTCTACGT TGACAAGCAG TTGCCGTCCG ACTCCCTGAG GAGGGTTCTC GAGGAGCTGA AGCCGCCGGG CTCTCATCTC ATCGCCTTCT TCGAGGGGGC GGCGCGCGGG GCCGCCGAGA GGCTCCCGCT CGGCGTGGTT CCGGTCGCCC TCGGAGAGGT TAAGCCTTTA GCCGAGGTTG GACCCTTCGT CTTCGTGCCT GCCGAGACAG CGCTCGGCGC CGCCACTGCT TCGCGCGACG CCGATAAGGA GAGGCTTAGG CGCCTCCTGG AGGACTACAG GTTGCAGAGA AAGAGGGAGT TGGAGGGGCT TTCCGCCGGG CGCCTCTAG
|
Protein sequence | MAFRRVLGLD ILPGSSPLGR QHLFAAVLLV DGRVEQRVRE ASLEDVVRLA TSGGVEALAL DNVFELAPTV EGLAEFLRLF PGRPPRLIQV TVVNGEEVSV ETLCAVTGLC RGRLDPLGTA EACALLAYAG VGSEVLVFHD ETVVHVGRGR VPGQGGMSRE RFKRGIEVLV KRKVREIAEA LQRKGLDFDV FPRKSGEGLV GATFIVYAPR EELNGVVKSE EGHDLFVRVE PARRDRVEFR PLGSRLHRAL SQERLLIVGV DPGMATGFAV LDFSGRVLAV DSRRLLGRGQ LVRELYGFGR PAIVATDVNP PPAYVKKLAS TLGAVLYVPS RSLSVEEKRR LALEAAGQQG VRLRTSHERD SLAAAYKAFL SYRELFEEVE REAARYGVPF SLDEAKLLAV KGKPVALAVE EALRRQVGVK IPRIELQKEA EEPRRVEEEL EEVRRALSEL LRENVELRRR LEEAEERARR SEEALRGLLR AREVARGLES EYAKLRARIE LLQSELDSFR RELAEKERAL ESLGDALLSY LSGEAVVAVR LSYVLERGAA RVPAVYVDKQ LPSDSLRRVL EELKPPGSHL IAFFEGAARG AAERLPLGVV PVALGEVKPL AEVGPFVFVP AETALGAATA SRDADKERLR RLLEDYRLQR KRELEGLSAG RL
|
| |